Add benchmark results for GLM-4.6 quantized models #4813

Open
vicruz99 wants to merge 1 commit into Aider-AI:main from vicruz99:benchmark-glm4.6-results-quantized

Conversation


@vicruz99 vicruz99 commented Feb 7, 2026

Benchmark Results: GLM-4.6 (Unsloth Quantized Versions)

This PR adds benchmark results for the Unsloth quantized versions of the GLM-4.6 model.

Model Details

  • Model: GLM-4.6 (Unsloth GGUF quantizations)
  • Quantizations tested:
    • Q5_K_XL (5-bit)
    • Q3_K_XL (3-bit)
    • Q2_K_XL (2-bit)
  • Inference engine: llama.cpp server
  • Hardware: 4x A100 GPUs (local)
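
For reference, Unsloth's dynamic (UD) GGUF quants of this kind are typically fetched with huggingface-cli. This is a minimal sketch only: the Hugging Face repo name unsloth/GLM-4.6-GGUF and the local download path are assumptions for illustration, not taken from this PR.

# Download one UD quant from Hugging Face (repo name and path assumed)
huggingface-cli download unsloth/GLM-4.6-GGUF \
    --include "*UD-Q3_K_XL*" \
    --local-dir /scratch/vicstorage/UD-Q3_K_XL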

Configuration

Models were served locally using llama.cpp with the following settings:

./llama.cpp/bin_guadiana/llama-server \
    --model /scratch/vicstorage/UD-Q3_K_XL/GLM-4.6-UD-Q3_K_XL-00001-of-00004.gguf \
    --jinja -ngl 99 --threads -1 --ctx-size 65536 \
    --temp 1.0 --top-p 0.95 --top-k 40 --prio 3 \
    --host 0.0.0.0 --port 8080 \
    -kvu --cache-ram 0
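
Once the server is up, llama.cpp exposes an OpenAI-compatible HTTP API on the configured host and port. A quick sanity check (these are standard llama-server routes, not specific to this PR):

# Probe the server, then issue a one-shot chat completion
curl http://localhost:8080/health
curl http://localhost:8080/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{"messages":[{"role":"user","content":"hello"}],"max_tokens":16}'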

Key parameters:

  • Context size: 65536 tokens
  • Temperature: 1.0
  • Top-p: 0.95
  • Top-k: 40
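
With the server running, runs like these are driven by aider's benchmark harness pointed at the local endpoint (usually from inside the benchmark Docker container). A hedged sketch; the run name is illustrative, and openai/glm-4.6 assumes the generic OpenAI-compatible provider prefix:

# Point the benchmark harness at the local llama.cpp server
export OPENAI_API_BASE=http://localhost:8080/v1
export OPENAI_API_KEY=sk-local-placeholder  # llama.cpp ignores the key
./benchmark/benchmark.py glm-4.6-ud-q3-k-xl \
    --model openai/glm-4.6 --edit-format diff --threads 1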

Note on Timeout

These benchmarks were run with an increased timeout setting. A companion PR (#4650) adds an option to set custom request timeouts in benchmark.py. For these runs, the timeout was set to 10000 ms in the LiteLLM configuration.
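
Since #4650 is not part of this PR, the exact option name may differ; as a sketch only, the intended usage looks something like the following, where --timeout is hypothetical:

# Hypothetical flag from PR #4650; value mirrors the 10000 ms used for these runs
./benchmark/benchmark.py glm-4.6-ud-q3-k-xl \
    --model openai/glm-4.6 --edit-format diff \
    --timeout 10000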

Results

The benchmark results are included in the data files updated in this PR.

@CLAassistant

CLAassistant commented Feb 7, 2026

CLA assistant check
All committers have signed the CLA.

