Reddit r/LocalLLaMA

Gemma 4 31B QAT Q4 vs standard Q4 — Top1 KLD benchmark results have me confused. Someone please explain or poke holes in this.

Signal: 45 · Hype: 25
In three lines: A user benchmarks Gemma 4 31B QAT Q4 against standard Q4 quantization on CPU (Xeon Platinum 8358). Measured by KLD (Kullback-Leibler divergence) over 5000 wikitext-2 tokens, Q4_K_M outperforms QAT Q4, and QAT Q4 also loses to standard Q4_0. The results are counter-intuitive but reproducible (3 runs, std dev ±0%).
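
For readers unfamiliar with the metric, here is a minimal sketch of how a per-token KLD comparison between a reference model and a quantized model could be computed. It is not the poster's script or any tool's actual implementation. The function names (`softmax`, `kl_divergence`, `compare`) and the toy logits are hypothetical, and the top-1 agreement figure is included only because the post title mentions "Top1"; the exact definition used in the post is an assumption.

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) in nats; eps guards against log(0) in this toy setting."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q) if pi > 0)

def argmax(values):
    """Index of the largest element."""
    return max(range(len(values)), key=values.__getitem__)

def compare(ref_logits, quant_logits):
    """Return (mean KLD, top-1 agreement rate) across token positions.

    ref_logits / quant_logits: one logit vector per token position,
    from the reference model and the quantized model respectively.
    """
    klds = []
    top1_matches = 0
    for ref, quant in zip(ref_logits, quant_logits):
        p, q = softmax(ref), softmax(quant)
        klds.append(kl_divergence(p, q))
        if argmax(p) == argmax(q):
            top1_matches += 1
    n = len(klds)
    return sum(klds) / n, top1_matches / n

# Toy data (hypothetical): two token positions, vocabulary of three tokens.
reference = [[2.0, 1.0, 0.0], [0.0, 1.0, 2.0]]
quantized = [[2.0, 1.0, 0.0], [2.0, 1.0, 0.0]]
mean_kld, top1_rate = compare(reference, quantized)
```

Lower mean KLD means the quantized model's output distribution stays closer to the reference; a quant can score well on one of these two numbers and poorly on the other, which is one place to look when results seem counter-intuitive.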
Tags: Gemini, Benchmarks, Evals

Summary generated by Claude — human-verified