Reddit r/LocalLLaMA · 7 June 2026

Gemma 4 31B QAT Q4 vs standard Q4 — Top1 KLD benchmark results have me confused. Someone please explain or poke holes in this.


In three lines: A user benchmarks Gemma 4 31B QAT Q4 against standard Q4 quantization on CPU (Xeon Platinum 8358). On a KLD metric over 5000 wikitext-2 tokens, Q4_K_M outperforms QAT Q4, which also loses to standard Q4_0. The results are counter-intuitive but reproducible (3 runs, std dev ±0%).
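For readers unfamiliar with the metric, here is a minimal sketch of how a KLD comparison with top-1 agreement is commonly computed between a reference model and a quantized one. It is not the poster's script: the function names (`softmax`, `kl_divergence`, `compare`) and the toy logits are hypothetical, and it assumes both models were run on the same token positions.

```python
import math

def softmax(logits):
    # Numerically stable softmax over one token position's logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    # KL(p || q): how much the quantized distribution q diverges
    # from the reference distribution p. eps guards against log(0).
    return sum(pi * math.log((pi + eps) / (qi + eps))
               for pi, qi in zip(p, q) if pi > 0)

def compare(ref_logits, quant_logits):
    # Hypothetical helper: returns (mean KLD, top-1 agreement rate)
    # across all token positions.
    klds = []
    top1_matches = 0
    for ref, quant in zip(ref_logits, quant_logits):
        p, q = softmax(ref), softmax(quant)
        klds.append(kl_divergence(p, q))
        ref_top = max(range(len(p)), key=p.__getitem__)
        quant_top = max(range(len(q)), key=q.__getitem__)
        if ref_top == quant_top:
            top1_matches += 1
    n = len(klds)
    return sum(klds) / n, top1_matches / n

# Toy data (made up for illustration): two token positions, vocabulary of 3.
reference = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.3]]
quantized = [[1.9, 1.1, 0.1], [0.4, 2.4, 0.5]]
mean_kld, top1_rate = compare(reference, quantized)
print(f"mean KLD = {mean_kld:.6f}, top-1 agreement = {top1_rate:.2%}")
```

Lower mean KLD and higher top-1 agreement both indicate that the quantized model tracks the reference more closely, which is the sense in which one quantization "outperforms" another here.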


Gemini Benchmarks Evals

Summary generated by Claude — human-verified
