Back to feed
Reddit r/LocalLLaMA·

How to compare Original vs QAT Gemma 4 31B Q4 quants

Signal
35
Hype
15
In three linesDiscussion on methodology for comparing Gemma 4 31B original vs QAT-retrained Q4 quantizations. Author proposes benchmarking unquantized versions first (SuperGPQA, HLE, MMLU) then measuring divergence of each Q4 against its own reference, rather than direct cross-variant comparison.
Read source
Your take?
GeminiBenchmarksEvals

Summary generated by Claude — human-verified