Reddit r/LocalLLaMA

Gemma 4 12b 8Q Heretic Oneshot Coding

Signal 65 · Hype 25
In three lines: A Gemma 4 12B Heretic model was tested on code generation, building a retro game from a single prompt (45k tokens total). Throughput held steady at 18.44-18.93 t/s, with 4,372 tokens of code generated in about 4 minutes. Cache reuse was 91.7-96.4% on llama.cpp with a Ryzen 9 9950X and an RX 6800.
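
As a rough sanity check on the figures in the summary, the quoted throughput range and token count can be turned into an expected generation time. This is only a sketch: it assumes a constant decode rate and ignores prompt processing, and the names `TOKENS_GENERATED`, `THROUGHPUT_RANGE_TPS`, and `generation_minutes` are illustrative, not taken from the original post.

```python
# Figures quoted in the summary; everything else here is an assumption.
TOKENS_GENERATED = 4372
THROUGHPUT_RANGE_TPS = (18.44, 18.93)  # tokens per second, low and high


def generation_minutes(tokens: int, tokens_per_second: float) -> float:
    """Minutes needed to emit `tokens` at a constant decode rate."""
    return tokens / tokens_per_second / 60.0


# The lowest throughput gives the longest time, and vice versa.
slowest = generation_minutes(TOKENS_GENERATED, min(THROUGHPUT_RANGE_TPS))
fastest = generation_minutes(TOKENS_GENERATED, max(THROUGHPUT_RANGE_TPS))

print(f"{fastest:.2f} to {slowest:.2f} minutes")
```

Under these assumptions the result lands just under four minutes, which agrees with the "about 4 minutes" reported for the 4,372-token generation.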
Gemini · Code generation · Open source · Benchmarks

Summary generated by Claude — human-verified