Back to feed
Reddit r/LocalLLaMA·

New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both!

Signal
72
Hype
25
In three linesLocal test of Gemma 4 12B vs 26B-A4B on RTX 4090: HTML5 canvas animation with physics. 26B uses 15 GB VRAM (6.9k tokens, 138 tok/s), 12B uses 9 GB (8.9k tokens, 80 tok/s). 26B-A4B outperforms 12B (~1.7x faster with 4B active params), but 12B remains competitive on 16 GB.
Read source
Your take?
GeminiBenchmarksCode generation

Summary generated by Claude — human-verified