Reddit r/LocalLLaMA

Gemma 4 31B's competence surprised me

Signal: 45, Hype: 35
In three lines: A researcher tests Gemma 4 31B on understanding complex, niche academic code. Gemma 4 31B outperforms Qwen 3.6 (27B and 35B) and matches Claude Opus 4.7 in grasping inter-component dependencies. Qwen 3.6 shows excessive eagerness but spots a local improvement both other models miss.
Tags: Gemini, Qwen, Claude, Code generation, Evals

Summary generated by Claude — human-verified