Reddit r/LocalLLaMA · 8 June 2026

Gemma 4 12b QAT is a regression for my use case, despite all the hype. Not my main squeeze


In three lines: Gemma 4 12b QAT regresses for tool calling and agent workflows. The model misconfigures control tokens (<|tool_response|>) at startup, which breaks structured function execution. The standard Q5_K_L quantization remains more reliable for coding and story writing.



Summary generated by Claude — human-verified
