Reddit r/LocalLLaMA

Gemma 4 12b QAT is a regression for my use case, despite all the hype.. Not my main Squeeze

Signal: 45 · Hype: 35
In three lines: Gemma 4 12b QAT regresses for tool calling and agent workflows. The model misconfigures control tokens (<|tool_response|>) at startup, breaking structured function execution. Standard Q5_K_L remains more reliable for coding and story writing.
Gemini · AI Agents · Code generation · Tools

Summary generated by Claude — human-verified