Back to feed
Reddit r/LocalLLaMA·

VibeThinker-3B: what is this witchcraft? Killing it at MathQA like it has ~30B parameters

Signal
35
Hype
65
In three linesVibeThinker-3B, a 3B model, achieves exceptional MathQA results comparable to ~30B models. Reddit users report abnormally high performance for its size.
Read source
Your take?
BenchmarksOpen source

Summary generated by Claude — human-verified