Back to feed
Reddit r/LocalLLaMA·

How small can the orchestration model in an agent be? (separating it from code-gen — that obviously wants a big model)

Signal
65
Hype
15
In three linesA developer tests the minimum model size for orchestrating a local ReAct loop. Qwen3.6-35B-A3B (MoE, ~3B active) is his threshold: below it, the model invents tool parameters or overgeneralizes calls. He improves accuracy by exposing exact signatures in the system prompt.
Read source
Your take?
AI AgentsQwenPrompt engineeringOpen source

Summary generated by Claude — human-verified