Back to feed
Reddit r/LocalLLaMA·

Cactus Hybrid Router: Gemma4-2B can match Gemini-3.1-Flash-Lite by routing 15-55% of tasks to Gemini And Running The Rest Locally.

Signal
72
Hype
35
In three linesCactus Hybrid Router, a 65k parameter routing model, directs 15-55% of tasks to Gemini-3.1-Flash-Lite and runs the rest locally with Gemma4-2B. The system maintains performance even with 4-bit quantization and handles text, vision, and audio.
Read source
Your take?
GeminiAI AgentsOpen sourceInfrastructure

Summary generated by Claude — human-verified