Back to feed
Reddit r/LocalLLaMA·

An agent that plans with a frontier model but runs most of tokens locally (built it for my own dual-3090 rig)

Signal
45
Hype
35
In three linesPersonal hybrid agent tool: frontier model planning (Codex) with local execution using Qwen 3.6 27B on dual RTX 3090. 3-tier architecture (Planner/Local/Senior optional) to minimize frontier costs while retaining reasoning capabilities. Deterministic task validation.
Read source
Your take?
AI AgentsQwenCode generationPrompt engineeringOpen source

Summary generated by Claude — human-verified