ACC: Compiling Agent Trajectories for Long-Context Training
Signal
78
Hype
25
In three linesACC converts agent trajectories (search, software engineering, database querying) into long-context QA pairs for SFT training. Removes tool response masking and creates explicit supervision over distant dependencies. Qwen3-30B-A3B achieves +18.1 on MRCR and +7.6 on GraphWalks, comparable to Qwen3-235B.Read source
Your take?
Summary generated by Claude — human-verified