arXiv cs.CL

ACC: Compiling Agent Trajectories for Long-Context Training

Signal: 78
Hype: 25
In three lines
ACC converts agent trajectories (search, software engineering, database querying) into long-context QA pairs for supervised fine-tuning (SFT). It removes tool-response masking and creates explicit supervision over distant dependencies. Qwen3-30B-A3B gains +18.1 on MRCR and +7.6 on GraphWalks, comparable to Qwen3-235B.
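
To make the conversion concrete, here is a minimal sketch of flattening one trajectory into a single long-context QA pair. It is an illustration only, not the paper's pipeline: the `Step` and `QAPair` types, the `compile_trajectory` function, and the `[role] content` formatting are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Step:
    role: str      # "user", "assistant", or "tool" (hypothetical role names)
    content: str


@dataclass
class QAPair:
    context: str   # long context built from the intermediate turns
    question: str  # the original task
    answer: str    # the final assistant answer, the supervised target


def compile_trajectory(steps: List[Step]) -> QAPair:
    """Flatten a trajectory into one QA pair (assumed scheme, not the paper's)."""
    if len(steps) < 2 or steps[0].role != "user" or steps[-1].role != "assistant":
        raise ValueError("expected a user task first and an assistant answer last")
    question = steps[0].content
    answer = steps[-1].content
    # Intermediate turns, tool responses included, become plain context text,
    # so the answer has to be recovered from evidence that may sit far back.
    context = "\n".join(f"[{s.role}] {s.content}" for s in steps[1:-1])
    return QAPair(context=context, question=question, answer=answer)


if __name__ == "__main__":
    demo = [
        Step("user", "How many rows are in the orders table?"),
        Step("assistant", "run_sql('SELECT COUNT(*) FROM orders')"),
        Step("tool", "count: 1204"),
        Step("assistant", "1204"),
    ]
    print(compile_trajectory(demo))
```

In this sketch the tool output is ordinary context rather than a masked span, and the only supervised target is the final answer, which depends on that earlier output.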
AI Agents, Reasoning, Fine-tuning, Benchmarks

Summary generated by Claude — human-verified