Can Generalist Agents Automate Data Curation?
Signal
78
Hype
25
In three linesCuration-Bench evaluates whether generalist AI agents can automate training data curation. Agents reach published baselines within ten iterations but tend toward local policy variants. With scaffolding requiring method citation and adaptation, an agent autonomously composes a data-selection policy outperforming strong baselines at one-tenth their data budget.Read source
Your take?
Summary generated by Claude — human-verified