arXiv cs.AI

Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents

Signal: 72 · Hype: 28
In three lines: SkillTTA synthesizes task-specific textual skills at test time from relevant training trajectories that it retrieves. Adaptation happens through context only, with no parameter updates. It is evaluated on SpreadsheetBench, ALFWorld, and BigCodeBench; Pass@1 improves from 0.397 to 0.505 on SpreadsheetBench and from 0.517 to 0.651 on BigCodeBench.
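The retrieve-then-synthesize loop in the summary above can be sketched roughly as follows. This is an illustrative toy, not the paper's implementation. The function names, the trajectory format, the bag-of-words similarity, and the template that stands in for an LLM-written skill are all assumptions made for the example. The only property it is meant to show is that adaptation happens by changing the prompt, never the model weights.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercased token counts; a toy stand-in for a learned embedding (assumption)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(task, trajectories, k=2):
    """Return the k stored trajectories whose task text is most similar to the new task."""
    query = bag_of_words(task)
    ranked = sorted(
        trajectories,
        key=lambda t: cosine(query, bag_of_words(t["task"])),
        reverse=True,
    )
    return ranked[:k]


def synthesize_skill(task, retrieved):
    """Merge steps from retrieved trajectories into a textual skill.

    Hypothetical placeholder: a real system would presumably ask an LLM
    to write the skill; a fixed template is used here so the sketch runs offline.
    """
    steps = []
    for trajectory in retrieved:
        for step in trajectory["steps"]:
            if step not in steps:  # keep first occurrence, drop duplicates
                steps.append(step)
    return "\n".join([f"Skill for: {task}"] + [f"- {s}" for s in steps])


def build_prompt(task, trajectories, k=2):
    """Context-only adaptation: the skill is prepended to the prompt; no weights change."""
    skill = synthesize_skill(task, retrieve(task, trajectories, k))
    return f"{skill}\n\nTask: {task}"


# Made-up trajectories, for illustration only.
TRAJECTORIES = [
    {"task": "sum a column in a spreadsheet",
     "steps": ["load the workbook", "locate the target column",
               "write the total below the last row"]},
    {"task": "heat an egg and put it on the table",
     "steps": ["find the egg", "use the microwave"]},
    {"task": "average a spreadsheet column",
     "steps": ["load the workbook", "locate the target column",
               "write the mean to the output cell"]},
]

if __name__ == "__main__":
    print(build_prompt("sum the sales column in the spreadsheet", TRAJECTORIES))
```

Because the skill lives entirely in the prompt, the same frozen model can be pointed at a different task family by swapping the trajectory store, which is the practical appeal of the context-only approach the summary describes.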
AI Agents · Prompt engineering · Benchmarks · Papers

Summary generated by Claude — human-verified