arXiv cs.CL · 19 May 2026

Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents

In three lines: SkillTTA synthesizes task-specific textual skills at test time by retrieving relevant training trajectories, with no parameter updates to the solver model. It is evaluated on SpreadsheetBench, ALFWorld, and BigCodeBench. SpreadsheetBench improves from 0.397 to 0.505 Pass@1, and BigCodeBench from 0.517 to 0.651.
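To make the retrieve-then-synthesize idea concrete, here is a minimal sketch in Python. It is not the paper's implementation: the summary does not describe the retriever, the prompt, or the skill format, so the `Trajectory` type, the token-overlap (Jaccard) similarity, the prompt wording, and the toy `POOL` below are all assumptions chosen for illustration. The sketch only shows the general shape: rank stored training trajectories against a new task, then build a prompt asking a frozen model to write a reusable textual skill, with no weights updated anywhere.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Trajectory:
    """Hypothetical record of one training episode (not the paper's schema)."""
    task: str      # natural-language task description
    steps: str     # textual record of the actions taken
    success: bool  # whether the episode solved its task


def tokenize(text: str) -> set[str]:
    # Lowercased whitespace tokens; a stand-in for a real embedding model.
    return set(text.lower().split())


def jaccard(a: set[str], b: set[str]) -> float:
    # Size of the intersection over size of the union; 0.0 for two empty sets.
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def retrieve(query: str, pool: list[Trajectory], k: int = 2) -> list[Trajectory]:
    # Rank stored trajectories by task similarity to the query, keep the top k.
    q = tokenize(query)
    ranked = sorted(pool, key=lambda t: jaccard(q, tokenize(t.task)), reverse=True)
    return ranked[:k]


def build_skill_prompt(query: str, examples: list[Trajectory]) -> str:
    # Assemble a prompt for a frozen LLM; the wording is an assumption.
    shown = "\n\n".join(
        f"Task: {t.task}\nSteps: {t.steps}\nSucceeded: {t.success}"
        for t in examples
    )
    return (
        "Below are past trajectories on related tasks.\n\n"
        f"{shown}\n\n"
        "Write a short, reusable skill (plain-text guidance) for the new task.\n"
        f"New task: {query}\nSkill:"
    )


# Toy trajectory pool, invented for this example.
POOL = [
    Trajectory("sum column A in a spreadsheet", "open sheet; write =SUM(A:A)", True),
    Trajectory("pick up the mug and put it in the sink", "go to table; take mug", False),
    Trajectory("parse a json file and count keys", "json.load; len(d)", True),
]

if __name__ == "__main__":
    query = "sum column B in the spreadsheet"
    print(build_skill_prompt(query, retrieve(query, POOL, k=2)))
```

In a real system the prompt would be sent to a language model and the returned skill text placed in the solver's context for that task; that call is left out here because the summary does not say which model or API is used.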

Tags: AI Agents, Prompt engineering, Reasoning, Benchmarks

Summary generated by Claude — human-verified
