Back to feed
arXiv cs.AI·

Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory

Signal
75
Hype
25
In three linesEvo-Memory is a benchmark for evaluating self-evolving memory in LLM agents. It structures data into sequential task streams and tests 10+ memory modules across 10 datasets. Authors propose ExpRAG for experience reuse and ReMem, an action-think-memory refine pipeline for continuous improvement.
Read source
Your take?
AI AgentsBenchmarksRAGReasoning

Summary generated by Claude — human-verified