Back to feed
arXiv cs.CL·

MemTrain: Self-Supervised Context Memory Training

Signal
78
Hype
15
In three linesMemTrain introduces a self-supervised training framework to enhance context-memory capabilities of LLM agents. Two coupled proxy tasks on Wikipedia (masked entity reconstruction and intermediate memory recall) are jointly optimized using GRPO. Achieves gains up to 17.67 points on long-text QA and search-based QA benchmarks.
Read source
Your take?
AI AgentsReinforcement learningPapersBenchmarks

Summary generated by Claude — human-verified