arXiv cs.CL

SocialMemBench: Are AI Memory Systems Ready for Social Group Settings?

Signal: 78
Hype: 15
In three lines: SocialMemBench evaluates AI memory systems in multi-party social group settings. The benchmark includes 430 personas, 7,355 conversation turns, and 1,031 QA pairs across 5 social archetypes. Gemini 2.5 Flash reaches 0.721 on small networks; open-source frameworks (Mem0, LangMem, Graphiti, Cognee) plateau at 0.12-0.18, revealing a significant capability gap.
Benchmarks, Gemini, AI Agents, Multi-agent

Summary generated by Claude — human-verified