Back to feed
arXiv cs.AI·

SocialMemBench: Are AI Memory Systems Ready for Social Group Settings?

Signal
78
Hype
15
In three linesSocialMemBench benchmarks AI memory systems in multi-party social groups (430 personas, 7,355 conversation turns, 1,031 QA pairs). Gemini 2.5 Flash reaches 0.721 on small networks vs 0.98 expected. Four open-source frameworks (Mem0, LangMem, Graphiti, Cognee) score 0.12-0.18, well below references (0.345-0.369), revealing a measurable gap.
Read source
Your take?
BenchmarksGeminiAI AgentsMulti-agentEvals

Summary generated by Claude — human-verified