Back to feed
arXiv cs.AI·

PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media

Signal
72
Hype
25
In three linesPluRule is a multimodal, multilingual benchmark for moderating pluralistic communities on social media. It covers 13,371 rule violations across 1,989 Reddit communities and 2,885 rules in 9 languages. State-of-the-art vision-language models, including GPT-4.5 with advanced reasoning, only marginally outperform a trivial baseline, revealing that pluralistic moderation remains a fundamental challenge.
Read source
Your take?
BenchmarksVisionAI safetyGPT

Summary generated by Claude — human-verified