arXiv cs.CL

PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media

Signal: 72
Hype: 25
In three lines: PluRule is a multimodal, multilingual benchmark for moderating pluralistic communities on social media. It covers 13,371 rule violations across 1,989 Reddit communities (9 languages, 2,885 rules). State-of-the-art vision-language models, including GPT-4.5 with advanced reasoning, only marginally outperform a trivial baseline.
Benchmarks, Vision, AI safety, Evals

Summary generated by Claude — human-verified