Back to feed
arXiv cs.CL·

Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes

Signal
72
Hype
18
In three linesNew arXiv paper on interpretable detection of harmful Chinese memes. Authors create Ex-ToxiCN-MM, first explanation dataset with opposing interpretations (harmful/non-harmful), and C-HarmKB, Chinese cultural knowledge base. They propose RIKE, attribution analysis framework with AKE and RIR modules, outperforming baselines. Code and data open-sourced.
Read source
Your take?
VisionAI safetyEvalsOpen source

Summary generated by Claude — human-verified