Back to feed
arXiv cs.CL·

Harsher on Male? Evaluating LLMs on Gender-Asymmetric Moral Framing Across Diverse Conflict Scenarios

Signal
82
Hype
15
In three linesGAMA-Bench, a benchmark of 1,298 paired scenarios, reveals systematic asymmetry: LLMs apply harsher response standards to male actors than female actors for identical misconduct. Male actors receive more punitive and blame-centered framing, while female actors receive therapeutic and empathy-oriented responses. The pattern persists across 10 models and all scenario types.
Read source
Your take?
EvalsAI safetyAlignmentBenchmarks

Summary generated by Claude — human-verified