Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges
Signal
78
Hype
15
In three linesStudy on rationalization bias in LLM judges. Researchers test whether model explanations remain stable when non-evidential cues are perturbed (verbosity, confidence). They propose PROOF-BEFORE-PREFERENCE to improve cue invariance and reduce explanation anchoring.Read source
Your take?
Summary generated by Claude — human-verified