arXiv cs.CL

Lost in Interpretation: The Plausibility-Faithfulness Trade-off in Cross-Lingual Explanations

Signal: 75
Hype: 15
In three lines
English explanations used to audit multilingual LLMs mask a trade-off: they agree more closely with human rationale spans but are less causally grounded in the model's predictions. Across 3 tasks and 5 languages, comprehensiveness degrades by up to 5.7x in English-pivot conditions even though task accuracy stays stable. The authors recommend auditing in the input language rather than through an English pivot.
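
The two quantities contrasted in the summary can be sketched as follows. The summary does not give the paper's exact metric definitions, so this is a minimal illustration that assumes the common ERASER-style formulation: comprehensiveness as the drop in predicted-class probability once the rationale tokens are removed, and span agreement as token-level F1 against a human rationale. All names here (`comprehensiveness`, `token_f1`, `toy_predict_proba`) are hypothetical, and the toy classifier stands in for a real model.

```python
from typing import Callable, List, Sequence, Set


def comprehensiveness(
    predict_proba: Callable[[Sequence[str]], List[float]],
    tokens: Sequence[str],
    rationale: Set[int],
    label: int,
) -> float:
    """Faithfulness proxy (assumed definition): p(label | full input)
    minus p(label | input with the rationale tokens removed)."""
    full = predict_proba(tokens)[label]
    reduced_tokens = [t for i, t in enumerate(tokens) if i not in rationale]
    reduced = predict_proba(reduced_tokens)[label]
    return full - reduced


def token_f1(predicted: Set[int], human: Set[int]) -> float:
    """Plausibility proxy (assumed definition): token-level F1 between
    the explanation's token indices and the human rationale's indices."""
    overlap = len(predicted & human)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted)
    recall = overlap / len(human)
    return 2 * precision * recall / (precision + recall)


def toy_predict_proba(tokens: Sequence[str]) -> List[float]:
    """Hypothetical stand-in classifier: class 1 is likely only if
    the word 'great' is present."""
    p_positive = 0.9 if "great" in tokens else 0.5
    return [1.0 - p_positive, p_positive]


tokens = ["the", "movie", "was", "great"]
human_rationale = {2, 3}

# Explanation A highlights the token the toy model actually relies on.
comp_a = comprehensiveness(toy_predict_proba, tokens, {3}, label=1)
# Explanation B highlights a token the toy model ignores.
comp_b = comprehensiveness(toy_predict_proba, tokens, {0}, label=1)
f1_a = token_f1({3}, human_rationale)
```

In this toy setup, removing the token the model depends on lowers the predicted probability, while removing an ignored token leaves it unchanged. Overlap with the human rationale is computed without consulting the model at all, which is why the two scores can move apart.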
Evals

Summary generated by Claude — human-verified