Back to feed
arXiv cs.CL·

Training-Free Cultural Alignment of Large Language Models via Persona Disagreement

Signal
78
Hype
22
In three linesDISCA, a training-free inference-time method, culturally aligns LLMs via within-country sociodemographic disagreement. Tested on 20 countries and 7 backbones (2B–70B), it reduces cultural misalignment by 10–24% on MultiTP without modifying model weights.
Read source
Your take?
AlignmentAI safetyPapersBenchmarks

Summary generated by Claude — human-verified