arXiv cs.CL·19 May 2026

Training-Free Cultural Alignment of Large Language Models via Persona Disagreement

Signal

Hype

In three linesDISCA, a training-free inference-time method, culturally aligns LLMs via within-country sociodemographic disagreement. Tested on 20 countries and 7 backbones (2B–70B), it reduces cultural misalignment by 10–24% on MultiTP without modifying model weights.

Read source

Your take?

Alignment AI safety Papers Benchmarks

Summary generated by Claude — human-verified

Training-Free Cultural Alignment of Large Language Models via Persona Disagreement

Other angles on this story