From Context Shift to Stylistic Collapse: Why Training Objectives Matter More Than Scale
Signal
78
Hype
25
In three linesStudy of 17 models (410M-100B+ parameters) showing instruction-tuning causes linguistic entropy collapse (amplification: 1,949-16,853%), independent of RLHF. Strong control (lambda=5.0) reduces this effect by 40.5% and outperforms frontier models by 96.7-98.2% despite 200-1000x scale disadvantage.Read source
Your take?
Summary generated by Claude — human-verified