The Long-Term Effects of Data Selection in LLM Fine-Tuning
Signal
78
Hype
15
In three linesStudy on long-term effects of data selection during multi-stage LLM fine-tuning. Authors show that short-term optimal strategies (loss-based, gradient-based, diversity-based) can slow future learning and increase catastrophic forgetting. They propose LHAS (Long-Horizon Aware Selection) to evaluate selection as a global training intervention.Read source
Your take?
Summary generated by Claude — human-verified