Back to feed
arXiv cs.LG·

The Long-Term Effects of Data Selection in LLM Fine-Tuning

Signal
78
Hype
15
In three linesStudy on long-term effects of data selection during multi-stage LLM fine-tuning. Authors show that short-term optimal strategies (loss-based, gradient-based, diversity-based) can slow future learning and increase catastrophic forgetting. They propose LHAS (Long-Horizon Aware Selection) to evaluate selection as a global training intervention.
Read source
Your take?
Fine-tuningBenchmarksPapers

Summary generated by Claude — human-verified