Learnability-Informed Fine-Tuning of Diffusion Language Models
Signal
78
Hype
25
In three linesNew LIFT method for fine-tuning diffusion language models (DLMs). Analysis shows vanilla SFT ignores token learnability based on masking. LIFT aligns learning with diffusion steps: easy tokens when input is masked, hard tokens with more context. Up to 3x gains on AIME'24/25 vs SFT baselines.Read source
Your take?
Summary generated by Claude — human-verified