Learned Relay Representations for Forward-Thinking Discrete Diffusion Models
Signal
78
Hype
25
In three linesLearned Relay Representations (Relay) enables Masked Diffusion Models to propagate latent information across denoising steps via a differentiable per-token channel trained with truncated BPTT. Applied to Fast-dLLM v2, it outperforms supervised finetuning on coding tasks and reduces inference latency by 32%.Read source
Your take?
Summary generated by Claude — human-verified