Back to feed
arXiv cs.LG·

Learned Relay Representations for Forward-Thinking Discrete Diffusion Models

Signal
78
Hype
25
In three linesLearned Relay Representations (Relay) enables Masked Diffusion Models to propagate latent information across denoising steps via a differentiable per-token channel trained with truncated BPTT. Applied to Fast-dLLM v2, it outperforms supervised finetuning on coding tasks and reduces inference latency by 32%.
Read source
Your take?
Code generationReasoningPapers

Summary generated by Claude — human-verified