The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models
Signal: 75
Hype: 15
In three lines
Masked diffusion models (MDMs) with confidence-based decoding fail on complex reasoning tasks.
Confidence-aligned training amplifies errors by an order of magnitude on multi-digit addition.
Random masking better preserves the logical trajectories required for reasoning.
Summary generated by Claude — human-verified