Mixing Times of Glauber Dynamics on Masked Language Models
Signal
78
Hype
15
In three lines
Masked language models (MLMs) define local conditional distributions that are incompatible with any consistent global joint distribution. The authors model iterative resampling as a Glauber dynamics Markov chain and prove an O(n log n) mixing time under bounded cross-token influence, but they also show exponential metastability at low temperature, with persistent semantic basins.
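To make the resampling procedure concrete, here is a minimal sketch of a Glauber update loop in Python. It is not the authors' code: `toy_conditional` is a hypothetical stand-in for an MLM's masked-token distribution (a Potts-style rule that favors agreeing with neighbors), and `beta`, `vocab_size`, and `run_chain` are illustrative names. Unlike a real MLM, this toy rule does come from a consistent joint distribution, so it illustrates only the chain, not the incompatibility.

```python
import math
import random


def toy_conditional(tokens, i, vocab_size, beta):
    """Hypothetical stand-in for an MLM's distribution over position i.

    Each candidate token is scored by how many immediate neighbours it
    matches; beta is the inverse temperature (assumed, not from the source).
    """
    neighbours = [tokens[j] for j in (i - 1, i + 1) if 0 <= j < len(tokens)]
    weights = [
        math.exp(beta * sum(v == t for t in neighbours))
        for v in range(vocab_size)
    ]
    total = sum(weights)
    return [w / total for w in weights]


def glauber_step(tokens, vocab_size, beta, rng):
    """One Glauber update: pick a uniformly random position and
    resample it from its conditional given all other positions."""
    i = rng.randrange(len(tokens))
    probs = toy_conditional(tokens, i, vocab_size, beta)
    tokens[i] = rng.choices(range(vocab_size), weights=probs, k=1)[0]
    return tokens


def run_chain(n, vocab_size, beta, steps, seed=0):
    """Run the chain from a random start and return the final sequence."""
    rng = random.Random(seed)
    tokens = [rng.randrange(vocab_size) for _ in range(n)]
    for _ in range(steps):
        glauber_step(tokens, vocab_size, beta, rng)
    return tokens
```

A call such as `run_chain(n=32, vocab_size=4, beta=0.2, steps=500)` runs a few multiples of n log n single-site updates. Raising `beta` (lowering the temperature) makes the toy chain linger in uniform-token configurations, a loose analogue of the basins described above.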
Summary generated by Claude — human-verified