TAPS: Target-Aware Prefix Tree Selection for Diffusion-Drafted Speculative Decoding
Signal
78
Hype
15
In three linesTAPS introduces a target-aware prefix selection method for diffusion-drafted speculative decoding. By converting diffusion marginals into path-conditioned acceptance estimates, TAPS selects a compact prefix-closed subtree under fixed verification budget. Results: 7.9x lossless speedup vs vanilla autoregressive decoding, 1.36x and 1.74x over DFlash and DDTree.Read source
Your take?
Summary generated by Claude — human-verified