Back to feed
arXiv cs.LG·

Revisiting Padded Transformer Expressivity: Which Architectural Choices Matter and Which Don't

Signal
75
Hype
15
In three linesTheoretical study of padded transformer expressivity. Authors prove polynomially padded constant-precision transformers are equivalent to L-uniform AC⁰, while growing-precision ones achieve L-uniform TC⁰. Model depth and numeric precision are key factors; width beyond logarithmic does not increase expressivity.
Read source
Your take?
ReasoningPapersBenchmarks

Summary generated by Claude — human-verified