Back to feed
arXiv cs.LG·

Spectral Asymptotics of Neural Network Loss Landscapes: An Exact Decomposition of the Curvature Exponent

Signal
78
Hype
15
In three linesTheoretical study of neural network loss landscape geometry. Authors prove a Spectral Alignment Decomposition explaining why curvature exponent α varies across layer types (α≈2 convolutions, α≈1 transformer attention, α<1 MLP). Empirical validation on 93 layers, 5 architectures, 3 datasets with ~2% median error.
Read source
Your take?
PapersReasoningBenchmarks

Summary generated by Claude — human-verified