Back to feed
arXiv cs.AI·

Minor First, Major Last: A Depth-Induced Implicit Bias of Sharpness-Aware Minimization

Signal
72
Hype
15
In three linesStudy of implicit bias of Sharpness-Aware Minimization (SAM) on linear diagonal networks for binary classification. For L=1, both ℓ∞-SAM and ℓ2-SAM recover ℓ2 max-margin classifier like gradient descent. At L=2, ℓ2-SAM exhibits "sequential feature amplification": predictor initially relies on minor coordinates then shifts to major ones, contrasting with GD behavior.
Read source
Your take?
ReasoningPapers

Summary generated by Claude — human-verified