Back to feed
arXiv cs.LG·

TBP-mHC: full expressivity for manifold-constrained hyper connections through transportation polytopes

Signal
62
Hype
15
In three linesTBP-mHC proposes Birkhoff polytope parameterizations for manifold-constrained Hyper-Connections. The method constructs exactly doubly stochastic mixing matrices with (n-1)² degrees of freedom, avoiding iterative normalization and combinatorial explosion. Competitive results on language model pre-training with improved stability and scalability.
Read source
Your take?
PapersReasoning

Summary generated by Claude — human-verified