The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling
Signal: 72
Hype: 25
In three lines
The Cognitive Categorical Transformer (CCT) is a 306M-parameter model that augments GPT-2 Small with components inspired by category theory and cognitive science. On WikiText-103, CCT reaches a validation perplexity of 21.27 versus 24.19 for the GPT-2 Small baseline, a reduction of 2.92 PPL (12% relative). Ablations attribute 84% of the improvement to simplicial message passing.
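The headline figures can be checked with a few lines of arithmetic. The sketch below uses only the two perplexities and the 84% share quoted in the summary. It assumes that "84% of the improvement" refers to a share of the absolute perplexity gap, which the summary does not state explicitly. The variable names are illustrative and do not come from the paper.

```python
# Figures quoted in the summary (WikiText-103 validation perplexity).
baseline_ppl = 24.19  # GPT-2 Small baseline
cct_ppl = 21.27       # Cognitive Categorical Transformer (CCT)

# Absolute and relative perplexity reduction of CCT over the baseline.
absolute_reduction = baseline_ppl - cct_ppl
relative_reduction = absolute_reduction / baseline_ppl

# Assumption: the 84% ablation share applies to the absolute PPL gap.
simplicial_share = 0.84
simplicial_ppl = simplicial_share * absolute_reduction

print(f"absolute reduction: {absolute_reduction:.2f} PPL")
print(f"relative reduction: {relative_reduction:.1%}")
print(f"attributed to simplicial message passing: ~{simplicial_ppl:.2f} PPL")
```

Under that assumption, simplicial message passing would account for roughly 2.45 of the 2.92 PPL gap, leaving about 0.47 PPL for the remaining components combined.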
Summary generated by Claude — human-verified