arXiv cs.AI·29 May 2026

The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling

Signal

Hype

In three linesThe Cognitive Categorical Transformer (CCT), a 306M-parameter model augmenting GPT-2 Small, incorporates category-theoretic and cognitive-science-inspired components. On WikiText-103, CCT achieves 21.27 validation perplexity versus 24.19 for GPT-2 Small baseline, a 12% relative reduction (2.92 PPL). Ablations show simplicial message passing accounts for 84% of the improvement.

Read source

Your take?

GPT Papers Benchmarks Reasoning

Summary generated by Claude — human-verified

The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling

Other angles on this story