arXiv cs.AI·27 May 2026

Composition Collapse: Stable Factual Knowledge Does Not Imply Compositional Reasoning

Signal

Hype

In three linesarXiv paper reveals that models with statistically indistinguishable atomic knowledge fail systematically to chain them in multi-hop reasoning (>40 percentage point gap). Aggregate metrics mask this 'composition collapse'. Authors introduce a double-gate protocol decomposing post-training gains into three independent channels: atomic stability, residual composition, and critical depth.

Read source

Your take?

Reasoning Benchmarks Evals

Summary generated by Claude — human-verified

Composition Collapse: Stable Factual Knowledge Does Not Imply Compositional Reasoning

Other angles on this story