Back to feed
arXiv cs.AI·

Composition Collapse: Stable Factual Knowledge Does Not Imply Compositional Reasoning

Signal
78
Hype
15
In three linesarXiv paper reveals that models with statistically indistinguishable atomic knowledge fail systematically to chain them in multi-hop reasoning (>40 percentage point gap). Aggregate metrics mask this 'composition collapse'. Authors introduce a double-gate protocol decomposing post-training gains into three independent channels: atomic stability, residual composition, and critical depth.
Read source
Your take?
ReasoningBenchmarksEvals

Summary generated by Claude — human-verified