Back to feed
arXiv cs.AI·

CAM-Bench: A Benchmark for Computational and Applied Mathematics in Lean

Signal
78
Hype
15
In three linesCAM-Bench is a Lean 4 benchmark of 1,000 computational and applied mathematics problems (optimization, numerical linear algebra, numerical analysis) adapted from textbooks with locally recovered context via dependency-recovery pipeline. Evaluation of LLMs and formalization agents reveals failures in tracking local assumptions and long-horizon control in Lean.
Read source
Your take?
BenchmarksReasoningCode generation

Summary generated by Claude — human-verified