arXiv cs.CL·26 May 2026

Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning

Signal

Hype

In three linesGuardedRepair is a guarded best-of-N repair framework for LLM mathematical reasoning that selectively fixes incorrect traces while preserving correct answers. On GSM8K (95.60% → 96.89%), it fixes 17 of 58 errors with no measured broken-correct cases. On weak-reasoner ASDiv, accuracy improves from 78.40% to 87.60%.

Read source

Your take?

Reasoning Evals AI safety

Summary generated by Claude — human-verified

Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning

Other angles on this story