Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning
Signal
72
Hype
25
In three linesGuardedRepair is a guarded best-of-N repair framework for LLM mathematical reasoning that selectively fixes incorrect traces while preserving correct answers. On GSM8K (95.60% → 96.89%), it fixes 17 of 58 errors with no measured broken-correct cases. On weak-reasoner ASDiv, accuracy improves from 78.40% to 87.60%.Read source
Your take?
Summary generated by Claude — human-verified