arXiv cs.CL

Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning

Signal: 72
Hype: 25
In three lines
GuardedRepair is a guarded best-of-N repair framework for LLM mathematical reasoning that selectively repairs incorrect traces while preserving answers that are already correct. On GSM8K, it fixes 17 of 58 errors and breaks no previously correct answers in the reported measurements, raising accuracy from 95.60% to 96.89%. In the weak-reasoner ASDiv setting, accuracy improves from 78.40% to 87.60%.
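
As a quick consistency check on the GSM8K figures, the sketch below recomputes both accuracies from the error counts. The test-set size of 1,319 problems is an assumption (the commonly used GSM8K test split); the summary itself does not state it, and the variable names are illustrative only.

```python
# Assumption: the GSM8K test split has 1,319 problems (not stated in the summary).
N_PROBLEMS = 1319

errors_before = 58   # incorrect answers before repair (from the summary)
fixed = 17           # errors repaired (from the summary)
broken = 0           # correct answers broken by repair (none measured)


def accuracy(n_errors: int, n_total: int) -> float:
    """Accuracy in percent, rounded to two decimals."""
    return round(100 * (n_total - n_errors) / n_total, 2)


errors_after = errors_before - fixed + broken
acc_before = accuracy(errors_before, N_PROBLEMS)
acc_after = accuracy(errors_after, N_PROBLEMS)

print(f"before: {acc_before:.2f}%  after: {acc_after:.2f}%")
```

Under that assumption, the recomputed values match the reported 95.60% and 96.89%, so the error counts and accuracies in the summary are mutually consistent.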
Reasoning · Evals · AI safety

Summary generated by Claude — human-verified