Confidence Geometry Reveals Trace-Level Correctness in Large Language Model Reasoning
Signal
78
Hype
25
In three linesToken-level confidence trajectories in LLMs encode geometric signals linked to reasoning trace correctness. Without access to text or hidden states, low-dimensional representations separate correct from incorrect traces on GSM8K, MATH, and MMLU. NeuralConf, a lightweight estimator, improves confidence-weighted answer aggregation over majority voting.Read source
Your take?
Summary generated by Claude — human-verified