GPT plus confiant sur les tâches difficiles où ils se trompe le plus, selon un preprint USC/Berkeley
Signal
72
Hype
35
In three linesGPT-4o, ChatGPT, and GPT-o3 display confidence exceeding their actual accuracy, with the gap widening on difficult tasks where they make the most mistakes. A USC/Berkeley preprint reveals growing divergence between stated confidence and real performance.Read source
Your take?
Summary generated by Claude — human-verified