ActuIA·26 May 2026

GPT plus confiant sur les tâches difficiles où ils se trompe le plus, selon un preprint USC/Berkeley

Signal

Hype

In three linesGPT-4o, ChatGPT, and GPT-o3 display confidence exceeding their actual accuracy, with the gap widening on difficult tasks where they make the most mistakes. A USC/Berkeley preprint reveals growing divergence between stated confidence and real performance.

Read source

Your take?

GPT OpenAI Evals AI safety Papers

Summary generated by Claude — human-verified

GPT plus confiant sur les tâches difficiles où ils se trompe le plus, selon un preprint USC/Berkeley

Other angles on this story