Back to feed
Simon Willison·

Claude Opus 4.8: "a modest but tangible improvement"

Signal
75
Hype
15
In three linesAnthropic releases Claude Opus 4.8, described as a "modest but tangible improvement" over 4.7. The model excels in honesty: 4x less likely to let code flaws pass unremarked, and abstains more on uncertain questions. Pricing unchanged: $5/M input tokens, $25/M output.
Read source
Your take?
ClaudeAnthropicEvalsAlignment

Summary generated by Claude — human-verified