Back to feed
Simon Willison·

Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude

Signal
75
Hype
35
1 other source cover this →
In three linesAnthropic reverses a secret policy that would have degraded Claude Fable/Mythos performance for AI researchers without notification. The company acknowledges making 'the wrong tradeoff' and now makes safeguards visible.
Read source
Your take?
ClaudeAnthropicAI safetyAlignment

Summary generated by Claude — human-verified