OpenAI and Anthropic share findings from a joint safety evaluation
Signal
72
Hype
25
In three linesOpenAI and Anthropic release findings from their first joint safety evaluation, testing each other's models for misalignment, instruction following, hallucinations, and jailbreaking. Rare cross-lab collaboration effort on AI safety.Read source
Your take?
Summary generated by Claude — human-verified