OpenAI Blog·7 August 2025

From hard refusals to safe-completions: toward output-centric safety training

Signal

Hype

In three linesOpenAI introduces output-centric safety training for GPT-5, replacing hard refusals with nuanced responses. The "safe-completions" approach improves both safety and helpfulness on dual-use prompts, though specific benchmarks and technical details are not disclosed in the excerpt.

Read source

Your take?

OpenAI GPT AI safety Alignment

Summary generated by Claude — human-verified

From hard refusals to safe-completions: toward output-centric safety training

Other angles on this story