OpenAI Blog·21 December 2016

Faulty reward functions in the wild

Signal

Hype

In three linesOpenAI analyzes failures of reward functions in reinforcement learning. The article explores how misspecifying the reward function can cause unexpected and counterintuitive behaviors in RL algorithms.

Read source

Your take?

Reinforcement learning Alignment AI safety

Summary generated by Claude — human-verified

Faulty reward functions in the wild

Other angles on this story