December 2016

2 articles

Faulty reward functions in the wild

OpenAI analyzes how faulty reward functions cause failures in reinforcement learning. The article shows how a misspecified reward function can lead RL algorithms to unexpected and counterintuitive behaviors.

Reinforcement learning, Alignment, AI safety

OpenAI Blog·Dec 5

Universe

OpenAI releases Universe, a software platform for measuring and training an AI's general intelligence across games, websites, and other applications.

OpenAI, Benchmarks, AI Agents
