June 2017

4 articles

Faster physics in Python

OpenAI open-sources a high-performance Python library for robotic simulation using the MuJoCo engine, developed from a year of robotics research.

Robotics Open source Tools

SIG

HYP

OpenAI Blog·Jun 13

Learning from human preferences

OpenAI and DeepMind develop a preference learning algorithm to infer human objectives without explicit reward functions, reducing risks of undesirable AI behaviors.

OpenAI DeepMind Reinforcement learning

SIG

HYP

OpenAI Blog·Jun 8

Learning to cooperate, compete, and communicate

OpenAI explores multiagent environments where agents compete for resources as stepping stones toward AGI. These environments provide natural curriculum (difficulty matched to competitor skill) and no stable equilibrium, creating constant pressure for improvement.

Multi-agent AI Agents Reinforcement learning

SIG

HYP

OpenAI Blog·Jun 5

UCB exploration via Q-ensembles

OpenAI introduces an uncertainty-based exploration method (UCB) using Q-ensembles for reinforcement learning. The technique improves exploration by estimating uncertainty through multiple Q-estimators, enabling better exploration-exploitation trade-off.

Reinforcement learning OpenAI

SIG

HYP