April 2018

3 articles

Evolved Policy Gradients

OpenAI releases Evolved Policy Gradients (EPG), a metalearning approach that evolves the loss function of learning agents. EPG-trained agents generalize to novel tasks unseen during training, such as navigating to objects placed on different sides of a room.

OpenAI Reinforcement learning Reasoning

SIG

HYP

OpenAI Blog·Apr 10

Gotta Learn Fast: A new benchmark for generalization in RL

OpenAI introduces a new benchmark to evaluate generalization in reinforcement learning. The tool measures RL agents' ability to adapt to novel and varied environments beyond their training data.

Reinforcement learning Benchmarks OpenAI

SIG

HYP

OpenAI Blog·Apr 5

Retro Contest

OpenAI launches a transfer learning contest measuring a reinforcement learning algorithm's ability to generalize from previous experience.

Reinforcement learning Benchmarks OpenAI

SIG

HYP