October 2018

4 articles

Reinforcement learning with prediction-based rewards

OpenAI introduces Random Network Distillation (RND), a prediction-based reinforcement learning method that encourages exploration through curiosity. RND exceeds average human performance on Montezuma's Revenge for the first time.

OpenAI Reinforcement learning Reasoning

SIG

HYP

OpenAI Blog·Oct 22

Learning complex goals with iterated amplification

OpenAI proposes iterated amplification, an AI safety technique enabling specification of complex behaviors by decomposing tasks into simpler sub-tasks, without labeled data or reward functions. Experiments remain limited to simple algorithmic domains.

OpenAI AI safety Alignment

SIG

HYP

OpenAI Blog·Oct 11

OpenAI Scholars 2019: Applications open

OpenAI opens applications for its second Scholars cohort: 6–10 stipends and mentorship for underrepresented individuals to study deep learning full-time for 3 months and open-source a project.

OpenAI Open source

SIG

HYP

OpenAI Blog·Oct 2

FFJORD: Free-form continuous dynamics for scalable reversible generative models

OpenAI introduces FFJORD, a reversible generative model using free-form continuous dynamics to learn complex distributions. The method reduces computational complexity and improves scalability compared to prior approaches.

OpenAI Papers Benchmarks

SIG

HYP