March 2018

7 articles

Variance reduction for policy gradient with action-dependent factorized baselines

OpenAI publishes a variance reduction method for policy gradient algorithms using action-dependent factorized baselines. The technique improves training efficiency by reducing gradient estimator variance, applicable to reinforcement learning models.

Reinforcement learning OpenAI Papers

SIG

HYP

OpenAI Blog·Mar 15

Improving GANs using optimal transport

OpenAI publishes a method to improve GANs using optimal transport. The technique reduces training instability and improves generated image quality by leveraging Wasserstein distances.

Image generation Papers OpenAI

SIG

HYP

OpenAI Blog·Mar 15

Report from the OpenAI hackathon

OpenAI hosted its first hackathon on March 3rd with 100 members of the AI community. The event brought together developers and researchers to work on projects using OpenAI technologies.

OpenAI

SIG

HYP

OpenAI Blog·Mar 8

On first-order meta-learning algorithms

OpenAI publishes analysis on first-order meta-learning algorithms. The article explores theoretical and practical foundations of optimization methods that enable models to learn to learn quickly from few examples.

OpenAI Papers Reinforcement learning

SIG

HYP

OpenAI Blog·Mar 7

Reptile: A scalable meta-learning algorithm

OpenAI introduces Reptile, a scalable meta-learning algorithm that samples tasks, applies stochastic gradient descent, and updates initial parameters toward learned parameters. Mathematically similar to first-order MAML, it requires only black-box access to optimizers like SGD or Adam with comparable efficiency and performance.

OpenAI Reinforcement learning

SIG

HYP

OpenAI Blog·Mar 6

OpenAI Scholars

OpenAI launches a scholarship program for 6–10 individuals from underrepresented groups. Recipients will study deep learning full-time for 3 months and open-source a project.

OpenAI Open source Business

SIG

HYP

OpenAI Blog·Mar 3

Some considerations on learning to explore via meta-reinforcement learning

OpenAI explores meta-reinforcement learning to improve agents' ability to explore efficiently. The article examines how models can learn generalizable exploration strategies rather than being pre-programmed.

Reinforcement learning AI Agents OpenAI

SIG

HYP