Learning a hierarchy
OpenAI develops a hierarchical reinforcement learning algorithm that learns reusable high-level actions. Applied to navigation, the agent discovers primitives (walking, crawling) and rapidly solves tasks requiring thousands of timesteps.