OpenAI Blog · 16 August 2017

More on Dota 2

In three lines: OpenAI demonstrates that, given sufficient compute, self-play can take ML systems from subhuman to superhuman performance. Within a month, the system progressed from matching top-ranked players to beating professional players, and it has continued to improve since. Unlike supervised learning, which is limited by its training data, self-play automatically generates better data as the agent improves.

OpenAI Reinforcement learning Benchmarks

Summary generated by Claude — human-verified
