Back to feed
OpenAI Blog·

More on Dota 2

Signal
75
Hype
35
In three linesOpenAI demonstrates that self-play catapults ML systems from subhuman to superhuman performance with sufficient compute. Within a month, the system progressed from matching top-ranked players to beating professional pros, with continued improvement. Unlike supervised learning constrained by training data, self-play automatically generates better data as the agent improves.
Read source
Your take?
OpenAIReinforcement learningBenchmarks

Summary generated by Claude — human-verified