OpenAI Blog·15 February 2024

Video generation models as world simulators

Signal

Hype

In three linesOpenAI introduces Sora, a text-conditional diffusion model trained jointly on videos and images of variable durations, resolutions and aspect ratios. Built on a transformer architecture operating on spacetime patches, Sora generates up to one minute of high-fidelity video. OpenAI suggests that scaling video generation models is a promising path toward general-purpose physical world simulators.

Read source

Your take?

OpenAI Video generation Reasoning

Summary generated by Claude — human-verified

Video generation models as world simulators

Other angles on this story