The Future of Video is Imagined.

Sora 2.0 is a state-of-the-art AI model that generates high-fidelity video from text, images, and other clips. Bring your most ambitious creative concepts to life with unprecedented ease and control.

Sora 2.0 generated video example

Create from Any Starting Point.

Sora 2.0 is a multimodal model, allowing you to seamlessly integrate various inputs into your workflow.

Text-to-Video Generation

Simply provide a written scenario, and Sora will generate a short video that faithfully matches your prompt's content, setting, and camera motion.

Text-to-video generation example
Image-to-video animation example

Image-to-Video Animation

Animate the contents of a single image. Sora can bring still images to life by adding accurate motion and detail based on the source photo.

Video-to-Video Transformation

Take an existing video and extend it forward or backward in time, fill in missing frames, or completely transform its style with text instructions.

Video-to-video transformation example

Enhanced for the Professional Workflow.

Sora 2.0 introduces key improvements for faster, more consistent, and higher-quality results.

Faster Generation

Powered by the Sora Turbo model, which significantly reduces wait times for video outputs.

Higher Fidelity

Achieves more faithful interpretations of complex prompts thanks to improved training data and techniques.

Improved Consistency

Maintains a subject's appearance with greater stability, even when they move off-screen and return.

New In-Editor Tools

Iteratively refine results with built-in Remix, Re-cut, and Blend features, turning Sora into an AI-powered editing studio.

Direct Your Vision, Frame by Frame.

Go beyond a single prompt with the interactive Storyboard editor. Break your timeline into segments and specify different scenes, camera angles, or inputs at each timestamp for ultimate creative control. This makes it possible to dictate motion cues and scene changes with much finer control.

Sora 2.0 Storyboard Editor Interface
1

Timeline Segments: Break your video into precise time-based cards

2

Mixed Inputs: Combine text, images, and video clips in each segment

3

Scene Control: Specify camera angles, motion, and transitions

Output that Meets the Moment.

Sora 2.0 delivers state-of-the-art quality, realism, and style versatility, setting a new standard for AI-generated video.

16:9 landscape video example
9:16 vertical video example
1:1 square video example
Resolution

Up to 1080p Full HD

output for crisp, detailed visuals

Duration

Up to 20 seconds

long, with research showing capability for up to a minute

Realism

Photorealistic Quality

Produces convincing lighting, textures, and camera motion that can blur the line between real and fake

Style Versatility

Unlimited Styles

Capable of mimicking a wide range of styles, from Pixar-like cartoons and anime to stop-motion claymation

Get Access to Sora 2.0.

Sora is included with ChatGPT Plus and Pro subscriptions. No separate purchase is necessary.

ChatGPT Plus

~$20/month

Max Resolution

720p

Max Duration

10 seconds

Generation Speed

Standard

Watermark

Yes (optional)

ChatGPT Pro

~$50/month

Max Resolution

1080p

Max Duration

20 seconds

Generation Speed

Priority Access

Watermark

No Watermark