Capabilities
- Modalities: text → video, image → video, video (generated) → video
- Audio: Sora 2 supports audio generation in output videos (similar to the Sora app).
- Remix: Sora 2 introduces the ability to remix existing videos by making targeted adjustments instead of regenerating from scratch.
- Responsible AI and video generation: Azure OpenAI’s video generation models include built-in Responsible AI (RAI) protections to help ensure safe and compliant use. Sora 2 blocks all IP and photorealistic content. In addition, Azure provides input and output moderation across all image generation models, along with Azure-specific safeguards such as content filtering and abuse monitoring. These systems help detect and prevent the generation or misuse of harmful, unsafe, or policy-violating content. Customers can learn more about these safeguards and how to customize them on the Content filtering page.
Model details
Sora 2 uses the Azure OpenAI v1 API, aligning with OpenAI’s native Sora 2 schema.| Aspect | Sora 2 |
|---|---|
| Model type | Adapts OpenAI’s latest Sora 2 API using v1 API |
| Modalities supported | text → video, image → video, video (generated) → video |
| Audio generation | ✅ Supported in outputs |
| Remix capability | ✅ Supported — make targeted edits to existing videos |
| Performance & fidelity | Enhanced realism, physics, and temporal consistency |
| Billing | Per second billing information |
Quickstart
Generate video clips using the Azure OpenAI service. Video generation is an asynchronous process. You create a job request with your text prompt and video format specifications, and the model processes the request in the background. You check the status of the video generation job and, once it finishes, retrieve the generated video through a download URL. The example uses the Sora 2 model.Clean up resources
If you want to clean up and remove an Azure OpenAI resource, you can delete the resource. Before deleting the resource, you must first delete any deployed models.Responsible AI and video generation
Azure OpenAI’s image generation models include built-in Responsible AI (RAI) protections to help ensure safe and compliant use. In addition, Azure provides input and output moderation across all image generation models, along with Azure-specific safeguards such as content filtering and abuse monitoring. These systems help detect and prevent the generation or misuse of harmful, unsafe, or policy-violating content. Currently the Sora 2 API enforces several content restrictions:- Only content suitable for audiences under 18 (a setting to bypass this restriction will be available in the future).
- Copyrighted characters and copyrighted music will be rejected.
- Real people—including public figures—cannot be generated.
- Input images with faces of humans are currently rejected.
Sora 2 API reference
The Sora 2 API provides 5 endpoints, each with distinct capabilities.- Create Video: Start a new render job from a prompt, with optional reference inputs or a remix ID.
- Get Video Status: Retrieve the current state of a render job and monitor its progress
- Download Video: Fetch the finished MP4 once the job is completed.
- List Videos: Enumerate your videos with pagination for history, dashboards, or housekeeping.
- Delete Videos: Delete an individual video ID from Azure OpenAI’s storage
API parameters
| Parameter | Type | Sora 2 |
|---|---|---|
| Prompt | String (required) | Natural-language description of the shot. Include shot type, subject, action, setting, lighting, and any desired camera motion to reduce ambiguity. Keep it single-purpose for best adherence. |
| Model | String (optional) | Sora-2 (default) |
| Size (Output resolution in width × height) | String (optional) | Portrait: 720×1280 Landscape: 1280×720 Default: 720×1280 |
| Seconds | String (optional) | 4 / 8 / 12 Default: 4 |
| Input reference | File (optional) | Single reference image used as a visual anchor for the first frame. Accepted MIME types: image/jpeg, image/png, image/webp. Must match size exactly. |
| Remix_video_id | String (optional) | ID of a previously completed video (e.g., video_...) to reuse structure, motion, and framing. Same as Sora 2 |
videos.create()
You’ll need to update to the latest version of the OpenAI client withpip install openai --upgrade to prevent AttributeError: 'OpenAI' object has no attribute 'videos'.
- Microsoft Entra ID
- API Key
- Environment Variables
- Response
Create a video and poll job status
CallGET /videos/{video_id} with the ID returned from the create call. The response shows the job’s current status, progress percentage, and any errors.
Expected states are queued, in_progress, completed, and failed.
- Microsoft Entra ID
- API Key
- Environment Variables
- Response
Synchronous:Use this version if testing in Jupyter Notebooks to avoid Async:
RuntimeError: asyncio.run() cannot be called from a running event loopDownload video
- Microsoft Entra ID
- API Key
- Environment Variables
- Response
Video generation from reference source
Theinput_reference parameter allows you to transform existing images using Sora 2. The resolution of the source image and final video must match. Supported values are 720x1280, and 1280x720.
- Microsoft Entra ID
- API Key
- Environment Variables
- Response
Local reference file:URL based reference file:
Remix video
The remix feature allows you to modify specific aspects of an existing video while preserving its core elements. By referencing the previous videoid from a successfully completed generation, and supplying an updated prompt the system maintains the original video’s framework, scene transitions, and visual layout while implementing your requested changes. For optimal results, limit your modifications to one clearly articulated adjustment—narrow, precise edits retain greater fidelity to the source material and minimize the likelihood of generating visual defects.
- Microsoft Entra ID
- API Key
- Environment Variables
- Response
Best practices for prompts
Write text prompts in English or other Latin script languages for the best video generation performance.Limitations
Content quality limitations
Sora 2 might have difficulty with complex physics, causal relationships (for example, bite marks on a cookie), spatial reasoning (for example, knowing left from right), and precise time-based event sequencing such as camera movement.Technical limitations
Sora 2 video generation is currently in preview. Keep the following limitations in mind:- Sora 2 supports the following output resolution dimensions: 480x480, 480x854, 854x480, 720x720, 720x1280, 1280x720, 1080x1080, 1080x1920, 1920x1080.
- Sora 2 can produce videos between 1 and 20 seconds long.
- You can request multiple video variants in a single job: for 1080p resolutions, this feature is disabled; for 720p, the maximum is two variants; for other resolutions, the maximum is four variants.
- You can have two video creation jobs running at the same time. You must wait for one of the jobs to finish before you can create another.
- Jobs are available for up to 24 hours after they’re created. After that, you must create a new job to generate the video again.
- You can use up to two images as input (the generated video interpolates content between them).
- You can use one video up to five seconds as input.
Related content
- Learn more about Azure OpenAI deployment types.
- Learn more about Azure OpenAI quotas and limits.