- Deploy MAI image models in Microsoft Foundry
- Authenticate by using Microsoft Entra ID or API keys
- Generate images by using the MAI image generations API
- Run an image edit by using the MAI image edits API
| Model name | Model version | Type |
|---|---|---|
| MAI-Image-2.5-Flash (Preview) | 2026-06-02 | Text-to-image generation, image-to-image edits |
| MAI-Image-2.5 (Preview) | 2026-06-02 | Text-to-image generation, image-to-image edits |
| MAI-Image-2e (Preview) | 2026-04-09 | Text-to-image generation |
| MAI-Image-2 (Preview) | 2026-02-20 | Text-to-image generation |
Prerequisites
- An Azure subscription with a valid payment method. If you don’t have an Azure subscription, create a paid Azure account.
- Access to Microsoft Foundry with appropriate permissions to create and manage resources.
- A Microsoft Foundry project. MAI image models are available for global standard deployment (West Central US, East US, West US, West Europe, Sweden Central, South India, and UAE North).
- Cognitive Services Contributor role on the Azure AI Foundry resource to deploy models. For more information, see Azure RBAC roles.
Deploy MAI image models
To deploy an MAI image model, follow the instructions in Deploy Microsoft Foundry Models in the Foundry portal. Alternatively, you can deploy the model by using the Azure CLI. The following code shows deployment of MAI-Image-2.5. To deploy a different model, replace the model name and version in the lines --model-name MAI-Image-2.5 and --model-version 2026-06-02 with the values for your desired model.
Replace <ACCOUNT_NAME>, <RESOURCE_GROUP>, and <DEPLOYMENT_NAME> with your values.
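As a rough sketch of the CLI route, the helper below assembles an argument list for the az cognitiveservices account deployment create command. The function name build_deploy_command is hypothetical, and the SKU name GlobalStandard, the capacity of 1, and the value you pass for --model-format are assumptions that this article does not state; check them against the model catalog before running anything.

```python
import shlex


def build_deploy_command(account_name, resource_group, deployment_name, *,
                         model_format,
                         model_name="MAI-Image-2.5",
                         model_version="2026-06-02",
                         sku_name="GlobalStandard",
                         sku_capacity=1):
    """Return the argv list for an Azure CLI model deployment.

    model_format, sku_name, and sku_capacity are assumptions: this article
    does not list them, so supply the values shown for your model.
    """
    return [
        "az", "cognitiveservices", "account", "deployment", "create",
        "--name", account_name,
        "--resource-group", resource_group,
        "--deployment-name", deployment_name,
        "--model-name", model_name,
        "--model-version", model_version,
        "--model-format", model_format,
        "--sku-name", sku_name,
        "--sku-capacity", str(sku_capacity),
    ]


def as_shell_line(command):
    """Render the argv list as a single, safely quoted shell line."""
    return shlex.join(command)
```

To run the deployment from Python, pass the returned list to subprocess.run(command, check=True); to review it first, print as_shell_line(command).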
Run text-to-image generation
The following example shows how to generate an image from a text prompt by using an MAI image model with the MAI image generations API.
Use API key authentication
- Install the requests library:
- Set environment variables:
- Run the following code:
Expected output: A JSON response containing the generated image data in base64 format. The image is decoded and saved as output.png in the current directory.
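A minimal sketch of these steps in Python might look like the following. It assumes the full image generations endpoint URL and the API key are supplied by the caller (for example, from environment variables of your choosing), and it assumes the response body has the shape {"data": [{"b64_json": "..."}]}, which this article does not spell out. The function names are hypothetical.

```python
import base64


def build_generation_request(deployment_name, prompt, width=1024, height=1024):
    """Build the JSON body from the documented request parameters."""
    return {
        "model": deployment_name,  # the deployment name, not the model name
        "prompt": prompt,
        "width": width,
        "height": height,
    }


def decode_first_image(response_json):
    """Decode the first image in the response.

    Assumption: the payload looks like {"data": [{"b64_json": "<base64>"}]}.
    Adjust the keys if your response differs.
    """
    return base64.b64decode(response_json["data"][0]["b64_json"])


def generate_image(endpoint_url, api_key, deployment_name, prompt,
                   out_path="output.png", timeout=120):
    """POST the prompt to the generations endpoint and save the PNG."""
    # Third-party dependency; imported here so the helpers above can be
    # used and tested without it installed.
    import requests

    response = requests.post(
        endpoint_url,
        headers={"api-key": api_key},
        json=build_generation_request(deployment_name, prompt),
        timeout=timeout,
    )
    response.raise_for_status()
    with open(out_path, "wb") as image_file:
        image_file.write(decode_first_image(response.json()))
    return out_path
```

Calling generate_image(endpoint_url, api_key, "<DEPLOYMENT_NAME>", "A lighthouse at dawn") would write output.png to the current directory if the response shape matches the assumption above.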
Use Microsoft Entra ID authentication
To use Microsoft Entra ID instead of an API key, replace the api-key header with a bearer token obtained by using DefaultAzureCredential:
- Install the Azure Identity library:
- Update the request headers in the API key authentication code:
Reference: DefaultAzureCredential
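The header swap could be sketched as follows, assuming the azure-identity package is installed and that the token scope is the one listed in the troubleshooting table (https://cognitiveservices.azure.com/.default). The helper names are hypothetical.

```python
COGNITIVE_SERVICES_SCOPE = "https://cognitiveservices.azure.com/.default"


def bearer_headers(access_token):
    """Return request headers that carry a bearer token instead of api-key."""
    return {"Authorization": f"Bearer {access_token}"}


def entra_headers():
    """Acquire a token with DefaultAzureCredential and build the headers."""
    # Third-party dependency (azure-identity); imported lazily so that
    # bearer_headers stays usable without it.
    from azure.identity import DefaultAzureCredential

    credential = DefaultAzureCredential()
    token = credential.get_token(COGNITIVE_SERVICES_SCOPE)
    return bearer_headers(token.token)
```

Pass the result of entra_headers() as the headers argument of the request in place of the api-key header. Tokens expire, so long-running jobs should call entra_headers() again rather than cache the result indefinitely.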
Run an image-to-image edit
The following example shows how to perform an image-to-image edit by using an MAI image model with the MAI image edits API.
MAI-Image-2.5-Flash (Preview) and MAI-Image-2.5 (Preview) support image-to-image edits by using the image edits API.
Requests for image-to-image edits use multipart form data.
Use API key authentication
- Install the requests library:
- Set environment variables:
- Run the following code:
Expected output: A JSON response containing the edited image data in base64 format. The image is decoded and saved as output.png in the current directory.
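A multipart edit request could be sketched as below. As with generation, the endpoint URL and key come from the caller, the function names are hypothetical, and the response is assumed to have the shape {"data": [{"b64_json": "..."}]}. The form field names model, prompt, and image follow the request parameters table in this article.

```python
import base64
import mimetypes
import os

SUPPORTED_TYPES = {"image/jpeg", "image/png"}


def image_content_type(path):
    """Return the MIME type for a JPEG or PNG path, or raise ValueError."""
    content_type, _ = mimetypes.guess_type(path)
    if content_type not in SUPPORTED_TYPES:
        raise ValueError(f"{path!r} must be a JPEG or PNG image")
    return content_type


def build_edit_form(deployment_name, prompt):
    """Build the non-file form fields for the edits request."""
    return {"model": deployment_name, "prompt": prompt}


def edit_image(endpoint_url, api_key, deployment_name, prompt, image_path,
               out_path="output.png", timeout=120):
    """Send the image and prompt as multipart form data and save the PNG."""
    import requests  # third-party; imported lazily

    content_type = image_content_type(image_path)
    with open(image_path, "rb") as source:
        response = requests.post(
            endpoint_url,
            headers={"api-key": api_key},
            data=build_edit_form(deployment_name, prompt),
            files={"image": (os.path.basename(image_path), source, content_type)},
            timeout=timeout,
        )
    response.raise_for_status()
    # Assumed response shape: {"data": [{"b64_json": "<base64 PNG>"}]}
    edited = base64.b64decode(response.json()["data"][0]["b64_json"])
    with open(out_path, "wb") as image_file:
        image_file.write(edited)
    return out_path
```

Passing files= makes requests set the multipart Content-Type header and boundary itself, so the sketch does not set that header manually.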
Available MAI image models
Foundry supports use of MAI-Image-2.5-Flash (Preview), MAI-Image-2.5 (Preview), MAI-Image-2 (Preview), and MAI-Image-2e (Preview). These models are suited to the following key use cases:
- Text-to-image generation: Generate high-quality images from natural language prompts, enabling users to translate textual descriptions into visually coherent outputs suitable for a wide range of creative and design use cases.
- Photorealistic image synthesis: Capable of generating realistic imagery with consistent visual structure, making it suitable for concept visualization and content creation scenarios.
- Product, branding and commercial design: Well suited for product imagery, marketing visuals, brand assets, and commercial creative workflows.
- Image-to-image editing: Support precise, controllable edits to existing images, including object removal, replacement, attribute changes, inpainting, text updates, and artifact cleanup while preserving composition and layout.
- High-fidelity portraits: Generate expressive, natural-looking portraits with accurate facial structure, lighting, and texture.
- Accurate text rendering: Improved rendering of text within generated images, including labels, posters, packaging, and signage.
- Visual reasoning: Reason across objects, scene structure, lighting, scale, and spatial positioning to produce consistent outputs, even from ambiguous prompts.
MAI-Image-2.5-Flash (Preview)
MAI-Image-2.5-Flash (Preview) is a text-to-image generation and image-to-image editing model designed to create high-quality, visually rich images from natural language prompts and to perform precise, controllable edits on existing images. It uses a diffusion-based generative approach to progressively refine images, enabling strong alignment between the input text and the generated output. The model is optimized to produce diverse and coherent images across a wide range of creative and design scenarios, making it well suited for tasks such as concept visualization, creative content generation, image editing workflows, and production design.
MAI-Image-2.5 (Preview)
MAI-Image-2.5 (Preview) is a text-to-image generation and image-to-image editing model designed to create high-quality, visually rich images from natural language prompts and to perform precise, controllable edits on existing images. It uses a diffusion-based generative approach to progressively refine images, enabling strong alignment between the input text and the generated output. The model excels at precise, surgical edits with consistency, enabling users and developers to make targeted object edits, adapt layouts, update text, clean up artifacts like motion blur, and preserve visual consistency across iterations.
MAI-Image-2e (Preview)
MAI-Image-2e (Preview) delivers high-quality image generation like MAI-Image-2, but is up to 22% faster and four times more efficient, making it a smart choice for developers building at scale. MAI-Image-2e is best suited for high-volume, fast-turnaround scenarios such as product imagery at scale, marketing variations, branded assets, or any workflow where efficiency and cost per image are key.
MAI-Image-2 (Preview)
MAI-Image-2 (Preview) is a text-to-image generation model designed to create high-quality, visually rich images from natural language prompts. It uses a diffusion-based generative approach to progressively refine images, enabling strong alignment between the input text and the generated output. The model is optimized to produce diverse and coherent images across a wide range of creative and design scenarios, making it well suited for tasks such as concept visualization, creative content generation, and image design workflows.
API endpoints
After you deploy an MAI image model, use the MAI image generations API to generate images and the MAI image edits API for image-to-image edits.
- Image generations API endpoint: A Microsoft-managed endpoint that accepts a text prompt and returns a PNG image. The API endpoint has the following form:
- Image edits API endpoint: A Microsoft-managed endpoint that accepts a JPEG or PNG image and returns a PNG image. The API endpoint has the following form:
Request parameters
The following table lists the request parameters for the image APIs:
| Parameter | API | Type | Description |
|---|---|---|---|
| model | Both | string | The deployment name you assigned when you deployed the model. |
| prompt | Both | string | The text prompt that describes the image to generate or edits to make. Maximum context length: 32,000 tokens. |
| image | Image edits | string | The path to the image you want to edit. The image is passed as multipart form data. Must be in JPEG or PNG format. |
| width | Image generations | integer | Width of the output image in pixels. Minimum: 768. The product of width × height must not exceed 1,048,576. |
| height | Image generations | integer | Height of the output image in pixels. Minimum: 768. The product of width × height must not exceed 1,048,576. |
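The width and height constraints in the table can be checked on the client before a request is sent. The following is a small sketch; the function name validate_dimensions is hypothetical, and the limits are the ones stated in the table.

```python
MIN_DIMENSION = 768           # minimum width and minimum height, in pixels
MAX_TOTAL_PIXELS = 1_048_576  # maximum width x height


def validate_dimensions(width, height):
    """Return a list of problems; an empty list means the size is allowed."""
    problems = []
    if width < MIN_DIMENSION:
        problems.append(f"width {width} is below the minimum of {MIN_DIMENSION}")
    if height < MIN_DIMENSION:
        problems.append(f"height {height} is below the minimum of {MIN_DIMENSION}")
    if width * height > MAX_TOTAL_PIXELS:
        problems.append(
            f"width x height = {width * height} exceeds {MAX_TOTAL_PIXELS}"
        )
    return problems
```

For example, 768 × 1365 (1,048,320 pixels) passes, while 768 × 1366 (1,049,088 pixels) exceeds the pixel budget, so 1365 is the largest side available when the other side is at the 768 minimum.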
The output format is always PNG. The maximum total pixel count is 1,048,576 (equivalent to 1024×1024). Both width and height must be at least 768 pixels. Either dimension can exceed 1024 as long as the total pixel count stays within the limit.
API quotas and limits
MAI image models have the following rate limits, measured in requests per minute (RPM). The tier available to you depends on your subscription and deployment configuration.
| Deployment Type | Tier | MAI-Image-2.5-Flash (RPM) | MAI-Image-2.5 (RPM) | MAI-Image-2e (RPM) | MAI-Image-2 (RPM) |
|---|---|---|---|---|---|
| Global Standard | 0 (Free) | 0 | 0 | 0 | 0 |
| Global Standard | 1 | 2 | 2 | 18 | 9 |
| Global Standard | 2 | 4 | 4 | 30 | 15 |
| Global Standard | 3 | 6 | 6 | 60 | 30 |
| Global Standard | 4 | 8 | 8 | 90 | 45 |
| Global Standard | 5 | 10 | 10 | 120 | 60 |
| Global Standard | 6 | 12 | 12 | 180 | 90 |
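One way to stay under these limits is to space requests evenly on the client. The sketch below copies the Global Standard values from the table into a lookup and derives the minimum gap between requests. Even spacing is a conservative policy chosen for illustration, not a description of how the service enforces the limit, and the names are hypothetical.

```python
# Requests per minute by model and tier, copied from the table above.
GLOBAL_STANDARD_RPM = {
    "MAI-Image-2.5-Flash": {0: 0, 1: 2, 2: 4, 3: 6, 4: 8, 5: 10, 6: 12},
    "MAI-Image-2.5": {0: 0, 1: 2, 2: 4, 3: 6, 4: 8, 5: 10, 6: 12},
    "MAI-Image-2e": {0: 0, 1: 18, 2: 30, 3: 60, 4: 90, 5: 120, 6: 180},
    "MAI-Image-2": {0: 0, 1: 9, 2: 15, 3: 30, 4: 45, 5: 60, 6: 90},
}


def min_seconds_between_requests(model, tier):
    """Return the evenly spaced gap, in seconds, that keeps a client within RPM."""
    rpm = GLOBAL_STANDARD_RPM[model][tier]
    if rpm == 0:
        raise ValueError(f"{model} allows no requests at tier {tier}")
    return 60.0 / rpm
```

At tier 1, for instance, MAI-Image-2.5 allows 2 RPM, which works out to one request every 30 seconds.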
Troubleshoot
Use the following table to resolve common errors when working with MAI image models:
| Error | Cause | Fix |
|---|---|---|
| 401 Unauthorized | Invalid API key or expired token | Regenerate the key in the Azure portal. For Entra ID authentication, ensure the token scope is https://cognitiveservices.azure.com/.default. |
| 404 Not Found | Incorrect deployment name or endpoint URL | Verify the deployment name and endpoint in the Foundry portal under Deployments. |
| 400 Bad Request | width or height below minimum, or total pixel count exceeds maximum | Ensure width and height are each at least 768, and that width × height ≤ 1,048,576. |
| 429 Too Many Requests | Rate limit exceeded | Wait and retry, or request a quota increase. |
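The "wait and retry" advice for 429 responses could be sketched as a small retry wrapper with exponential backoff. Whether the service sends a Retry-After header is an assumption; the sketch uses it when present and numeric, and falls back to doubling delays otherwise. The names, the default delays, and the attempt count are illustrative choices.

```python
import time


def retry_delay(attempt, retry_after=None, base_delay=2.0, max_delay=60.0):
    """Pick how long to wait before the next attempt (attempt starts at 0)."""
    if retry_after is not None:
        try:
            return min(float(retry_after), max_delay)
        except ValueError:
            pass  # not a number of seconds (for example, an HTTP date)
    return min(base_delay * (2 ** attempt), max_delay)


def send_with_retry(send, max_attempts=5, sleep=time.sleep):
    """Call send() until it returns a non-429 response or attempts run out.

    send must return an object with status_code and headers attributes,
    such as a requests.Response.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        response = send()
        if response.status_code != 429:
            return response
        if attempt < max_attempts - 1:
            sleep(retry_delay(attempt, response.headers.get("Retry-After")))
    return response  # still 429 after the final attempt
```

A caller would wrap the request in a function or lambda, for example send_with_retry(lambda: requests.post(url, headers=headers, json=body, timeout=120)), and then inspect the returned response as usual.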
Responsible AI considerations
When using MAI image models in Foundry, consider these responsible AI practices:
- Be aware of known limitations: Despite technical mitigations such as data filtering and content classifiers applied at the system level, image generation models can produce harmful or unexpected content based on user requests. Common risk areas include violent or gory content, sexual content or nudity, depictions of public figures, and replication of trademarked or other protected material.
- Configure content safety: Apply additional mitigations appropriate to your use case, because no generative model is immune to adversarial prompts.
- Comply with applicable terms: Ensure your use of generated images complies with Microsoft’s terms of service and applicable copyright and intellectual property laws.
- Be transparent: Disclose that content is AI-generated when sharing or publishing images.
- Avoid harmful content: Don’t generate content that could be harmful, misleading, or in violation of privacy.