Azure AI Inference beta SDK is deprecated and will be retired on August 26, 2026. Switch to the generally available OpenAI/v1 API with a stable OpenAI SDK. Follow the migration guide to switch to OpenAI/v1, using the SDK for your preferred programming language.
Prerequisites
To complete this article, you need the following:- An Azure subscription. If you’re using GitHub Models, you can upgrade your experience and create an Azure subscription in the process. For more information, see Upgrade from GitHub Models to Foundry Models.
- A Foundry project. This project type is managed under a Foundry resource (formerly known as Azure AI Services resource). If you don’t have a Foundry project, see Create a project for Microsoft Foundry.
- Azure role-based access control (RBAC) permissions to create and manage deployments. You need the Cognitive Services Contributor role or equivalent permissions for the Foundry resource.
- Foundry Models from partners and community require access to Azure Marketplace. Ensure you have the permissions required to subscribe to model offerings. Foundry Models sold by Azure don’t have this requirement.
Troubleshooting
| Error | Cause | Resolution |
|---|---|---|
| Quota exceeded | Your subscription reached the deployment quota for the selected SKU or region. | Check your quota in the Foundry portal or request an increase through Azure support. |
| Authorization failed | The identity used doesn’t have the required RBAC role. | Assign the Cognitive Services Contributor role on the Foundry resource. |
| Model not available | The model isn’t available in your region or subscription. | Run az cognitiveservices account list-models to check available models and regions. |
| Extension not found | The cognitiveservices CLI extension isn’t installed. | Run az extension add -n cognitiveservices to install the extension. |