Azure OpenAI Provider

Azure OpenAI Service provides REST API access to OpenAI's language models, including GPT-4o, GPT-4, and GPT-3.5 Turbo, with enterprise-grade security, compliance, and regional availability. The Microsoft Agent Framework provides multiple ways to work with Azure OpenAI depending on your use case.

Provider Options

Azure OpenAI integration offers three main approaches:

  • Responses Client: direct chat completions with streaming and tools
  • Chat Client: chat-based interactions with conversation management
  • Azure AI Project: persistent agents with Azure AI Foundry projects

Installation

pip install agent-framework --pre
# Azure OpenAI support is included in the core package

Authentication

Azure OpenAI supports multiple authentication methods.

Azure CLI credential (local development):

from azure.identity import AzureCliCredential
from agent_framework.azure import AzureOpenAIResponsesClient

# Run 'az login' first
credential = AzureCliCredential()
client = AzureOpenAIResponsesClient(credential=credential)

Managed identity (Azure-hosted workloads):

from azure.identity import ManagedIdentityCredential
from agent_framework.azure import AzureOpenAIResponsesClient

credential = ManagedIdentityCredential()
client = AzureOpenAIResponsesClient(credential=credential)

API key:

import os

from azure.core.credentials import AzureKeyCredential
from agent_framework.azure import AzureOpenAIResponsesClient

api_key = os.environ["AZURE_OPENAI_API_KEY"]  # never hard-code keys
credential = AzureKeyCredential(api_key)
client = AzureOpenAIResponsesClient(credential=credential)

API keys should only be used for local development. Always use managed identities in production for better security.

Azure OpenAI Responses Client

The Responses Client provides direct access to Azure OpenAI chat completions with support for streaming, function calling, and structured outputs.

Basic Usage

import asyncio
from agent_framework.azure import AzureOpenAIResponsesClient
from azure.identity import AzureCliCredential

async def main():
    # Create agent with Azure OpenAI Responses Client
    agent = AzureOpenAIResponsesClient(
        credential=AzureCliCredential()
    ).as_agent(
        instructions="You are a helpful assistant.",
    )
    
    # Non-streaming response
    result = await agent.run("What is the capital of France?")
    print(result)

asyncio.run(main())

Configuration

AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com
AZURE_OPENAI_RESPONSES_DEPLOYMENT_NAME=gpt-4o
AZURE_OPENAI_API_VERSION=2024-10-01-preview
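The client reads these variables from the environment. As a sketch of that resolution (the helper name and fallback values here are illustrative, not part of the framework):

```python
import os

# Illustrative helper (not a framework API): resolve the Azure OpenAI
# settings shown above from the environment, with fallbacks for the
# optional ones and an early failure for the required endpoint.
def load_azure_openai_config() -> dict:
    endpoint = os.environ.get("AZURE_OPENAI_ENDPOINT")
    if not endpoint:
        raise RuntimeError("Set AZURE_OPENAI_ENDPOINT before creating a client")
    return {
        "endpoint": endpoint,
        "deployment": os.environ.get(
            "AZURE_OPENAI_RESPONSES_DEPLOYMENT_NAME", "gpt-4o"
        ),
        "api_version": os.environ.get(
            "AZURE_OPENAI_API_VERSION", "2024-10-01-preview"
        ),
    }
```

Failing fast on a missing endpoint gives a clearer error than a connection failure deep inside the client.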

Streaming Responses

import asyncio
from agent_framework.azure import AzureOpenAIResponsesClient
from azure.identity import AzureCliCredential

async def main():
    agent = AzureOpenAIResponsesClient(
        credential=AzureCliCredential()
    ).as_agent(
        instructions="You are a helpful assistant.",
    )
    
    # Streaming response
    query = "Write a short story about AI agents."
    print("Agent: ", end="", flush=True)
    async for chunk in agent.run_stream(query):
        if chunk.text:
            print(chunk.text, end="", flush=True)
    print()

asyncio.run(main())

Function Calling

import asyncio
from typing import Annotated
from agent_framework import tool
from agent_framework.azure import AzureOpenAIResponsesClient
from azure.identity import AzureCliCredential
from pydantic import Field

@tool(approval_mode="never_require")  # Use "always_require" in production
def get_weather(
    location: Annotated[str, Field(description="City name")],
) -> str:
    """Get the weather for a location."""
    return f"Weather in {location}: Sunny, 72°F"

async def main():
    agent = AzureOpenAIResponsesClient(
        credential=AzureCliCredential()
    ).as_agent(
        instructions="You are a weather assistant.",
        tools=[get_weather],
    )
    
    result = await agent.run("What's the weather in Seattle?")
    print(result)

asyncio.run(main())
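Under the hood, function calling is a dispatch loop: the model returns a tool name plus JSON-encoded arguments, and the runtime invokes the matching Python function and feeds the result back. A minimal conceptual sketch (the registry and `dispatch` helper are illustrative, not framework internals):

```python
import json

def get_weather(location: str) -> str:
    """Toy tool from the example above."""
    return f"Weather in {location}: Sunny, 72°F"

# The framework builds a registry like this from the tools=[...] list.
TOOLS = {"get_weather": get_weather}

def dispatch(tool_name: str, arguments_json: str) -> str:
    """Invoke the named tool with the model's JSON-encoded arguments."""
    return TOOLS[tool_name](**json.loads(arguments_json))
```

The `Annotated`/`Field` metadata on the real tool is what the framework uses to generate the JSON schema the model sees.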

Azure OpenAI Chat Client

The Chat Client provides chat-based interactions with conversation management.

import asyncio
from agent_framework.azure import AzureOpenAIChatClient
from azure.identity import AzureCliCredential

async def main():
    agent = AzureOpenAIChatClient(
        credential=AzureCliCredential()
    ).as_agent(
        instructions="You are a helpful assistant.",
    )
    
    result = await agent.run("Hello, how are you?")
    print(result)

asyncio.run(main())

Azure AI Project Agent Provider

For persistent agents managed through Azure AI Foundry projects, use the AzureAIProjectAgentProvider.

Setup

  1. Create an Azure AI Foundry project at ai.azure.com
  2. Get your project endpoint from the project settings
  3. Deploy a model to your project

Basic Usage

import asyncio
from agent_framework.azure import AzureAIProjectAgentProvider
from azure.identity.aio import AzureCliCredential

async def main():
    async with (
        AzureCliCredential() as credential,
        AzureAIProjectAgentProvider(credential=credential) as provider
    ):
        # Create a persistent agent
        agent = await provider.create_agent(
            name="WeatherAgent",
            instructions="You are a weather assistant.",
            tools=[...],
        )
        
        # Use the agent
        result = await agent.run("What's the weather?")
        print(result)

asyncio.run(main())

Configuration

AZURE_AI_PROJECT_ENDPOINT=https://your-project.services.ai.azure.com
AZURE_AI_MODEL_DEPLOYMENT_NAME=gpt-4o

Working with Existing Agents

import asyncio
from agent_framework.azure import AzureAIProjectAgentProvider
from azure.identity.aio import AzureCliCredential

async def main():
    async with (
        AzureCliCredential() as credential,
        AzureAIProjectAgentProvider(credential=credential) as provider
    ):
        # Get an existing agent by ID
        agent = await provider.get_agent(agent_id="your-agent-id")
        
        # Use the agent
        result = await agent.run("Hello!")
        print(result)

asyncio.run(main())

Available Models

Azure OpenAI supports various model families:
Model Family   Models                          Best For
GPT-4o         gpt-4o, gpt-4o-mini             Latest multimodal models; vision, function calling
GPT-4          gpt-4, gpt-4-32k                Complex reasoning, high-quality outputs
GPT-3.5        gpt-35-turbo                    Fast, cost-effective chat
o1/o3          o1-preview, o1-mini, o3-mini    Advanced reasoning tasks
Model availability varies by region. Check the Azure OpenAI model availability page for details.

Advanced Features

Code Interpreter

Agents created through the Azure AI Project provider support a code interpreter tool for Python code execution:

from agent_framework.azure import code_interpreter_tool

agent = await provider.create_agent(
    name="DataAnalyst",
    instructions="You are a data analyst.",
    tools=[code_interpreter_tool()],
)

File Search

Enable file search for RAG capabilities:

from agent_framework.azure import file_search_tool

agent = await provider.create_agent(
    name="DocumentAssistant",
    instructions="You help with documents.",
    tools=[file_search_tool()],
)

Structured Outputs

from pydantic import BaseModel

class WeatherResponse(BaseModel):
    location: str
    temperature: float
    condition: str

agent = client.as_agent(
    instructions="You are a weather assistant.",
    response_format=WeatherResponse,
)
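With response_format set, the model is constrained to emit JSON matching the schema, which is then parsed into the typed object. Conceptually (using a stdlib dataclass and an example payload purely for illustration):

```python
import json
from dataclasses import dataclass

@dataclass
class WeatherResponse:
    location: str
    temperature: float
    condition: str

# Example of the kind of JSON the model emits when constrained by the schema.
raw = '{"location": "Paris", "temperature": 18.5, "condition": "Cloudy"}'

# Parsing it yields a typed object instead of free-form text.
parsed = WeatherResponse(**json.loads(raw))
```

In the real flow the framework performs this validation for you; the point is that downstream code can rely on typed fields rather than string parsing.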

Best Practices

Always use Azure managed identities for authentication in production environments. This eliminates the need to store API keys and provides better security through Microsoft Entra ID.

from azure.identity import ManagedIdentityCredential

credential = ManagedIdentityCredential()

  • Use the Responses Client for simple chat completions and streaming
  • Use the Chat Client when you need conversation management
  • Use the Azure AI Project Provider for persistent agents with managed state

Azure OpenAI enforces rate limits; implement retry logic with exponential backoff in production applications.

Models are priced differently: use gpt-4o-mini for development and testing, and reserve gpt-4o for production workloads that require maximum quality.

Deploy models in regions close to your users for lower latency. Azure OpenAI is available in multiple regions worldwide.

Troubleshooting

If you see authentication errors:
  1. Run az login to authenticate with Azure CLI
  2. Ensure your account has access to the Azure OpenAI resource
  3. Check that the correct subscription is selected: az account show
  4. Verify RBAC roles: you need "Cognitive Services OpenAI User" or higher

If you hit rate limits on your deployment:
  1. Check your quota in the Azure Portal
  2. Implement exponential backoff retry logic
  3. Consider requesting a quota increase
  4. Use multiple deployments to distribute load

If you get model not found errors:
  1. Verify the deployment name matches exactly
  2. Check that the model is deployed in your resource
  3. Ensure you're using the correct endpoint
  4. Verify the API version is compatible with the model
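The exponential-backoff retry logic recommended above can be sketched in plain Python; `call` stands in for any Azure OpenAI request, and the parameters are illustrative defaults rather than framework settings:

```python
import random
import time

def with_backoff(call, max_retries: int = 5, base_delay: float = 1.0):
    """Retry `call` with exponential backoff plus jitter."""
    for attempt in range(max_retries):
        try:
            return call()
        except Exception:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the last error
            # 1s, 2s, 4s, ... plus a little jitter to avoid thundering herds
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
```

In production you would catch only throttling errors (HTTP 429) rather than bare Exception, and honor the Retry-After header when the service provides one.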

Next Steps

  • Function Tools: learn how to add function calling to your agents
  • Sessions & Memory: manage multi-turn conversations and sessions
  • Workflows: build complex multi-agent workflows
  • Deploy to Azure: deploy your agents to Azure Functions
