
Overview

This portfolio integrates Azure AI’s chat completion API to power an intelligent chat assistant. The integration uses Azure’s serverless inference endpoint through a Netlify Function proxy.

Azure AI Setup

1. Create Azure Account

Sign up for a Microsoft Azure account at azure.microsoft.com
Azure offers free credits for new users, which is perfect for testing and small-scale deployments.
2. Access Azure AI Services

Navigate to Azure AI Studio:
  1. Go to Azure AI Studio
  2. Sign in with your Azure account
  3. Create a new project or select an existing one
3. Deploy a Model

Deploy a chat completion model:
  1. Navigate to Deployments in your Azure AI project
  2. Click Create new deployment
  3. Select a model (e.g., gpt-4o-mini for cost-effective inference)
  4. Configure deployment settings
  5. Deploy the model
4. Get API Credentials

Retrieve your API token:
  1. Go to your deployment details
  2. Copy the API Key (this is your API_TOKEN)
  3. Note the Endpoint URL (should be https://models.inference.ai.azure.com)

API Endpoint

The integration uses Azure’s serverless inference endpoint:
https://models.inference.ai.azure.com/chat/completions
This is a unified endpoint that routes requests to your deployed models based on the model name in your request.

Integration Architecture

The Netlify Function acts as a secure proxy:
  1. Frontend sends chat requests to /.netlify/functions/chat
  2. Netlify Function adds authentication and forwards to Azure AI
  3. Azure AI processes the request and returns completions
  4. Netlify Function proxies the response back to the frontend

Configuration

Environment Variables

Set your Azure AI API token in Netlify:
# Navigate to:
# Site Settings > Environment Variables

API_TOKEN=your_azure_ai_api_token_here
Never commit your .env file or API tokens to version control. Add .env to your .gitignore file.
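A missing environment variable is easier to debug when the function fails fast with a clear message instead of forwarding an empty token and getting a 401 back. Here is a minimal sketch; the helper name `requireEnv` is an assumption, not part of the integration:

```typescript
// Hypothetical helper: fail fast if a required variable is missing,
// so a misconfigured deploy surfaces a clear error instead of a 401.
export function requireEnv(name: string): string {
  const value = process.env[name];
  if (!value) {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// Usage at the top of the function handler:
// const apiToken = requireEnv('API_TOKEN');
```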

Function Implementation

The Netlify Function handles authentication automatically:
netlify/functions/chat.ts
import type { Handler } from '@netlify/functions';

export const handler: Handler = async (event) => {
  const response = await fetch('https://models.inference.ai.azure.com/chat/completions', {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      'Authorization': `Bearer ${process.env.API_TOKEN}`
    },
    body: event.body
  });
  return { statusCode: response.status, body: await response.text() };
};
See the full implementation in serverless-functions.mdx.

Request Format

Send requests following the OpenAI chat completion format:
{
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant."
    },
    {
      "role": "user",
      "content": "Hello, how are you?"
    }
  ],
  "model": "gpt-4o-mini",
  "max_tokens": 1000,
  "temperature": 0.7
}

Request Parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| messages | Array | Yes | Array of message objects with role and content |
| model | String | Yes | Model identifier (e.g., gpt-4o-mini) |
| max_tokens | Number | No | Maximum tokens in response (default: 1000) |
| temperature | Number | No | Randomness level, 0-2 (default: 0.7) |
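Validating these fields in the Netlify Function before forwarding avoids burning tokens on malformed requests. A minimal sketch, assuming the shape described in the table above (the `validateChatRequest` name is hypothetical):

```typescript
type ChatMessage = { role: 'system' | 'user' | 'assistant'; content: string };

interface ChatRequest {
  messages: ChatMessage[];
  model: string;
  max_tokens?: number;
  temperature?: number;
}

// Hypothetical validator: reject malformed bodies before forwarding upstream.
export function validateChatRequest(body: unknown): body is ChatRequest {
  if (typeof body !== 'object' || body === null) return false;
  const req = body as Partial<ChatRequest>;
  if (!Array.isArray(req.messages) || req.messages.length === 0) return false;
  if (typeof req.model !== 'string' || req.model.length === 0) return false;
  if (req.max_tokens !== undefined && typeof req.max_tokens !== 'number') return false;
  if (req.temperature !== undefined &&
      (typeof req.temperature !== 'number' || req.temperature < 0 || req.temperature > 2)) {
    return false;
  }
  // Every message needs a known role and string content
  return req.messages.every(
    m => typeof m?.content === 'string' && ['system', 'user', 'assistant'].includes(m?.role)
  );
}
```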

Response Format

Azure AI returns responses in OpenAI-compatible format:
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1677652288,
  "model": "gpt-4o-mini",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! I'm doing well, thank you for asking."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 20,
    "completion_tokens": 15,
    "total_tokens": 35
  }
}
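When consuming this response, it is worth guarding against an empty `choices` array and logging the `usage` block for cost tracking. A minimal sketch (the helper name `extractReply` is an assumption):

```typescript
interface ChatCompletion {
  choices: { message: { role: string; content: string }; finish_reason: string }[];
  usage?: { prompt_tokens: number; completion_tokens: number; total_tokens: number };
}

// Hypothetical helper: pull out the assistant's text and log token usage,
// guarding against a response with no choices.
export function extractReply(completion: ChatCompletion): string {
  const choice = completion.choices?.[0];
  if (!choice) {
    throw new Error('Completion contained no choices');
  }
  if (completion.usage) {
    console.log(`Tokens used: ${completion.usage.total_tokens}`);
  }
  return choice.message.content;
}
```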

Frontend Integration Example

import { useState } from 'react';

type Message = { role: 'system' | 'user' | 'assistant'; content: string };

function useChat() {
  const [messages, setMessages] = useState<Message[]>([]);
  const [loading, setLoading] = useState(false);

  const sendMessage = async (content: string) => {
    setLoading(true);
    // Build the next history once so the request and the state update agree
    const next: Message[] = [...messages, { role: 'user', content }];

    try {
      const response = await fetch('/.netlify/functions/chat', {
        method: 'POST',
        headers: { 'Content-Type': 'application/json' },
        body: JSON.stringify({
          messages: next,
          model: 'gpt-4o-mini',
          max_tokens: 1000
        })
      });

      if (!response.ok) {
        throw new Error(`Request failed with status ${response.status}`);
      }

      const data = await response.json();
      const assistantMessage: Message = data.choices[0].message;

      setMessages([...next, assistantMessage]);
    } catch (error) {
      console.error('Chat error:', error);
    } finally {
      setLoading(false);
    }
  };

  return { messages, sendMessage, loading };
}

Available Models

Azure AI supports various models. Popular options:
  • gpt-4o-mini: Cost-effective, fast responses, good for general chat
  • gpt-4o: More capable, better for complex tasks
  • gpt-4-turbo: Advanced reasoning and longer context
Check the Azure AI Model Catalog for the latest available models and pricing.

Cost Optimization

1. Choose the Right Model

Use gpt-4o-mini for general chat interactions to minimize costs while maintaining quality.
2. Limit Token Usage

Set reasonable max_tokens values to prevent excessive token consumption:
max_tokens: 1000  // Adjust based on your use case
3. Monitor Usage

Track your API usage in the Azure portal to avoid unexpected charges.
4. Implement Rate Limiting

Add rate limiting on the frontend or in the Netlify Function to prevent abuse.
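Inside the Netlify Function, a simple fixed-window limiter keyed by client IP can serve as a first line of defense. This is a minimal sketch under one important caveat: each function instance keeps its own in-memory map, so it is a best-effort guard, not a hard quota (the `allowRequest` helper is hypothetical):

```typescript
// Hypothetical in-memory limiter: at most `limit` requests per IP per window.
// Each function instance keeps its own map, so this is best-effort only.
const hits = new Map<string, { count: number; windowStart: number }>();

export function allowRequest(
  ip: string,
  limit = 10,
  windowMs = 60_000,
  now = Date.now()
): boolean {
  const entry = hits.get(ip);
  // Start a fresh window if none exists or the old one has expired
  if (!entry || now - entry.windowStart >= windowMs) {
    hits.set(ip, { count: 1, windowStart: now });
    return true;
  }
  if (entry.count >= limit) return false;
  entry.count += 1;
  return true;
}
```

For a hard guarantee across instances, the counter would need to live in shared storage (for example a hosted key-value store) instead of process memory.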

Troubleshooting

Authentication Errors

If you receive 401 Unauthorized errors:
  1. Verify your API_TOKEN environment variable is set correctly in Netlify
  2. Check that the token hasn’t expired
  3. Ensure you’re using the correct token (not the endpoint URL)

Model Not Found

If you get "model not found" errors:
  1. Verify the model is deployed in your Azure AI project
  2. Check the model name matches exactly
  3. Ensure your API token has access to the model

Timeout Issues

If requests timeout:
  1. Reduce max_tokens to speed up generation
  2. Check Azure AI service status
  3. Consider upgrading Netlify plan for longer function timeouts
Netlify Functions have a 10-second timeout on free plans, so large responses may time out.
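Rather than letting Netlify kill the function mid-request, the upstream call can be raced against a slightly shorter deadline so the client gets a clean error. A minimal sketch (the `withTimeout` wrapper and the 9-second budget are assumptions):

```typescript
// Hypothetical wrapper: race a promise against a deadline so the function
// returns a clean error before the platform timeout fires.
export function withTimeout<T>(promise: Promise<T>, ms: number): Promise<T> {
  return new Promise<T>((resolve, reject) => {
    const timer = setTimeout(
      () => reject(new Error(`Timed out after ${ms}ms`)),
      ms
    );
    promise.then(
      value => { clearTimeout(timer); resolve(value); },
      err => { clearTimeout(timer); reject(err); }
    );
  });
}

// Usage: leave headroom under Netlify's 10s limit
// const response = await withTimeout(fetch(url, options), 9_000);
```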

Security Best Practices

  • Never expose API tokens in frontend code or version control
  • Use environment variables for all sensitive credentials
  • Implement rate limiting to prevent abuse and unexpected costs
  • Restrict CORS to your specific domain in production
  • Monitor API usage regularly through Azure portal
  • Rotate tokens periodically for enhanced security
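For the CORS point above, the Netlify Function can echo the `Origin` header back only when it appears on an explicit allowlist. A minimal sketch; the domain below is a placeholder, and the `corsHeaders` helper is an assumption:

```typescript
// Hypothetical CORS helper: only echo back origins on an explicit allowlist.
// Replace the placeholder domain with your production domain.
const ALLOWED_ORIGINS = new Set(['https://example.com']);

export function corsHeaders(origin: string | undefined): Record<string, string> {
  const headers: Record<string, string> = {
    'Access-Control-Allow-Methods': 'POST, OPTIONS',
    'Access-Control-Allow-Headers': 'Content-Type'
  };
  // Omitting Access-Control-Allow-Origin entirely blocks cross-origin callers
  if (origin && ALLOWED_ORIGINS.has(origin)) {
    headers['Access-Control-Allow-Origin'] = origin;
  }
  return headers;
}
```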
