Create an Azure AI Search Service
Azure AI Search is an information retrieval platform for the enterprise. It supports traditional search and conversational, AI-driven search for “chat with your data” experiences. The easiest way to create a search service is through the Azure portal, covered in this guide.Before You Start
Some properties are fixed for the lifetime of the search service. Decide on these before creating your service:| Property | Description |
|---|---|
| Name | Becomes part of the URL endpoint (must be unique) |
| Region | Determines data residency and feature availability |
| Tier | Determines infrastructure, service limits, and billing |
| Compute type | Standard VMs (recommended) or confidential VMs |
Prerequisites
- An Azure subscription (start free trial)
- Permissions to create resources in your subscription
Each Azure subscription can have one free Azure AI Search service, which is perfect for evaluation and learning.
Create Your Search Service
Sign in to Azure Portal
Navigate to the Azure portal and sign in with your Azure account
Set Resource Group
Choose an existing resource group or create a new one:
- Resource groups help organize related resources
- They enable cost tracking and management
- Make cleanup easier when testing
Name Your Service
Enter a unique service name:
- Must be 2-60 characters
- Use only lowercase letters, digits, and dashes
- Cannot start or end with dashes
- Cannot contain consecutive dashes
- Becomes part of your endpoint:
https://your-service-name.search.windows.net
Select Region
Choose the region where your service will be hosted. Consider:
- Proximity to your users and data sources
- Feature availability (some features require specific regions)
- Capacity availability (some regions may be at capacity)
- Agentic retrieval region requirements
Choose Pricing Tier
Select your pricing tier based on your needs. See Pricing Tiers below for details.
Select Compute Type
- Standard (recommended): Standard Azure VMs with data encrypted at rest and in transit
- Confidential: Hardware-based trusted execution environment (10% surcharge)
Pricing Tiers
Azure AI Search offers multiple pricing tiers:- Free
- Basic
- Standard
- Storage Optimized
Free Tier
- One per subscription
- Perfect for learning and evaluation
- 50 MB storage limit
- 3 indexes maximum
- No SLA
- May be deleted if inactive for extended periods
Complete most quickstarts and tutorials on the Free tier.
Services created after April 3, 2024 have larger partitions and higher vector quotas at every billable tier.
Naming Guidelines
Valid Service Names
Invalid Service Names
Region Selection
Checklist for Choosing a Region
Feature Availability
Feature Availability
Check if your target region supports:
- Your desired pricing tier
- AI enrichment capabilities
- Integrated vectorization
- Multimodal search
- Agentic retrieval (preview)
Data Residency
Data Residency
- Keep services in the same region to minimize latency
- No bandwidth charges for same-region services
- Consider compliance and data sovereignty requirements
AI Integration
AI Integration
For AI enrichment with Azure AI services:
- Key-based connections require both services in the same region
- Keyless connections (preview) allow different regions
- Check Azure Vision availability for multimodal embeddings
Business Continuity
Business Continuity
For high availability:
- Create services in multiple regions
- Use 2+ replicas per service for availability zones
- Example: East US and West US for North America coverage
Configure Authentication
By default, new services use API key authentication. For better security, enable role-based access control (RBAC).Choose Authentication Method
Select one of:
- API Key: Traditional key-based authentication
- Role-based access control: Azure AD authentication (recommended)
- Both: Allow both methods during migration
Scale Your Service
After deployment, you can scale your service to meet demand.Replicas and Partitions
-
Replicas: Copies of your index for high availability and query throughput
- Minimum 2 for read-only SLA
- Minimum 3 for read/write SLA
- Distribute across availability zones automatically
-
Partitions: Divide your index for storage and parallel processing
- More storage capacity
- Higher indexing throughput
- Parallel query execution
Scaling Example
Scaling is only available on billable tiers. Free tier services cannot be scaled.
Compute Type Selection
Standard Compute (Default)
- Standard Azure VMs
- Data encrypted at rest and in transit
- Suitable for most workloads
- Base pricing
- Building typical search applications
- Cost optimization is important
- Standard security is sufficient
Confidential Compute
- Azure confidential computing VMs
- Hardware-based trusted execution environment
- Data protected in use (during processing)
- 10% surcharge
- Limited regional availability
- Some features restricted
- Processing highly sensitive data
- Compliance requires data-in-use protection
- Regulatory requirements mandate confidential computing
Monitor Your Service
Once deployed, monitor service health and usage:- Service Dashboard: View overview metrics
- Metrics: Query latency, throttling, index size
- Logs: Diagnostic logs for troubleshooting
- Alerts: Set up notifications for issues
Cost Management
Optimize your search service costs:Right-Size Resources
Start small and scale up based on actual usage metrics
Monitor Usage
Use Azure Cost Management to track spending
Optimize Indexing
Schedule indexer runs during off-peak hours
Review Replicas
Scale down replicas when high availability isn’t needed
When to Add a Second Service
Consider creating additional services for:- Multi-region deployment: Reduce latency for global users
- Disaster recovery: Ensure service availability during regional outages
- Environment isolation: Separate dev, test, and production
- Multi-tenant architectures: Isolate customer data
- Workload separation: Different SLAs for different applications
You cannot separate indexing and querying workloads. Both operations run on the same service.
Request Additional Quota
Azure subscriptions have limits on the number of search services:- Navigate to Quotas service in Azure portal
- Filter by Search services
- Select your subscription and region
- Click Request adjustment
- Enter new limit (must exceed current)
- Submit request
Alternative Creation Methods
Besides the Azure portal, you can create services using:Azure PowerShell
Azure CLI
ARM Template
Bicep
Next Steps
Create an Index
Build your first search index
Import Data
Load data from Azure services
Security
Configure authentication and access control
Monitoring
Set up monitoring and alerts