Model Deployment
Deploy models in Microsoft Foundry using multiple options optimized for different scenarios.
Deployment Methods
Serverless API Deployment
Characteristics:
- Pay-per-token billing
- Microsoft-managed infrastructure
- Automatic scaling
- No capacity planning
Provisioned Throughput
Characteristics:
- Reserved capacity, measured in provisioned throughput units (PTUs)
- Predictable cost and performance
- Dedicated resources
- Fungible across models: the same PTU capacity can back deployments of different supported models
Managed Compute
Characteristics:
- Deploy to Azure VMs
- Billed for VM hours
- Supports open-source models
- Full infrastructure control
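The three deployment methods above can be read as a rough decision rule. The sketch below is only one possible reading of the listed characteristics, not an official selection procedure: the `Requirements` flags and the `suggest_method` helper are hypothetical names introduced for illustration and are not part of any Foundry API.

```python
from dataclasses import dataclass
from enum import Enum


class DeploymentMethod(Enum):
    SERVERLESS_API = "Serverless API Deployment"
    PROVISIONED_THROUGHPUT = "Provisioned Throughput"
    MANAGED_COMPUTE = "Managed Compute"


@dataclass(frozen=True)
class Requirements:
    # Hypothetical flags for illustration; not part of any Foundry API.
    needs_infrastructure_control: bool = False
    needs_predictable_performance: bool = False


def suggest_method(req: Requirements) -> DeploymentMethod:
    """Map workload requirements to a deployment method (assumed heuristic)."""
    # Full infrastructure control is listed only under Managed Compute.
    if req.needs_infrastructure_control:
        return DeploymentMethod.MANAGED_COMPUTE
    # Predictable cost and performance is listed under Provisioned Throughput.
    if req.needs_predictable_performance:
        return DeploymentMethod.PROVISIONED_THROUGHPUT
    # Otherwise default to pay-per-token with no capacity planning.
    return DeploymentMethod.SERVERLESS_API


choice = suggest_method(Requirements(needs_predictable_performance=True))
print(choice.value)  # prints "Provisioned Throughput"
```

Real decisions usually also weigh cost, expected volume, and which models each method supports, so treat the ordering of the checks as an assumption.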
Deployment Process
Regional Considerations
- Model availability varies by region
- Check Region Support
- Consider data residency requirements
- Evaluate latency for global users
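The regional considerations above can be combined into a simple filter-then-rank step. The following is a minimal sketch under stated assumptions: the availability table, the allowed-region set, and the latency figures are made-up inputs you would supply yourself, and `pick_region` is a hypothetical helper, not a Foundry function.

```python
from typing import Dict, Optional, Set


def pick_region(
    model: str,
    availability: Dict[str, Set[str]],
    allowed_regions: Set[str],
    latency_ms: Dict[str, float],
) -> Optional[str]:
    """Return the lowest-latency region that offers the model and meets residency rules."""
    # Keep regions where the model is available, residency permits deployment,
    # and a latency measurement exists. Sorting makes tie-breaking deterministic.
    candidates = sorted(
        region
        for region in availability.get(model, set())
        if region in allowed_regions and region in latency_ms
    )
    if not candidates:
        return None
    return min(candidates, key=lambda region: latency_ms[region])


# Illustrative data only; real availability must be checked per model and region.
availability = {"example-model": {"eastus", "westeurope", "swedencentral"}}
allowed = {"westeurope", "swedencentral"}  # e.g. an EU data residency requirement
latency = {"eastus": 20.0, "westeurope": 45.0, "swedencentral": 60.0}

print(pick_region("example-model", availability, allowed, latency))  # prints "westeurope"
```

Residency is applied as a hard filter before latency is considered, so a faster region that violates the residency requirement is never selected.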
Model Lifecycle
- GA: Full support and SLA
- Deprecation Notice: 6-12 months of advance warning
- Deprecated: No new deployments
- Retired: Model unavailable
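The lifecycle stages above can be modelled as an ordered enumeration that deployment tooling checks before acting. This is a sketch of one interpretation of the list: the `LifecycleStage` enum and the helper functions are hypothetical, and the assumption that new deployments remain possible during the notice period is inferred from the "Deprecated: No new deployments" entry rather than stated outright.

```python
from enum import Enum


class LifecycleStage(Enum):
    GA = "GA"
    DEPRECATION_NOTICE = "Deprecation Notice"
    DEPRECATED = "Deprecated"
    RETIRED = "Retired"


def can_create_deployment(stage: LifecycleStage) -> bool:
    """New deployments are assumed to be allowed until the model is deprecated."""
    return stage in (LifecycleStage.GA, LifecycleStage.DEPRECATION_NOTICE)


def is_available(stage: LifecycleStage) -> bool:
    """A model is unavailable only once it is retired."""
    return stage is not LifecycleStage.RETIRED


def should_plan_migration(stage: LifecycleStage) -> bool:
    """Any stage past GA is a signal to plan a move to a newer model."""
    return stage is not LifecycleStage.GA


for stage in LifecycleStage:
    print(stage.value, can_create_deployment(stage), is_available(stage))
```

A check like `should_plan_migration` is most useful during the notice period, when there is still time to test a replacement model.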