Custom Vision
Azure AI Custom Vision is an image recognition service that lets you build, deploy, and improve custom image classification and object detection models. Train models with your own labeled images to detect specific objects or classify images according to your custom categories.Custom Vision is retiring. Existing applications can continue to use the service, but new projects should consider Computer Vision or other alternatives.
What is Custom Vision?
Custom Vision uses machine learning to analyze images for features you specify. You provide labeled training images, and the service trains a model customized to your specific use case. Once trained, you can use the model to classify new images or detect objects.Image Classification
Apply one or more labels to entire images based on visual characteristics
Object Detection
Detect and locate specific objects within images with bounding boxes
Key Features
Image Classification
Apply custom labels to images:- Multi-class Classification: Each image gets one label
- Multi-label Classification: Images can have multiple labels
- Train with your own categories
- Minimum 5 images per label recommended
- 50+ images per label for best results
- Categorize products by type
- Classify defects in manufacturing
- Identify plant species
- Categorize documents by type
Object Detection
Detect and locate objects in images:- Draw bounding boxes around objects
- Label multiple objects per image
- Return coordinates for each detection
- Confidence scores for each object
- Minimum 15 images per object recommended
- Detect defects on products
- Count items on shelves
- Identify parts in images
- Locate logos in photos
How It Works
Domain Optimization
Custom Vision offers specialized domains optimized for specific scenarios:Classification Domains
- General: All-purpose classification
- General (compact): Optimized for mobile and edge devices
- Food: Food and dishes
- Landmarks: Famous landmarks and buildings
- Retail: Retail products and items
- Adult: Adult content detection
Object Detection Domains
- General: All-purpose object detection
- General (compact): Optimized for mobile and edge
- Logo: Brand and logo detection
- Products on Shelves: Retail shelf products
Making Predictions
Use the prediction API to classify images or detect objects:Classification Prediction
Object Detection Prediction
Export Models
Export trained models for offline use:- CoreML: iOS applications
- TensorFlow: Android and custom deployments
- ONNX: Cross-platform inference
- TensorFlow Lite: Mobile devices
- Dockerfile: Container deployments
Custom Vision Portal
The Custom Vision portal provides a web interface for:- Creating and managing projects
- Uploading and labeling images
- Training models
- Testing predictions
- Viewing performance metrics
- Exporting models
- Managing API keys
Training Data Requirements
Image Classification
- Minimum: 5 images per tag
- Recommended: 50+ images per tag
- Variety: Include different angles, lighting, backgrounds
- Balance: Similar number of images per tag
- Quality: Clear, well-lit images
Object Detection
- Minimum: 15 images per object
- Recommended: 50+ images per object
- Bounding boxes: Tight boxes around objects
- Variety: Different positions, sizes, orientations
- Occlusion: Include partially hidden objects
Performance Metrics
Evaluate model performance:- Precision: Percentage of correct predictions
- Recall: Percentage of objects found
- mAP: Mean average precision (object detection)
- Threshold: Adjustable confidence threshold
Use Cases
Manufacturing
Manufacturing
- Quality control and defect detection
- Product classification on assembly lines
- Part identification and sorting
- Visual inspection automation
Retail
Retail
- Product recognition and categorization
- Shelf monitoring and planogram compliance
- Visual search for similar products
- Inventory management
Healthcare
Healthcare
- Medical image classification
- Skin condition identification
- X-ray and scan analysis
- Equipment and instrument detection
Agriculture
Agriculture
- Plant disease detection
- Crop type identification
- Pest detection
- Yield estimation
SDK Support
Python
C#
Java
Maven packages for training and prediction
JavaScript
Input Requirements
- Formats: JPEG, PNG, BMP, GIF
- File Size: Less than 6 MB (training), 4 MB (prediction)
- Dimensions: Minimum 256 pixels on shortest side
- Maximum images: 100,000 per project
- Maximum tags: 500 per project
Pricing
- Free Tier (F0):
- 2 projects
- 5,000 training images per project
- 10,000 predictions per month
- Standard Tier (S0):
- Unlimited projects
- 100,000 training images per project
- Pay per transaction
Getting Started
Access Portal
Go to customvision.ai and sign in
Best Practices
- Use 50+ images per tag for better accuracy
- Include variety in training data (angles, lighting, backgrounds)
- Balance training data across tags
- Use appropriate domain for your scenario
- Test with images not in training set
- Retrain with incorrectly classified images
- Adjust confidence threshold based on use case
Migration Guidance
With Custom Vision retiring, consider these alternatives:- Computer Vision: For general image analysis and pre-built models
- Azure Machine Learning: For advanced custom model training
- Custom models: Export your model before retirement