General Questions
What is CVAT?
CVAT (Computer Vision Annotation Tool) is an open-source, interactive tool for annotating images and videos for computer vision applications. It’s used by tens of thousands of users worldwide to create high-quality training datasets for machine learning models.Is CVAT free?
Yes! CVAT is open-source and released under the MIT License. You can:- Use cvat.ai for free with limitations (10 tasks, 500MB storage)
- Self-host CVAT for unlimited usage
- Subscribe to CVAT Cloud for additional features
- Get Enterprise support for self-hosted deployments
What annotation types does CVAT support?
CVAT supports:- Bounding boxes (rectangles)
- Rotated bounding boxes
- Polygons
- Polylines
- Points
- Ellipses
- Cuboids (3D boxes)
- Skeletons (keypoints)
- Masks (segmentation)
- Tags (image-level labels)
What file formats does CVAT support?
CVAT supports 30+ annotation formats including:- CVAT (XML)
- COCO (JSON)
- YOLO / Ultralytics YOLO
- Pascal VOC
- KITTI
- Cityscapes
- LabelMe
- Datumaro
- TFRecord
- And many more!
Installation
What are the system requirements?
Minimum requirements:- RAM: 8GB (16GB recommended)
- CPU: 4 cores (8 cores recommended)
- Storage: 20GB+ depending on data size
- Docker: 20.10.0+
- Docker Compose: 1.29.0+
- RAM: 32GB+
- CPU: 16+ cores
- Storage: SSD with sufficient space for your datasets
How do I install CVAT?
For local installation:Can I use CVAT without Docker?
While it’s technically possible, Docker is strongly recommended because:- CVAT has many dependencies (PostgreSQL, Redis, ClickHouse, etc.)
- Docker ensures consistent environments
- Manual setup is complex and error-prone
How do I update CVAT?
For Docker Compose installations:Can I install CVAT on Windows?
Yes, using:- WSL2 (Windows Subsystem for Linux) - Recommended
- Docker Desktop for Windows
Usage
How do I create my first annotation task?
- Go to Tasks → Create new task
- Enter a task name
- Add labels (e.g., “car”, “person”, “dog”)
- Upload images or videos
- Click Submit
- Wait for processing to complete
- Click Open to start annotating
What’s the difference between a Task, Job, and Project?
- Project: Container for related tasks with shared labels
- Task: A set of media files to annotate
- Job: A subset of a task’s frames, assigned to an annotator
How do I import existing annotations?
- Open your task
- Go to Actions → Upload annotations
- Select format (COCO, YOLO, etc.)
- Upload your annotation file
- Click Submit
How do I export annotations?
- Open your task
- Go to Actions → Export task dataset
- Select export format
- Choose whether to save images
- Click Export
- Download when ready
Can I use CVAT for video annotation?
Yes! CVAT has excellent video support:- Track objects across frames
- Interpolation between keyframes
- Video chapter navigation
- Frame-by-frame control
- High-performance frame caching
How does tracking work?
- Create a Track annotation (not a shape)
- Draw the object on the first frame
- Navigate to a later frame
- Adjust the annotation position
- CVAT automatically interpolates between frames
Can multiple people annotate the same task?
Yes! Split your task into jobs:- When creating a task, set Job overlap and Segment size
- CVAT creates multiple jobs automatically
- Assign different jobs to different annotators
- Each annotator works on their assigned frames
What are serverless functions?
Serverless functions enable:- Auto-annotation: Automatic annotation using AI models
- Interactive annotation: AI-assisted annotation tools (e.g., Segment Anything)
- Tracking: Object tracking across frames
- Segment Anything (SAM)
- YOLOv7
- Mask R-CNN
- Custom models via Nuclio
Troubleshooting
CVAT is slow or unresponsive
Common causes:- Insufficient RAM: Increase Docker memory limit
- Large images: Use smaller images or enable compression
- Too many objects: Split task into smaller jobs
- Slow storage: Use SSD instead of HDD
- Enable frame caching
- Use compressed chunks
- Reduce chunk size
- Close unused browser tabs
I can’t upload large videos
For large files:- Increase Traefik’s upload size limit
- Use cloud storage integration (AWS S3, Azure, Google Cloud)
- Split video into smaller segments
- Use mounted file shares
docker-compose.yml:
Annotations disappeared or weren’t saved
Check:- Click Save (Ctrl+S) regularly
- Check browser console for errors
- Verify permissions (can you write to the job?)
- Check server logs:
docker logs cvat_server
- Check Actions → Export job dataset to see if data exists
- Contact admin if data is critical
Docker containers fail to start
Common issues: Port conflicts:- Increase Docker memory limit (Docker Desktop → Settings → Resources)
- Kill unnecessary processes
How do I check logs?
Database is corrupted
To reset (WARNING: This deletes all data!):CVAT shows “502 Bad Gateway”
This usually means backend isn’t ready:- Database is running:
docker ps | grep db - Redis is running:
docker ps | grep redis - Migrations completed:
docker exec cvat_server python manage.py showmigrations
Features
Can I use CVAT offline?
Yes, if self-hosted! Once installed:- No internet connection required for annotation
- Internet needed for:
- Auto-annotation (if using cloud models)
- Cloud storage integration
- Updates and new models
Does CVAT support 3D annotation?
Yes! CVAT supports:- Point cloud annotation (.pcd files)
- 3D cuboids with rotation
- Multiple camera views
- Related images (synchronized 2D images)
Can I customize CVAT?
Yes! CVAT is highly customizable:- UI plugins: Add custom UI components
- Server plugins: Extend backend functionality
- Custom annotation formats: Add import/export formats
- Serverless functions: Deploy custom AI models
- Webhooks: Integrate with external systems
Does CVAT have an API?
Yes! CVAT provides:- REST API: Full-featured HTTP API
- Python SDK:
pip install cvat-sdk - CLI:
pip install cvat-cli - OpenAPI schema: https://app.cvat.ai/api/docs
Can I integrate CVAT with my ML pipeline?
Yes! Common integrations:- FiftyOne: Dataset management and visualization
- Roboflow: Model training and deployment
- HuggingFace: Model integration
- Custom scripts: Using SDK/CLI
- Webhooks: Real-time notifications
- Train model
- Use model for auto-annotation in CVAT
- Manually correct predictions
- Export improved dataset
- Retrain model
- Repeat
What about annotation quality control?
CVAT includes:- Quality reports: Compare annotations between annotators
- Consensus: Merge annotations from multiple annotators
- Honeypot frames: Ground truth frames for validation
- Review mode: Accept/reject annotations with comments
- Inter-annotator agreement: Calculate agreement metrics
Does CVAT support SSO/LDAP?
Yes, in Enterprise version:- LDAP/Active Directory
- SAML 2.0
- OAuth 2.0
Can I run CVAT in production?
Yes! For production:- Use Kubernetes/Helm for orchestration
- Set up SSL/TLS (HTTPS)
- Configure backups (database and media)
- Use cloud storage (S3, Azure, GCS)
- Set up monitoring (Prometheus, Grafana)
- Consider Enterprise support
Getting Help
Where can I get help?
- Documentation: https://docs.cvat.ai/
- GitHub Issues: https://github.com/cvat-ai/cvat/issues
- Gitter Chat: https://gitter.im/opencv-cvat/public
- Discord: https://discord.gg/fNR3eXfk6C
- Stack Overflow: Tag questions with
#cvat - Email: [email protected]
How do I report a bug?
- Check if already reported: GitHub Issues
- Create new issue with:
- CVAT version
- Steps to reproduce
- Expected vs actual behavior
- Screenshots/videos if applicable
- Browser console errors
- Server logs if relevant
How do I request a feature?
- Check existing requests
- Create new issue:
- Describe the feature
- Explain use case
- Provide examples if possible
- Discuss with maintainers
- Consider contributing!
Can I contribute to CVAT?
Absolutely! Contributions welcome:- Code: Fix bugs, add features
- Documentation: Improve guides and docs
- Testing: Report bugs, test new features
- Community: Answer questions, help others
More Resources
- Website: https://www.cvat.ai/
- Documentation: https://docs.cvat.ai/
- GitHub: https://github.com/cvat-ai/cvat
- YouTube: https://www.youtube.com/@cvat-ai
- LinkedIn: https://www.linkedin.com/company/cvat-ai/
- Blog: https://www.cvat.ai/post
Still Have Questions?
If your question isn’t answered here:- Check the full documentation
- Ask on Gitter or Discord
- Open a GitHub Issue
- Contact us at [email protected]