Overview
Qdrant is a vector similarity search engine written in Rust that provides high performance, rich filtering, and advanced features like payload-based filtering and hybrid search.Setup
- Local Docker
- Qdrant Cloud
- Self-Hosted
Run Qdrant locally with Docker:Then in Flowise:
Configuration
Required Parameters
URL of your Qdrant instance:
- Local:
http://localhost:6333 - Cloud:
https://your-cluster.cloud.qdrant.io
Name of the collection to use (created automatically if doesn’t exist)
Embedding model for vector generation
Optional Parameters
Qdrant API key credential (required for Qdrant Cloud)
Documents to upsert into the collection
Track indexed documents to prevent duplication
Vector dimension size (must match embedding model):
- OpenAI ada-002: 1536
- OpenAI text-embedding-3-small: 1536
- OpenAI text-embedding-3-large: 3072
Distance metric for similarity:
Cosine- Normalized similarity (recommended)Euclid- Euclidean distanceDot- Dot product
Key for storing document content in payload
Key for storing document metadata in payload
Batch size for upserting vectors. Useful for large datasets:
Advanced collection configuration (see Qdrant docs):
Filter search results by payload:
Number of results to return
Enable file uploads scoped to chat session
Usage Examples
Basic Local Setup
Qdrant Cloud Production
Batch Upsert for Large Datasets
With Advanced Filtering
Custom Payload Keys
Qdrant Filter Syntax
Qdrant supports powerful filtering:Best Practices
Collection Design
- Create separate collections per domain
- Use descriptive collection names
- Set appropriate vector dimensions
- Choose right distance metric
Performance
- Use batch upserts for large datasets
- Enable indexing for frequent filters
- Monitor collection size
- Use appropriate batch sizes
Filtering
- Index frequently filtered fields
- Use payload schema for validation
- Combine filters efficiently
- Test filter performance
Scaling
- Use Qdrant Cloud for production
- Monitor memory usage
- Implement sharding for huge datasets
- Set up replicas for HA
Advanced Features
Payload Schema
Define schema for validation:Optimizer Configuration
Quantization for Memory Efficiency
Common Issues
Connection Failed
Connection Failed
Can’t connect to Qdrant serverSolution:
- Verify Qdrant is running:
curl http://localhost:6333 - Check URL format includes http/https
- For cloud: verify API key is correct
- Check firewall/network settings
Dimension Mismatch
Dimension Mismatch
Error: “Vector dimension does not match collection”Solution:
- Set correct vector dimension parameter
- Ensure embeddings match dimension
- Delete and recreate collection if needed
- Verify embedding model configuration
Collection Already Exists
Collection Already Exists
Error when creating collectionSolution:
- Collections are auto-created, this is normal
- Use existing collection or delete old one
- Check if dimension/metric matches
Slow Queries
Slow Queries
Search is taking too longSolution:
- Create indexes on filtered fields
- Reduce topK value
- Use more restrictive filters
- Consider quantization
- Check collection size and resources
Monitoring
Qdrant provides telemetry and monitoring:- Vector count per collection
- Query latency (p50, p95, p99)
- Memory usage
- Disk usage
- Indexing status
Outputs
Retriever interface with configured filters and topK
Direct vector store for custom operations