Overview
Zvec supports querying multiple vector fields simultaneously:- Text + Image: Multi-modal search
- Dense + Sparse: Hybrid retrieval (semantic + keyword)
- Domain-specific: Title, content, metadata embeddings
- Cross-lingual: Different language embeddings
Basic Multi-Vector Query
Schema Definition
Define a collection with multiple vector fields:Inserting Multi-Vector Documents
Querying Multiple Fields
Use Cases
1. Multi-Modal Search (Text + Image)
Search products using both text description and image:2. Hybrid Retrieval (Dense + Sparse)
Combine semantic search with keyword matching:3. Hierarchical Embeddings
Search with title, content, and metadata embeddings:4. Cross-Lingual Search
Search across multiple language embeddings:Result Fusion Strategies
Zvec provides multiple strategies for combining results from different vector fields:1. Reciprocal Rank Fusion (RRF)
Best for: Multi-modal search, hybrid retrieval- No score normalization required
- Robust across different embedding types
- Simple, no hyperparameters to tune
rank_i is the document’s rank in result list i.
2. Weighted Score Fusion
Best for: Known field importance, domain-specific weighting- Fine-grained control over field importance
- Score-aware (uses actual relevance scores)
3. Custom Fusion
Implement domain-specific fusion logic:Query Construction
Per-Field Query Parameters
Specify different query parameters for each vector field:Query by ID
Use existing document as query:Filtering with Multi-Vector Queries
Combine multi-vector search with filters:Performance Considerations
1. Over-Fetching
Fetch more candidates than needed for effective reranking:2. Field Selection
Only query fields that are relevant:3. Parallel Execution
Zvec executes multi-vector queries in parallel internally:Best Practices
- Start with RRF: Use RrfReRanker as baseline, then tune weights if needed
- Over-fetch 5-10x: Fetch 5-10x more candidates than final topk
- Tune weights on validation data: Use A/B testing or grid search
- Monitor per-field performance: Track which fields contribute most
- Use appropriate metrics: COSINE for normalized, L2 for absolute distances
See Also
- Reranking - Detailed reranking strategies
- Index Types - Choose the right index for each field
- Hybrid Search Guide - Dense + sparse retrieval examples