Overview

The IVFIndexParam class configures an IVF (Inverted File) index. IVF partitions the vector space into clusters and performs approximate nearest neighbor search by searching only a subset of those clusters, providing a good balance between speed and accuracy for large datasets.

Constructor

IVFIndexParam(
    metric_type: MetricType = MetricType.IP,
    n_list: int = 0,
    n_iters: int = 10,
    use_soar: bool = False,
    quantize_type: QuantizeType = QuantizeType.UNDEFINED
)

Parameters

metric_type
MetricType
default:"MetricType.IP"
Distance metric used for similarity computation. Options:
  • MetricType.IP - Inner product (default)
  • MetricType.L2 - Euclidean distance
  • MetricType.COSINE - Cosine similarity
n_list
int
default:"0"
Number of clusters (inverted lists) to partition the dataset into. This is the primary parameter for IVF indexes.
Effects:
  • More clusters improve search speed but may reduce accuracy
  • Fewer clusters improve accuracy but slow down search
  • Set to 0 for automatic selection based on dataset size
  • Typical range: sqrt(N) to N/100, where N is the number of vectors
Auto-selection (n_list=0): When set to 0, the system automatically chooses an optimal value based on your dataset size.
n_iters
int
default:"10"
Number of iterations for k-means clustering during index training. More iterations yield more stable and accurate centroids.
Effects:
  • Higher values improve cluster quality
  • Higher values increase index construction time
  • Typical range: 10-50
  • Default of 10 works well for most cases
use_soar
bool
default:"False"
Whether to enable SOAR (Scalable Optimized Adaptive Routing) for improved IVF search performance.
SOAR is an optimization technique that can improve query performance by adaptively routing queries to the most relevant clusters.
quantize_type
QuantizeType
default:"QuantizeType.UNDEFINED"
Optional quantization type for vector compression. Options:
  • QuantizeType.UNDEFINED - No quantization (default)
  • QuantizeType.FP16 - 16-bit floating point
  • QuantizeType.INT8 - 8-bit integer quantization
Quantization can significantly reduce memory usage, especially important for large IVF indexes.

Properties

n_list

Number of inverted lists (clusters). Returns 0 if auto-selection is enabled. Type: int

n_iters

Number of k-means iterations during training. Type: int

use_soar

Whether SOAR optimization is enabled. Type: bool

Methods

to_dict()

Convert the index parameters to a dictionary representation. Returns: dict - Dictionary with all index parameter fields.

Examples

Basic IVF index with auto-selection

from zvec import IVFIndexParam
from zvec.typing import MetricType

# Create index with automatic n_list selection
index_params = IVFIndexParam(
    metric_type=MetricType.COSINE
)

Explicit cluster count

from zvec import IVFIndexParam
from zvec.typing import MetricType

# Manually set number of clusters
index_params = IVFIndexParam(
    metric_type=MetricType.COSINE,
    n_list=100,
    n_iters=15
)

High accuracy configuration

from zvec import IVFIndexParam
from zvec.typing import MetricType

# Use fewer clusters for higher accuracy
index_params = IVFIndexParam(
    metric_type=MetricType.IP,
    n_list=50,
    n_iters=25,
    use_soar=True
)

Memory-efficient configuration

from zvec import IVFIndexParam
from zvec.typing import MetricType, QuantizeType

# Use quantization to reduce memory usage
index_params = IVFIndexParam(
    metric_type=MetricType.L2,
    n_list=100,
    quantize_type=QuantizeType.INT8
)

Large dataset configuration

from zvec import IVFIndexParam
from zvec.typing import MetricType, QuantizeType

# Optimize for millions of vectors
index_params = IVFIndexParam(
    metric_type=MetricType.COSINE,
    n_list=1000,
    n_iters=20,
    use_soar=True,
    quantize_type=QuantizeType.FP16
)

Using with a collection

import zvec
from zvec import IVFIndexParam
from zvec.typing import MetricType, DataType

# Create a collection with IVF index
collection = zvec.create_collection(
    path="./my_collection",
    dimension=768,
    data_type=DataType.FLOAT32,
    index_param=IVFIndexParam(
        metric_type=MetricType.COSINE,
        n_list=100,
        use_soar=True
    )
)

Performance Tuning

Choosing n_list:
  • Small datasets (under 100K vectors): 50-100 clusters
  • Medium datasets (100K-1M vectors): sqrt(N) clusters
  • Large datasets (over 1M vectors): N/100 to N/1000 clusters
  • Start with n_list=0 (auto) and adjust based on performance metrics
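The sizing guidance above can be sketched as a small helper. This is a hypothetical illustration of the rules of thumb listed here, not the library's actual auto-selection logic (n_list=0 may use a different rule internally):

```python
import math

# Hypothetical helper encoding the n_list rules of thumb above.
# The library's own auto-selection (n_list=0) may differ.
def suggest_n_list(n_vectors: int) -> int:
    if n_vectors < 100_000:
        # Small datasets: 50-100 clusters
        return max(50, min(100, n_vectors // 1000))
    if n_vectors <= 1_000_000:
        # Medium datasets: roughly sqrt(N) clusters
        return math.isqrt(n_vectors)
    # Large datasets: N/1000 to N/100 clusters
    return n_vectors // 1000

print(suggest_n_list(50_000))      # small dataset -> 50-100 range
print(suggest_n_list(250_000))    # sqrt(250000) = 500
print(suggest_n_list(10_000_000)) # 10_000_000 / 1000 = 10000
```

Treat the output as a starting point and tune against measured recall and latency on your own data.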
Training time vs search performance:
  • Higher n_iters improves cluster quality but increases index build time
  • Use n_iters=10 for quick experiments
  • Use n_iters=20-50 for production deployments
  • The training cost is paid once during index construction

Memory Usage

Approximate memory per vector:
Memory = dimension * bytes_per_element + overhead
Where:
  • bytes_per_element = 4 for FLOAT32, 2 for FP16, 1 for INT8
  • overhead = small constant for cluster metadata
IVF indexes are generally more memory-efficient than HNSW, especially with quantization.
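The formula above can be applied as a back-of-envelope estimate. The per-vector overhead constant below is an assumed placeholder, not a value documented by the library:

```python
# Back-of-envelope index size per the formula above.
# bytes_per_element: 4 (FLOAT32), 2 (FP16), 1 (INT8).
# overhead_per_vector is an assumed placeholder for cluster metadata.
def index_bytes(n_vectors: int, dimension: int, bytes_per_element: int,
                overhead_per_vector: int = 8) -> int:
    return n_vectors * (dimension * bytes_per_element + overhead_per_vector)

# 1M 768-dim vectors: FLOAT32 needs roughly 4x the memory of INT8.
print(index_bytes(1_000_000, 768, 4) / 1e9)  # FLOAT32, in GB
print(index_bytes(1_000_000, 768, 1) / 1e9)  # INT8, in GB
```

This is why INT8 quantization matters at scale: the vector payload shrinks by 4x, while only the small metadata overhead stays fixed.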

Query Time Configuration

At query time, use IVFQueryParam to control the number of clusters searched:
from zvec import IVFQueryParam, VectorQuery

# Search more clusters for higher accuracy
params = IVFQueryParam(nprobe=20)

query = VectorQuery(
    field_name="embedding",
    vector=[0.1, 0.2, 0.3],
    param=params
)
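As a rough intuition for choosing nprobe: under a simplified model that assumes evenly sized clusters (real clusters are skewed), nprobe / n_list approximates the fraction of the dataset scanned per query:

```python
# Approximate fraction of vectors scanned per query, assuming
# evenly sized clusters (a simplification; real clusters are skewed).
def fraction_scanned(nprobe: int, n_list: int) -> float:
    return min(nprobe, n_list) / n_list

# With n_list=100 and nprobe=20, about 20% of vectors are scanned.
print(fraction_scanned(20, 100))
```

Raising nprobe improves recall at the cost of scanning a larger fraction of the index; nprobe equal to n_list degenerates to an exhaustive scan.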

When to Use IVF

IVF is ideal for:
  • Large datasets (millions to billions of vectors)
  • Memory-constrained environments (especially with quantization)
  • Scenarios where moderate recall (85-95%) is acceptable
  • Cost-sensitive deployments
Consider alternatives for:
  • Small datasets (under 10K vectors) - use Flat or HNSW
  • Need highest accuracy (over 98% recall) - use HNSW
  • Real-time requirements with very low latency - use HNSW

SOAR Optimization

SOAR (Scalable Optimized Adaptive Routing) is an advanced optimization that:
  • Adaptively routes queries to the most relevant clusters
  • Can improve search speed without sacrificing accuracy
  • Recommended for production deployments with large datasets
index_params = IVFIndexParam(
    metric_type=MetricType.COSINE,
    n_list=1000,
    use_soar=True
)
