Semantic Search

Semantic search enables you to search documents using natural language queries instead of exact keyword matches. TopK automatically generates embeddings for your text fields and handles the similarity computation.

Overview

With semantic search, you can:

Search using natural language queries without providing embeddings
Automatically generate embeddings using built-in models
Find semantically similar content even when exact words don’t match
Combine semantic search with filters and other search types

Schema Setup

To enable semantic search on a field, add a semanticIndex() to a text field:

import { Client } from "topk-js";
import { text, semanticIndex } from "topk-js/schema";

const client = new Client({
  apiKey: "YOUR_API_KEY",
  region: "aws-us-east-1-elastica"
});

await client.collections().create("books", {
  title: text().required().index(semanticIndex()),
  summary: text().index(semanticIndex({ model: "cohere/embed-v4" }))
});

Supported Models

TopK supports the following embedding models:

cohere/embed-v4 (default)
cohere/embed-english-v3
cohere/embed-multilingual-v3

Embedding Types

You can specify the embedding data type for optimized storage:

float32 - Full precision (default)
uint8 - Quantized to 8-bit unsigned integers
binary - Binary embeddings for maximum compression

import { text, semanticIndex } from "topk-js/schema";

const schema = {
  title: text().index(semanticIndex({
    model: "cohere/embed-v4",
    embeddingType: "uint8"
  }))
};

Inserting Documents

When you insert documents, TopK automatically generates embeddings for fields with semantic indexes:

await client.collection("books").upsert([
  { 
    _id: "gatsby", 
    title: "The Great Gatsby",
    summary: "A story of love and the American Dream"
  },
  { 
    _id: "1984", 
    title: "1984",
    summary: "A dystopian novel about totalitarianism"
  }
]);

Querying with Semantic Search

Use fn.semanticSimilarity() to compute semantic similarity between your query and indexed fields:

import { select, field, fn } from "topk-js/query";

const results = await client.collection("books").query(
  select({
    title: field("title"),
    similarity: fn.semanticSimilarity("title", "classic American novel")
  })
  .topk(field("similarity"), 10)
);

Multiple Field Search

Combine semantic similarity scores from multiple fields:

import { select, field, fn } from "topk-js/query";

const results = await client.collection("books").query(
  select({
    title: field("title"),
    title_sim: fn.semanticSimilarity("title", "love story"),
    summary_sim: fn.semanticSimilarity("summary", "love story")
  })
  .topk(
    field("title_sim").add(field("summary_sim")),
    10
  )
);

Combining with Filters

You can filter results before applying semantic search:

import { select, filter, field, fn, match } from "topk-js/query";

const results = await client.collection("books").query(
  select({
    title: field("title"),
    similarity: fn.semanticSimilarity("summary", "dystopian society")
  })
  .filter(match("totalitarian", { field: "summary" }))
  .topk(field("similarity"), 5)
);

Semantic indexes automatically handle embedding generation during document insertion. You don’t need to provide or manage embeddings yourself.

For best results, use descriptive queries that capture the semantic meaning you’re looking for, rather than just keywords.

Reranking - Improve semantic search results with reranking
True Hybrid Search - Combine semantic and keyword search
Vector Search - Bring your own embeddings

Get Started

Core Concepts

Collections

Documents

Advanced

Semantic Search

Overview

Schema Setup

Supported Models

Embedding Types

Inserting Documents

Querying with Semantic Search

Multiple Field Search

Combining with Filters

Build docs developers (and LLMs) love

Get Started

Core Concepts

Collections

Documents

Advanced

​Overview

​Schema Setup

​Supported Models

​Embedding Types

​Inserting Documents

​Querying with Semantic Search

​Multiple Field Search

​Combining with Filters

​Related Concepts

Build docs developers (and LLMs) love

Overview

Schema Setup

Supported Models

Embedding Types

Inserting Documents

Querying with Semantic Search

Multiple Field Search

Combining with Filters

Related Concepts