Query Documents

Overview

The query() method executes a query against a collection. Queries are built by chaining together stages like select(), filter(), topk(), limit(), and sort() to create powerful search and retrieval operations.

Method Signature

JavaScript
Python

query(query: Query, options?: QueryOptions): Promise<Array<Record<string, any>>>

def query(
    query: Query,
    lsn: Optional[str] = None,
    consistency: Optional[ConsistencyLevel] = None
) -> list[dict[str, Any]]:
    """Execute a query against the collection."""

Parameters

query

Query

required

A query object built using query stages like select(), filter(), topk(), etc.

options

QueryOptions

Optional query options:

lsn: Wait for writes up to this Log Sequence Number before executing the query
consistency: Consistency level ('indexed' or 'strong')

Returns

results

Array<Record<string, any>>

An array of documents matching the query. Each document contains the fields specified in the select() stage, or all fields if no select stage is used.

Query Stages

select()

Specify which fields to return and compute derived fields.

import { select, field } from "topk-js/query";

const query = select({
  title: field("title"),
  author: field("author"),
  price_with_tax: field("price").mul(1.08)
});

filter()

Filter documents based on conditions.

import { filter, field } from "topk-js/query";

const query = filter(
  field("published_year").gte(2000)
    .and(field("price").lt(50))
);

topk()

Select the top K documents based on a scoring expression.

import { select, field, fn } from "topk-js/query";

const query = select({
  title: field("title"),
  score: fn.vectorDistance("embedding", [0.1, 0.2, 0.3, 0.4])
}).topk(field("score"), 10);

limit()

Limit the number of results returned.

import { filter, field } from "topk-js/query";

const query = filter(field("in_stock").eq(true)).limit(100);

sort()

Sort results by an expression.

import { filter, field } from "topk-js/query";

const query = filter(field("category").eq("books"))
  .sort(field("published_year"), false); // descending

Examples

Vector Search

Find the most similar documents using vector distance.

import { Client } from "topk-js";
import { select, field, fn } from "topk-js/query";

const client = new Client({
  apiKey: "your-api-key",
  region: "us-east-1"
});

const queryVector = [0.12, 0.45, 0.78, 0.23];

const results = await client.collection("books").query(
  select({
    title: field("title"),
    author: field("author"),
    distance: fn.vectorDistance("embedding", queryVector)
  }).topk(field("distance"), 10)
);

results.forEach(doc => {
  console.log(`${doc.title} by ${doc.author} (distance: ${doc.distance})`);
});

Filtered Search

Combine filtering with vector search.

import { Client } from "topk-js";
import { select, field, fn } from "topk-js/query";

const client = new Client({
  apiKey: "your-api-key",
  region: "us-east-1"
});

const results = await client.collection("books").query(
  select({
    title: field("title"),
    author: field("author"),
    year: field("published_year"),
    distance: fn.vectorDistance("embedding", [0.1, 0.2, 0.3, 0.4])
  })
  .filter(
    field("published_year").gte(2000)
      .and(field("genres").contains("Science Fiction"))
  )
  .topk(field("distance"), 20)
);

Keyword Search

Search using keyword matching on indexed fields.

import { Client } from "topk-js";
import { select, field, fn } from "topk-js/query";

const client = new Client({
  apiKey: "your-api-key",
  region: "us-east-1"
});

const results = await client.collection("books").query(
  select({
    title: field("title"),
    author: field("author"),
    score: fn.bm25Score()
  })
  .filter(field("title").matchAny(["machine", "learning", "ai"]))
  .topk(field("score"), 10)
);

Sparse Vector Search

Search using sparse vector embeddings.

import { Client } from "topk-js";
import { select, field, fn } from "topk-js/query";
import { f32SparseVector } from "topk-js/data";

const client = new Client({
  apiKey: "your-api-key",
  region: "us-east-1"
});

const sparseQuery = f32SparseVector({
  0: 0.5,
  10: 0.8,
  100: 0.3
});

const results = await client.collection("documents").query(
  select({
    content: field("content"),
    score: fn.vectorDistance("sparse_embedding", sparseQuery)
  }).topk(field("score"), 15)
);

Semantic Search

Use semantic similarity for text search.

import { Client } from "topk-js";
import { select, field, fn } from "topk-js/query";

const client = new Client({
  apiKey: "your-api-key",
  region: "us-east-1"
});

const results = await client.collection("articles").query(
  select({
    title: field("title"),
    content: field("content"),
    similarity: fn.semanticSimilarity("content", "artificial intelligence applications")
  }).topk(field("similarity"), 10)
);

Complex Expressions

Use computed fields and complex scoring logic.

import { Client } from "topk-js";
import { select, field, fn } from "topk-js/query";

const client = new Client({
  apiKey: "your-api-key",
  region: "us-east-1"
});

const results = await client.collection("products").query(
  select({
    name: field("name"),
    base_price: field("price"),
    discount: field("discount_percent"),
    final_price: field("price").mul(
      field("discount_percent").div(100).sub(1).abs()
    ),
    in_stock: field("in_stock"),
    score: fn.vectorDistance("embedding", [0.1, 0.2, 0.3, 0.4])
      .boost(field("in_stock").eq(true), 1.5)
      .boost(field("discount_percent").gt(20), 1.2)
  })
  .filter(field("category").eq("electronics"))
  .topk(field("score"), 25)
);

Consistency Guarantees

Wait for specific writes before querying.

import { Client } from "topk-js";
import { filter, field } from "topk-js/query";

const client = new Client({
  apiKey: "your-api-key",
  region: "us-east-1"
});

// Perform an upsert and get the LSN
const lsn = await client.collection("books").upsert([{
  _id: "book_1",
  title: "New Book",
  author: "New Author"
}]);

// Query with consistency guarantee
const results = await client.collection("books").query(
  filter(field("author").eq("New Author")),
  { lsn: lsn, consistency: 'strong' }
);

Best Practices

Use Filters Early: Apply filter() stages early in your query pipeline to reduce the number of documents that need processing.
Select Only Needed Fields: Use select() to specify only the fields you need, reducing data transfer and improving performance.
Index Appropriately: Ensure fields used in vector search have vector indexes, and fields used in keyword search have keyword indexes.
Batch Queries: If you need to perform multiple independent queries, consider running them in parallel.
Use topk() for Ranking: When you need ranked results, use topk() instead of sort().limit() for better performance.

get() - Retrieve documents by their IDs
upsert() - Insert or update documents
update() - Update existing documents
delete() - Remove documents

Get Started

Core Concepts

Collections

Documents

Advanced

Query Documents

Overview

Method Signature

Parameters

Returns

Query Stages

select()

filter()

topk()

limit()

sort()

Examples

Vector Search

Filtered Search

Keyword Search

Sparse Vector Search

Semantic Search

Complex Expressions

Consistency Guarantees

Best Practices

Build docs developers (and LLMs) love

Get Started

Core Concepts

Collections

Documents

Advanced

​Overview

​Method Signature

​Parameters

​Returns

​Query Stages

​select()

​filter()

​topk()

​limit()

​sort()

​Examples

​Vector Search

​Filtered Search

​Keyword Search

​Sparse Vector Search

​Semantic Search

​Complex Expressions

​Consistency Guarantees

​Best Practices

​Related Methods

Build docs developers (and LLMs) love

Overview

Method Signature

Parameters

Returns

Query Stages

select()

filter()

topk()

limit()

sort()

Examples

Vector Search

Filtered Search

Keyword Search

Sparse Vector Search

Semantic Search

Complex Expressions

Consistency Guarantees

Best Practices

Related Methods