DataStax Astra DB

Overview

DataStax Astra DB is a cloud-native vector database built on Apache Cassandra. It provides serverless vector search with automatic scaling and multi-region support.

Installation

npm install @llamaindex/astra @datastax/astra-db-ts

Basic Usage

import { AstraDBVectorStore } from "@llamaindex/astra";
import { VectorStoreIndex, Document } from "llamaindex";

const vectorStore = new AstraDBVectorStore({
  params: {
    token: process.env.ASTRA_DB_APPLICATION_TOKEN,
    endpoint: process.env.ASTRA_DB_API_ENDPOINT,
    namespace: "default_keyspace"
  }
});

// Connect to existing collection or create new one
await vectorStore.connect("my_collection");
// OR
// await vectorStore.createAndConnect("my_collection", {
//   vector: { dimension: 1536, metric: "cosine" }
// });

const documents = [
  new Document({ text: "LlamaIndex is a data framework." }),
  new Document({ text: "Astra DB is a vector database." })
];

const index = await VectorStoreIndex.fromDocuments(documents, {
  storageContext: { vectorStore }
});

const queryEngine = index.asQueryEngine();
const response = await queryEngine.query({
  query: "What is Astra DB?"
});

Constructor Options

params.token

string

required

Astra DB application token (defaults to ASTRA_DB_APPLICATION_TOKEN env var)

params.endpoint

string

required

Astra DB API endpoint (defaults to ASTRA_DB_API_ENDPOINT env var)

params.namespace

string

default:"default_keyspace"

Astra DB namespace/keyspace (defaults to ASTRA_DB_NAMESPACE env var)

idKey

string

default:"_id"

Field name for storing document IDs

contentKey

string

default:"content"

Field name for storing text content

Configuration

Environment Variables

ASTRA_DB_APPLICATION_TOKEN=AstraCS:xyz...
ASTRA_DB_API_ENDPOINT=https://xxx-yyy-zzz.apps.astra.datastax.com
ASTRA_DB_NAMESPACE=default_keyspace  # Optional

Getting Astra DB Credentials

Sign up at Astra DB
Create a new database
Generate an application token
Copy the API endpoint from the database overview

Collection Management

Create and Connect

import { AstraDBVectorStore } from "@llamaindex/astra";

const vectorStore = new AstraDBVectorStore({
  params: {
    token: process.env.ASTRA_DB_APPLICATION_TOKEN,
    endpoint: process.env.ASTRA_DB_API_ENDPOINT
  }
});

// Create new collection with vector configuration
await vectorStore.createAndConnect("my_collection", {
  vector: {
    dimension: 1536,  // Match your embedding model
    metric: "cosine"  // or "euclidean", "dot_product"
  }
});

Connect to Existing Collection

// Connect to existing collection
await vectorStore.connect("existing_collection");

Important

You must call either connect() or createAndConnect() before adding, deleting, or querying documents.

Querying

Basic Query

const index = await VectorStoreIndex.fromVectorStore(vectorStore);

const retriever = index.asRetriever({
  similarityTopK: 5
});

const nodes = await retriever.retrieve("query text");

nodes.forEach(node => {
  console.log(`Score: ${node.score}`);
  console.log(`Text: ${node.node.text}`);
});

Metadata Filtering

import { MetadataFilters, FilterCondition, FilterOperator } from "@llamaindex/core/vector-store";

const documents = [
  new Document({
    text: "Doc 1",
    metadata: { category: "tech", year: 2023, tags: ["ai", "ml"] }
  }),
  new Document({
    text: "Doc 2",
    metadata: { category: "science", year: 2024 }
  })
];

const index = await VectorStoreIndex.fromDocuments(documents, {
  storageContext: { vectorStore }
});

const retriever = index.asRetriever({
  filters: new MetadataFilters({
    filters: [
      { key: "category", value: "tech", operator: FilterOperator.EQ },
      { key: "year", value: 2023, operator: FilterOperator.GTE }
    ],
    condition: FilterCondition.AND
  })
});

const nodes = await retriever.retrieve("query");

Supported Filter Operators

Astra DB supports:

EQ - Equal
NE - Not equal
GT - Greater than
LT - Less than
GTE - Greater than or equal
LTE - Less than or equal
IN - Value in array
NIN - Value not in array
IS_EMPTY - Array is empty (size = 0)

Managing Data

Add Documents

const newDoc = new Document({
  text: "New content",
  metadata: { source: "api" }
});
await index.insert(newDoc);

Delete by Document ID

await vectorStore.delete(refDocId);

Access Astra DB Client

const client = vectorStore.client();

// Use DataAPI client directly
// See: https://docs.datastax.com/en/astra-db-serverless/api-reference/data-api.html

Complete Example

import { AstraDBVectorStore } from "@llamaindex/astra";
import { VectorStoreIndex, Document, Settings } from "llamaindex";
import { OpenAI, OpenAIEmbedding } from "@llamaindex/openai";

// Configure settings
Settings.llm = new OpenAI({ model: "gpt-4" });
Settings.embedModel = new OpenAIEmbedding();

// Create vector store
const vectorStore = new AstraDBVectorStore({
  params: {
    token: process.env.ASTRA_DB_APPLICATION_TOKEN,
    endpoint: process.env.ASTRA_DB_API_ENDPOINT,
    namespace: "llamaindex"
  }
});

// Create collection with vector configuration
await vectorStore.createAndConnect("technical_docs", {
  vector: {
    dimension: 1536,
    metric: "cosine"
  }
});

// Load documents
const documents = [
  new Document({
    text: "Astra DB is a serverless vector database...",
    metadata: { source: "docs", category: "database" }
  }),
  new Document({
    text: "LlamaIndex integrates with Astra DB...",
    metadata: { source: "tutorial", category: "integration" }
  })
];

// Build index
const index = await VectorStoreIndex.fromDocuments(documents, {
  storageContext: { vectorStore }
});

// Query with filters
const retriever = index.asRetriever({
  similarityTopK: 3,
  filters: new MetadataFilters({
    filters: [
      { key: "category", value: "database", operator: FilterOperator.EQ }
    ]
  })
});

const nodes = await retriever.retrieve("serverless vector database");
console.log(nodes);

Distance Metrics

Astra DB supports different similarity metrics:

// Cosine similarity (recommended for normalized embeddings)
await vectorStore.createAndConnect("cosine_collection", {
  vector: { dimension: 1536, metric: "cosine" }
});

// Euclidean distance
await vectorStore.createAndConnect("euclidean_collection", {
  vector: { dimension: 1536, metric: "euclidean" }
});

// Dot product
await vectorStore.createAndConnect("dot_product_collection", {
  vector: { dimension: 1536, metric: "dot_product" }
});

Multi-Region Support

Astra DB automatically replicates data across multiple regions:

// Data is automatically replicated
const vectorStore = new AstraDBVectorStore({
  params: {
    token: process.env.ASTRA_DB_APPLICATION_TOKEN,
    endpoint: process.env.ASTRA_DB_API_ENDPOINT
  }
});

// Queries automatically use nearest region

Namespaces (Keyspaces)

Organize collections into namespaces:

// Production namespace
const prodStore = new AstraDBVectorStore({
  params: {
    token: process.env.ASTRA_DB_APPLICATION_TOKEN,
    endpoint: process.env.ASTRA_DB_API_ENDPOINT,
    namespace: "production"
  }
});

// Development namespace
const devStore = new AstraDBVectorStore({
  params: {
    token: process.env.ASTRA_DB_APPLICATION_TOKEN,
    endpoint: process.env.ASTRA_DB_API_ENDPOINT,
    namespace: "development"
  }
});

Best Practices

Use appropriate metric: Cosine for normalized vectors, dot_product for speed
Organize with namespaces: Separate environments or use cases
Monitor usage: Track API requests and storage in Astra console
Handle errors: Implement retry logic for transient failures
Batch operations: Insert multiple documents at once for efficiency
Use serverless: Automatic scaling eliminates capacity planning

Troubleshooting

Authentication Failed

Verify your credentials:

try {
  const vectorStore = new AstraDBVectorStore({
    params: {
      token: process.env.ASTRA_DB_APPLICATION_TOKEN,
      endpoint: process.env.ASTRA_DB_API_ENDPOINT
    }
  });
  await vectorStore.connect("test_collection");
  console.log("Connected successfully");
} catch (error) {
  console.error("Authentication error:", error.message);
}

Collection Not Found

Create the collection first:

// Create if it doesn't exist
await vectorStore.createAndConnect("my_collection", {
  vector: { dimension: 1536, metric: "cosine" }
});

Dimension Mismatch

Ensure embedding dimensions match collection configuration:

import { OpenAIEmbedding } from "@llamaindex/openai";

// text-embedding-3-small: 1536 dimensions
const embedModel = new OpenAIEmbedding({
  model: "text-embedding-3-small"
});

// Astra collection must match
await vectorStore.createAndConnect("my_collection", {
  vector: {
    dimension: 1536,  // Must match embedding model
    metric: "cosine"
  }
});

Core Package

Main Package

LLM Providers

Vector Stores

Workflow & Tools

Overview

Installation

Basic Usage

Constructor Options

Configuration

Environment Variables

Getting Astra DB Credentials

Collection Management

Create and Connect

Connect to Existing Collection

Important

Querying

Basic Query

Metadata Filtering

Supported Filter Operators

Managing Data

Add Documents

Delete by Document ID

Access Astra DB Client

Complete Example

Distance Metrics

Multi-Region Support

Namespaces (Keyspaces)

Best Practices

Troubleshooting

Authentication Failed

Collection Not Found

Dimension Mismatch

See Also

Build docs developers (and LLMs) love

Core Package

Main Package

LLM Providers

Vector Stores

Workflow & Tools

​Overview

​Installation

​Basic Usage

​Constructor Options

​Configuration

​Environment Variables

​Getting Astra DB Credentials

​Collection Management

​Create and Connect

​Connect to Existing Collection

​Important

​Querying

​Basic Query

​Metadata Filtering

​Supported Filter Operators

​Managing Data

​Add Documents

​Delete by Document ID

​Access Astra DB Client

​Complete Example

​Distance Metrics

​Multi-Region Support

​Namespaces (Keyspaces)

​Best Practices

​Troubleshooting

​Authentication Failed

​Collection Not Found

​Dimension Mismatch

​See Also

Build docs developers (and LLMs) love

Overview

Installation

Basic Usage

Constructor Options

Configuration

Environment Variables

Getting Astra DB Credentials

Collection Management

Create and Connect

Connect to Existing Collection

Important

Querying

Basic Query

Metadata Filtering

Supported Filter Operators

Managing Data

Add Documents

Delete by Document ID

Access Astra DB Client

Complete Example

Distance Metrics

Multi-Region Support

Namespaces (Keyspaces)

Best Practices

Troubleshooting

Authentication Failed

Collection Not Found

Dimension Mismatch

See Also