The Settings object provides centralized configuration for LlamaIndex.TS. It manages global defaults for LLMs, embeddings, chunk sizes, and other parameters used throughout the framework.
Accessing Settings
```typescript
import { Settings } from "llamaindex";

// or from core
import { Settings } from "@llamaindex/core/global";
```
Global Settings Object
Settings is a singleton that stores global configuration:
packages/core/src/global/settings.ts

```typescript
// Shape shown with property types; see the source for the full implementation.
export const Settings = {
  llm: LLM,                         // Default language model
  embedModel: BaseEmbedding,        // Default embedding model
  tokenizer: Tokenizer,             // Token counting
  chunkSize: number,                // Text chunk size
  callbackManager: CallbackManager, // Event handling
  debug: boolean,                   // Debug mode
};
```
Settings uses AsyncLocalStorage internally, allowing context-aware configuration in concurrent operations.
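As a mental model, the scoped-override behavior can be sketched with Node's AsyncLocalStorage directly. This is a simplified illustration of the pattern, not the actual LlamaIndex source; the names `getLlm` and `withLlm` are stand-ins:

```typescript
import { AsyncLocalStorage } from "node:async_hooks";

// A scoped value layered over a global default via AsyncLocalStorage.
const llmStorage = new AsyncLocalStorage<string>();
const globalLlm = "gpt-3.5-turbo";

// Scoped value wins; otherwise fall back to the global default.
function getLlm(): string {
  return llmStorage.getStore() ?? globalLlm;
}

// Run fn with a scoped override; the previous value applies again afterwards.
function withLlm<T>(llm: string, fn: () => T): T {
  return llmStorage.run(llm, fn);
}

console.log(getLlm()); // "gpt-3.5-turbo" (global default)
console.log(withLlm("gpt-4", () => getLlm())); // "gpt-4" (scoped)
console.log(getLlm()); // back to "gpt-3.5-turbo"
```

Because the store is per async context, concurrent operations can each see their own scoped configuration without interfering with one another.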
Configuring the Default LLM
Basic Configuration
```typescript
import { Settings } from "llamaindex";
import { OpenAI } from "@llamaindex/openai";

Settings.llm = new OpenAI({
  model: "gpt-4-turbo-preview",
  apiKey: process.env.OPENAI_API_KEY,
  temperature: 0.7,
  maxTokens: 1000,
});
```
You must set Settings.llm before using any functionality that requires an LLM, or you’ll get an error.
LLM Configuration Error
From @llamaindex/core/global/settings/llm.ts
```typescript
export function getLLM(): LLM {
  const currentLLM = llmAsyncLocalStorage.getStore() ?? globalLLM;
  if (!currentLLM) {
    throw new Error(
      "Cannot find LLM, please set `Settings.llm = ...` on the top of your code. " +
        "Check https://ts.llamaindex.ai/docs/llamaindex/modules/models/llms for details.",
    );
  }
  return currentLLM;
}
```
Multiple LLM Providers
OpenAI

```typescript
import { OpenAI } from "@llamaindex/openai";

Settings.llm = new OpenAI({
  model: "gpt-4-turbo-preview",
  apiKey: process.env.OPENAI_API_KEY,
});
```

Anthropic

```typescript
import { Anthropic } from "@llamaindex/anthropic";

Settings.llm = new Anthropic({
  model: "claude-3-5-sonnet-20241022",
  apiKey: process.env.ANTHROPIC_API_KEY,
});
```

Ollama

```typescript
import { Ollama } from "@llamaindex/ollama";

Settings.llm = new Ollama({
  model: "llama3.1",
  baseURL: "http://localhost:11434",
});
```

Google

```typescript
import { Gemini } from "@llamaindex/google";

Settings.llm = new Gemini({
  model: "gemini-1.5-pro",
  apiKey: process.env.GOOGLE_API_KEY,
});
```
Configuring Embeddings
Basic Configuration
Set global embedding model
```typescript
import { Settings } from "llamaindex";
import { OpenAIEmbedding } from "@llamaindex/openai";

Settings.embedModel = new OpenAIEmbedding({
  model: "text-embedding-3-large",
  apiKey: process.env.OPENAI_API_KEY,
  dimensions: 1024,
});
```
Embedding Model Error
From @llamaindex/core/global/settings/embedModel.ts
```typescript
export function getEmbeddedModel(): BaseEmbedding {
  const currentEmbeddedModel =
    embeddedModelAsyncLocalStorage.getStore() ?? globalEmbeddedModel;
  if (!currentEmbeddedModel) {
    throw new Error(
      "Cannot find Embedding, please set `Settings.embedModel = ...` on the top of your code. " +
        "Check https://ts.llamaindex.ai/docs/llamaindex/modules/models/embeddings for details.",
    );
  }
  return currentEmbeddedModel;
}
```
Multiple Embedding Providers
OpenAI Embeddings

```typescript
import { OpenAIEmbedding } from "@llamaindex/openai";

Settings.embedModel = new OpenAIEmbedding({
  model: "text-embedding-3-small",
  dimensions: 512,
});
```

Hugging Face, Voyage AI, and Cohere embeddings follow the same pattern via their respective provider packages.
Context-Aware Settings
Use `withLLM`, `withEmbedModel`, and the other `with*` methods for scoped configuration:

```typescript
import { Settings } from "llamaindex";
import { OpenAI } from "@llamaindex/openai";
import { Anthropic } from "@llamaindex/anthropic";

// Global default
Settings.llm = new OpenAI({ model: "gpt-3.5-turbo" });

// Use a different LLM in a specific scope
const result1 = await Settings.withLLM(new OpenAI({ model: "gpt-4" }), async () => {
  // This code uses GPT-4
  const response = await Settings.llm.chat({
    messages: [{ role: "user", content: "Complex task" }],
  });
  return response;
});

// Use Anthropic in another scope
const result2 = await Settings.withLLM(
  new Anthropic({ model: "claude-3-opus-20240229" }),
  async () => {
    // This code uses Claude 3 Opus
    const response = await Settings.llm.chat({
      messages: [{ role: "user", content: "Another task" }],
    });
    return response;
  },
);

// Outside these scopes, Settings.llm is the global gpt-3.5-turbo
```
All Context Methods
```typescript
Settings.withLLM<Result>(llm: LLM, fn: () => Result): Result
Settings.withEmbedModel<Result>(embedModel: BaseEmbedding, fn: () => Result): Result
Settings.withTokenizer<Result>(tokenizer: Tokenizer, fn: () => Result): Result
Settings.withChunkSize<Result>(chunkSize: number, fn: () => Result): Result
Settings.withCallbackManager<Result>(callbackManager: CallbackManager, fn: () => Result): Result
```
Chunk Size Configuration
Control how documents are split into chunks:
```typescript
import { Settings } from "llamaindex";

// Set global chunk size (in tokens)
Settings.chunkSize = 1024;

// Use a different chunk size in a scope; the callback must be async
// because fromDocuments is awaited inside it
await Settings.withChunkSize(512, async () => {
  // Documents will be chunked into 512-token pieces
  const index = await VectorStoreIndex.fromDocuments(documents);
});
```
The default chunk size is 1024 tokens. Adjust it based on your LLM's context window and use case.
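When tuning `chunkSize` and `chunkOverlap`, a rough chunk-count estimate is useful. The sketch below is back-of-envelope sliding-window arithmetic, not the splitter's exact algorithm (SentenceSplitter also respects sentence boundaries), and `estimateChunkCount` is a hypothetical helper name:

```typescript
// Back-of-envelope chunk-count estimate for a sliding window with overlap.
function estimateChunkCount(
  totalTokens: number,
  chunkSize: number,
  chunkOverlap: number,
): number {
  if (totalTokens <= chunkSize) return 1;
  const stride = chunkSize - chunkOverlap; // new tokens introduced per chunk
  return Math.ceil((totalTokens - chunkOverlap) / stride);
}

// A 3000-token document at the defaults (1024 tokens, 200 overlap):
console.log(estimateChunkCount(3000, 1024, 200)); // 4
// A smaller chunk size produces proportionally more chunks:
console.log(estimateChunkCount(3000, 512, 50)); // 7
```

Smaller chunks give more precise retrieval but more embedding calls and less context per chunk; larger chunks do the reverse.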
Node Parser Configuration
The main package extends Settings with node parser configuration:
packages/llamaindex/src/Settings.ts
```typescript
import { Settings as CoreSettings } from "@llamaindex/core/global";
import { SentenceSplitter } from "@llamaindex/core/node-parser";

class GlobalSettings {
  #nodeParser: NodeParser | null = null;

  get nodeParser(): NodeParser {
    if (this.#nodeParser === null) {
      // Lazily create a default SentenceSplitter from the global chunk settings
      this.#nodeParser = new SentenceSplitter({
        chunkSize: this.chunkSize,
        chunkOverlap: this.chunkOverlap,
      });
    }
    return this.#nodeParser;
  }

  set nodeParser(nodeParser: NodeParser) {
    this.#nodeParser = nodeParser;
  }
}

export const Settings = new GlobalSettings();
```
Custom Node Parser
```typescript
import { Settings, SentenceSplitter } from "llamaindex";

Settings.nodeParser = new SentenceSplitter({
  chunkSize: 512,
  chunkOverlap: 50,
  paragraphSeparator: "\n\n",
});
```
Callback Manager
Register event handlers for LLM events:
```typescript
import { Settings, CallbackManager } from "llamaindex";

const callbackManager = new CallbackManager();

callbackManager.on("llm-start", (event) => {
  console.log("LLM started:", event);
});

callbackManager.on("llm-end", (event) => {
  console.log("LLM finished:", event);
});

callbackManager.on("llm-stream", (event) => {
  console.log("LLM streaming:", event.delta);
});

Settings.callbackManager = callbackManager;
```
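Under the hood this is a standard event-emitter dispatch: `on` registers handlers per event name, and emitting an event invokes them in order. A minimal sketch of that pattern (an illustration, not the real CallbackManager implementation; `TinyEmitter` is a made-up name):

```typescript
type Handler = (event: unknown) => void;

// A tiny event emitter illustrating the on()/emit() dispatch pattern.
class TinyEmitter {
  private handlers = new Map<string, Handler[]>();

  // Register a handler for a named event.
  on(name: string, handler: Handler): void {
    const list = this.handlers.get(name) ?? [];
    list.push(handler);
    this.handlers.set(name, list);
  }

  // Invoke every handler registered for the event, in registration order.
  emit(name: string, event: unknown): void {
    for (const handler of this.handlers.get(name) ?? []) {
      handler(event);
    }
  }
}

const emitter = new TinyEmitter();
const seen: unknown[] = [];
emitter.on("llm-start", (event) => seen.push(event));
emitter.emit("llm-start", { model: "gpt-4" });
// seen now holds the emitted event object
```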
Debug Mode
Enable debug logging:
```typescript
import { Settings } from "llamaindex";

// Via environment variable
process.env.DEBUG = "llamaindex:*";

// Or set localStorage in the browser
if (typeof window !== "undefined") {
  window.localStorage.debug = "llamaindex:*";
}

// Check if debug is enabled
if (Settings.debug) {
  console.log("Debug mode is enabled");
}
```
From @llamaindex/core/global/settings.ts
```typescript
get debug() {
  let debug = getEnv("DEBUG");
  if (typeof window !== "undefined") {
    debug ||= window.localStorage.debug;
  }
  return (
    (Boolean(debug) && debug?.includes("llamaindex")) ||
    debug === "*" ||
    debug === "true"
  );
}
```
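The getter reduces to a pure predicate on the DEBUG string: debug logging is on when the value mentions `llamaindex`, or equals `*` or `true`. Restated as a standalone function for clarity (`isDebugEnabled` is a hypothetical name, not part of the library):

```typescript
// Same logic as the debug getter above, extracted as a pure function.
function isDebugEnabled(debug: string | undefined): boolean {
  return (
    (debug !== undefined && debug.includes("llamaindex")) ||
    debug === "*" ||
    debug === "true"
  );
}

console.log(isDebugEnabled("llamaindex:*")); // true
console.log(isDebugEnabled("*")); // true
console.log(isDebugEnabled("true")); // true
console.log(isDebugEnabled("other:*")); // false
console.log(isDebugEnabled(undefined)); // false
```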
Environment Variables
```bash
OPENAI_API_KEY=sk-...
OPENAI_MODEL=gpt-4-turbo-preview
ANTHROPIC_API_KEY=sk-ant-...
PINECONE_API_KEY=...
PINECONE_INDEX_NAME=my-index
DEBUG=llamaindex:*   # Enable llamaindex debug logs
DEBUG=*              # Enable all debug logs globally
```
Complete Configuration Example
```typescript
import { Settings, VectorStoreIndex, Document, SentenceSplitter } from "llamaindex";
import { OpenAI, OpenAIEmbedding } from "@llamaindex/openai";

// Configure LLM
Settings.llm = new OpenAI({
  model: "gpt-4-turbo-preview",
  apiKey: process.env.OPENAI_API_KEY,
  temperature: 0.7,
  maxTokens: 2000,
});

// Configure embeddings
Settings.embedModel = new OpenAIEmbedding({
  model: "text-embedding-3-large",
  apiKey: process.env.OPENAI_API_KEY,
  dimensions: 1024,
});

// Configure node parser
Settings.nodeParser = new SentenceSplitter({
  chunkSize: 1024,
  chunkOverlap: 200,
});

// Configure chunk size
Settings.chunkSize = 1024;

// Now use the framework - Settings is picked up automatically
const documents = [new Document({ text: "Your content here" })];

// Uses Settings.llm, Settings.embedModel, Settings.nodeParser
const index = await VectorStoreIndex.fromDocuments(documents);
const queryEngine = index.asQueryEngine();
const response = await queryEngine.query({
  query: "What is this about?",
});
```
Settings in Different Contexts
Index Creation
```typescript
const index = await VectorStoreIndex.fromDocuments(documents);
// Uses:
// - Settings.nodeParser to chunk documents
// - Settings.embedModel to create embeddings
```
Query Engine
```typescript
const queryEngine = index.asQueryEngine();
const response = await queryEngine.query({ query: "..." });
// Uses:
// - Settings.llm to generate responses
// - Settings.embedModel to embed the query
```
Agents
```typescript
import { OpenAIAgent } from "@llamaindex/openai";

const agent = new OpenAIAgent({
  llm: Settings.llm, // Can use Settings.llm explicitly
  tools: [ /* ... */ ],
});
```
Best Practices
- Set early: configure Settings at the top of your application, before using any LlamaIndex functionality.
- Environment variables: use environment variables for API keys; never hardcode secrets.
- Context scoping: use the `with*` methods for temporary configuration changes in specific scopes.
- Consistent models: use the same embedding model for indexing and querying to ensure compatibility.
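To follow the "never hardcode secrets" practice while still failing fast on misconfiguration, a small startup helper can validate required variables before any Settings are assigned. This is a hypothetical helper, not part of LlamaIndex:

```typescript
// Hypothetical helper (not part of LlamaIndex): read a required environment
// variable or throw immediately with a clear message.
function requireEnv(name: string): string {
  const value = process.env[name];
  if (!value) {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// Usage sketch:
// Settings.llm = new OpenAI({ apiKey: requireEnv("OPENAI_API_KEY") });
```

Failing at startup is easier to debug than the "Cannot find LLM" error surfacing later, deep inside a query.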
Next Steps
- Data Flow: learn how data flows through the ingestion and query pipeline.
- LLM Configuration: explore detailed LLM provider configuration options.