ZeroEval provides seamless integrations with popular LLM frameworks and SDKs. These integrations automatically capture traces, token usage, latency metrics, and more without requiring manual instrumentation.

Auto-detection with wrap()

The wrap() function automatically detects the client type and applies the appropriate wrapper:
import { OpenAI } from 'openai';
import * as ai from 'ai';
import * as ze from 'zeroeval';

// Automatically detects OpenAI client
const openai = ze.wrap(new OpenAI());

// Automatically detects Vercel AI SDK module
const wrappedAI = ze.wrap(ai);
The wrapper detects the client type by checking for specific properties and constructor names. If the client type is unsupported, it throws an error with helpful guidance.
If you haven’t called ze.init() and ZEROEVAL_API_KEY is set in your environment, the SDK will automatically initialize when you use wrap().
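
To make the detection idea concrete, here is a minimal sketch of constructor-name and property-based detection. This is illustrative only, not the SDK's actual implementation; the property checks (`chat`, `completions`, `generateText`, `streamText`) are assumptions about what distinguishes each client:

```typescript
// Illustrative sketch of client-type detection -- not the real zeroeval code.
function detectClientType(client: unknown): 'openai' | 'ai-sdk' | 'unknown' {
  if (client === null || typeof client !== 'object') return 'unknown';
  const obj = client as any;
  // OpenAI clients are recognizable by constructor name or by
  // characteristic resource properties such as `chat` and `completions`.
  if (obj.constructor?.name === 'OpenAI' || ('chat' in obj && 'completions' in obj)) {
    return 'openai';
  }
  // The Vercel AI SDK module exports functions such as `generateText` and `streamText`.
  if (typeof obj.generateText === 'function' && typeof obj.streamText === 'function') {
    return 'ai-sdk';
  }
  return 'unknown';
}

function wrap<T>(client: T): T {
  if (detectClientType(client) === 'unknown') {
    throw new Error('Unsupported client: pass an OpenAI client or the Vercel AI SDK module.');
  }
  return client; // the real wrapper returns a tracing Proxy here
}
```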

Supported integrations

OpenAI

Wrap OpenAI clients for automatic tracing

Vercel AI SDK

Wrap AI SDK functions for automatic tracing

LangChain

Use callback handlers for LangChain and LangGraph

What gets traced

All integrations automatically capture:
  • Input and output data - Prompts, messages, and completions
  • Token usage - Prompt tokens, completion tokens, and total tokens
  • Latency metrics - Time to first token (streaming), total duration
  • Throughput - Characters or tokens per second
  • Model parameters - Temperature, max tokens, top-p, etc.
  • Errors - Exception messages and stack traces
  • Metadata - Tags, custom attributes, and ZeroEval prompt metadata

Integration patterns

Proxy-based wrapping

OpenAI and Vercel AI integrations use JavaScript proxies to intercept API calls:
  • Non-invasive - Original client behavior is preserved
  • Type-safe - Full TypeScript support with original types
  • Double-wrap protection - Safe to wrap multiple times
  • Automatic initialization - Uses ZEROEVAL_API_KEY from environment
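
The pattern behind these properties can be sketched in a few lines. This is a simplified illustration of proxy-based interception, not the actual zeroeval wrapper; the `onCall` callback and `WRAPPED` marker are hypothetical stand-ins for the SDK's span recording:

```typescript
// Minimal sketch of proxy-based method interception (illustrative).
const WRAPPED = Symbol('wrapped');

function wrapWithTracing<T extends object>(
  client: T,
  onCall: (method: string, ms: number) => void
): T {
  // Double-wrap protection: return the client untouched if already wrapped.
  if ((client as any)[WRAPPED]) return client;
  return new Proxy(client, {
    get(target, prop, receiver) {
      if (prop === WRAPPED) return true;
      const value = Reflect.get(target, prop, receiver);
      if (typeof value !== 'function') return value;
      // Intercept method calls while preserving original behavior.
      return function (...args: unknown[]) {
        const start = Date.now();
        const result = value.apply(target, args);
        onCall(String(prop), Date.now() - start);
        return result;
      };
    },
  }) as T;
}
```

Because the proxy forwards every property access to the original client, TypeScript types and non-method properties pass through unchanged.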

Callback handlers

LangChain uses the callback handler pattern:
  • Set globally with setGlobalCallbackHandler()
  • Or pass per-invocation in callbacks option
  • Traces chains, LLMs, tools, retrievers, and agents
  • Supports LangGraph workflows
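
Reduced to its essentials, the callback-handler pattern looks like this. The interface and `runChain` function below are illustrative stand-ins, not the real LangChain or zeroeval APIs, though the global-vs-per-invocation choice mirrors the two options above:

```typescript
// Illustrative sketch of the callback-handler pattern (not real LangChain types).
interface TraceCallbackHandler {
  onStart(runType: string, input: string): void;
  onEnd(runType: string, output: string): void;
}

let globalHandler: TraceCallbackHandler | null = null;

function setGlobalCallbackHandler(h: TraceCallbackHandler): void {
  globalHandler = h;
}

// A "chain" notifies the handler around its work, preferring a
// per-invocation handler from options over the global one.
function runChain(input: string, options?: { callbacks?: TraceCallbackHandler }): string {
  const handler = options?.callbacks ?? globalHandler;
  handler?.onStart('chain', input);
  const output = input.toUpperCase(); // stand-in for real chain work
  handler?.onEnd('chain', output);
  return output;
}
```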

Streaming support

All integrations support streaming responses:
  • Captures time to first token as latency metric
  • Tracks throughput as characters per second
  • Automatically accumulates full response for tracing
  • Yields chunks transparently to your code
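
The mechanics can be sketched with an async generator that sits between the provider stream and your code. This is a simplified illustration, assuming a hypothetical `onDone` callback in place of the SDK's span recording:

```typescript
// Illustrative sketch: instrument a stream transparently while recording
// time to first token, the accumulated text, and characters per second.
interface StreamStats {
  timeToFirstTokenMs: number;
  fullText: string;
  charsPerSecond: number;
}

async function* traceStream(
  chunks: AsyncIterable<string>,
  onDone: (stats: StreamStats) => void
): AsyncGenerator<string> {
  const start = Date.now();
  let firstAt: number | null = null;
  let fullText = '';
  for await (const chunk of chunks) {
    if (firstAt === null) firstAt = Date.now(); // first token arrived
    fullText += chunk; // accumulate the full response for tracing
    yield chunk; // pass each chunk through to the caller unchanged
  }
  const totalMs = Math.max(Date.now() - start, 1);
  onDone({
    timeToFirstTokenMs: (firstAt ?? Date.now()) - start,
    fullText,
    charsPerSecond: (fullText.length / totalMs) * 1000,
  });
}
```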

ZeroEval metadata extraction

Integrations automatically detect and process ZeroEval prompt metadata embedded in system messages:
const messages = [
  {
    role: 'system',
    content: `<zeroeval task="greeting" prompt_version_id="abc123" variables='{"name":"Alice"}'>\nYou are a friendly assistant. Greet {{name}}.</zeroeval>`
  },
  { role: 'user', content: 'Hello!' }
];
The wrapper will:
  1. Extract metadata (task, prompt_version_id, variables)
  2. Strip the <zeroeval> tags from the message
  3. Interpolate variables like {{name}}
  4. Attach metadata to the span for filtering and analysis
  5. Look up bound models from prompt versions
See the Prompts guide for more details on prompt management.
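
Steps 1 through 4 can be sketched as a pure function over the message content. This is an illustration of the mechanics, not the SDK's implementation (step 5 requires a server-side prompt-version lookup and is omitted); the `ZeroEvalMeta` shape is assumed:

```typescript
// Illustrative sketch of steps 1-4: extract metadata, strip the tags,
// interpolate {{variable}} placeholders, and return both pieces.
interface ZeroEvalMeta {
  task?: string;
  promptVersionId?: string;
  variables: Record<string, string>;
}

function processZeroEvalMessage(content: string): { content: string; meta: ZeroEvalMeta | null } {
  const match = content.match(/<zeroeval([^>]*)>([\s\S]*?)<\/zeroeval>/);
  if (!match) return { content, meta: null }; // not a zeroeval-tagged message
  const [, attrs, inner] = match;
  // Read a single- or double-quoted attribute value from the tag.
  const attr = (name: string) =>
    attrs.match(new RegExp(`${name}=(?:"([^"]*)"|'([^']*)')`))?.slice(1).find(Boolean);
  const variables: Record<string, string> = JSON.parse(attr('variables') ?? '{}');
  // Replace {{name}}-style placeholders with values from the variables attribute.
  const interpolated = inner.trim().replace(/\{\{(\w+)\}\}/g, (_, k) => variables[k] ?? `{{${k}}}`);
  return {
    content: interpolated,
    meta: { task: attr('task'), promptVersionId: attr('prompt_version_id'), variables },
  };
}
```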

Next steps

Choose your integration to get started:

OpenAI integration

Vercel AI integration

LangChain integration
