import { OpenAI } from 'openai';
import { wrapOpenAI } from 'zeroeval';

const client = wrapOpenAI(new OpenAI());

const completion = await client.chat.completions.create({
  model: 'gpt-4',
  messages: [{ role: 'user', content: 'Hello!' }]
});

Overview

wrapOpenAI() creates a proxy-based wrapper around your OpenAI client that automatically traces all API calls. The wrapper preserves all TypeScript types and client functionality while instrumenting methods for observability. If ze.init() hasn’t been called and ZEROEVAL_API_KEY is set in your environment, the SDK will automatically initialize itself.

Type Signature

function wrapOpenAI<T extends InstanceType<typeof OpenAI>>(
  client: T
): WrappedOpenAI<T>

Parameters

client (InstanceType<typeof OpenAI>, required): An instance of the OpenAI client from the openai package.

Returns

WrappedOpenAI<T> = T & { __zeroeval_wrapped?: boolean }
A wrapped OpenAI client that preserves all original types and functionality while adding automatic tracing.

Traced Operations

Chat Completions

Method: client.chat.completions.create()

Traces both streaming and non-streaming chat completions with:
  • Full input/output capture
  • Token usage metrics (inputTokens, outputTokens)
  • Throughput calculation (chars/second)
  • Streaming metrics (latency to first token)
  • Prompt metadata extraction and variable interpolation

Embeddings

Method: client.embeddings.create()

Traces embedding generation with:
  • Input text capture
  • Model information
  • Embedding dimension and count
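Calls on a wrapped client look exactly like calls on an unwrapped one. The snippet below assumes the same setup as the example at the top of this page; the model name is just an example:

```typescript
import { OpenAI } from 'openai';
import { wrapOpenAI } from 'zeroeval';

const client = wrapOpenAI(new OpenAI());

// Traced automatically: input text, model, embedding dimensions, and count.
const response = await client.embeddings.create({
  model: 'text-embedding-3-small',
  input: 'The quick brown fox',
});
```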

Images

Methods:
  • client.images.generate()
  • client.images.edit()
  • client.images.createVariation()
Traces image generation operations with request parameters.

Audio

Methods:
  • client.audio.transcriptions.create()
  • client.audio.translations.create()
Traces audio processing operations.

Proxy-Based Instrumentation

The wrapper uses JavaScript Proxies to intercept method calls without modifying the original client:
// The wrapper intercepts calls at multiple levels:
openai.chat.completions.create(...)
  └─> Proxy on openai
      └─> Proxy on chat
          └─> Proxy on completions
              └─> Wrapped create() method
This approach ensures:
  • No monkey-patching or prototype modification
  • Full type preservation
  • No interference with OpenAI SDK internals
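The interception chain above can be sketched with plain JavaScript Proxies. The helper below is an illustrative stand-in, not the actual zeroeval internals: it recursively proxies nested namespaces and wraps any function it reaches, leaving the original client untouched:

```typescript
// Illustrative sketch of proxy-based interception (not the real zeroeval code).
// Each property access returns another Proxy until a function is reached,
// at which point the call is wrapped with tracing.
function traceDeep<T extends object>(target: T, path: string[] = []): T {
  return new Proxy(target, {
    get(obj, prop, receiver) {
      const value = Reflect.get(obj, prop, receiver);
      const nextPath = [...path, String(prop)];
      if (typeof value === 'function') {
        // Wrap the method; bind to the original object so SDK internals still work.
        return (...args: unknown[]) => {
          console.log(`[trace] ${nextPath.join('.')} called`);
          return value.apply(obj, args);
        };
      }
      if (value !== null && typeof value === 'object') {
        // Recurse so nested namespaces (chat, completions, ...) are proxied too.
        return traceDeep(value, nextPath);
      }
      return value;
    },
  });
}
```

Because only property access is intercepted, the original client object and its prototypes are never mutated, which is what gives the wrapper its type preservation and non-interference guarantees.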

Streaming Support

wrapOpenAI() fully supports streaming responses. The wrapper:
  1. Detects when stream: true is set
  2. Wraps the async iterator returned by OpenAI
  3. Captures chunks as they arrive
  4. Records latency to first token
  5. Calculates throughput after completion
  6. Extracts usage information from final chunk (when available)
const stream = await client.chat.completions.create({
  model: 'gpt-4',
  messages: [{ role: 'user', content: 'Write a story' }],
  stream: true
});

// Each chunk is traced
for await (const chunk of stream) {
  // Use chunks normally
}
// Span ends when stream completes

Metadata Extraction

The wrapper automatically processes ZeroEval metadata embedded in system messages:
const completion = await client.chat.completions.create({
  model: 'gpt-4',
  messages: [
    {
      role: 'system',
      content: `
<!--zeroeval
task: customer-support
prompt_version_id: pv_abc123
variables:
  user_name: Alice
  issue_type: billing
-->
You are a helpful assistant for {{user_name}}.
Handle their {{issue_type}} issue.
      `
    },
    { role: 'user', content: 'I need help' }
  ]
});
The wrapper:
  1. Extracts metadata from the HTML comment
  2. Interpolates variables in the prompt template
  3. Removes the metadata comment before sending to OpenAI
  4. Attaches metadata to the trace span
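The steps above can be approximated with a small sketch. The regex and helper names are illustrative, not the SDK's actual implementation (which, among other things, parses the metadata block as YAML rather than returning it raw):

```typescript
// Illustrative sketch: extract a zeroeval metadata comment and interpolate
// {{variable}} placeholders. Not the SDK's actual parser.
function processSystemPrompt(content: string): {
  metadata: string | null;
  prompt: string;
} {
  const match = content.match(/<!--zeroeval\n([\s\S]*?)\n-->/);
  const metadata = match ? match[1] : null;
  // Remove the comment so it is never sent to OpenAI.
  const prompt = content.replace(/<!--zeroeval\n[\s\S]*?\n-->/, '').trim();
  return { metadata, prompt };
}

function interpolate(prompt: string, vars: Record<string, string>): string {
  // Replace {{name}} with its value; leave unknown placeholders intact.
  return prompt.replace(/\{\{(\w+)\}\}/g, (_, name) => vars[name] ?? `{{${name}}}`);
}
```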

Double-Wrap Protection

Calling wrapOpenAI() on an already-wrapped client returns the existing wrapper:
const wrapped1 = wrapOpenAI(new OpenAI());
const wrapped2 = wrapOpenAI(wrapped1); // Returns wrapped1
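The `__zeroeval_wrapped` flag in the return type above suggests how this guard works; the helper below is only a sketch of an idempotent wrapper, not the SDK source:

```typescript
// Hypothetical sketch of double-wrap protection via a marker flag.
type Wrapped<T> = T & { __zeroeval_wrapped?: boolean };

function wrapOnce<T extends object>(
  client: Wrapped<T>,
  wrap: (c: T) => T
): Wrapped<T> {
  if (client.__zeroeval_wrapped) return client; // already wrapped: return as-is
  const wrapped = wrap(client) as Wrapped<T>;
  wrapped.__zeroeval_wrapped = true; // mark so later calls are no-ops
  return wrapped;
}
```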

Error Tracing

API errors are automatically captured and attached to spans:
try {
  await client.chat.completions.create({
    model: 'invalid-model',
    messages: [{ role: 'user', content: 'Hello' }]
  });
} catch (error) {
  // Error is traced with code, message, and stack
}

Span Attributes

Each traced operation includes:
  • service.name (string): Set to "openai"
  • kind (string): Operation kind: "llm", "embedding", or "operation"
  • provider (string): Set to "openai"
  • model (string): The model name used in the request
  • messages (array): Serialized messages (for chat completions)
  • streaming (boolean): Whether the request used streaming
  • inputTokens (number): Prompt tokens consumed (from usage data)
  • outputTokens (number): Completion tokens generated (from usage data)
  • throughput (number): Characters per second for the completion
  • latency (number): Time to first token (streaming only)
  • zeroeval (object): Extracted ZeroEval metadata (task, prompt_version_id, variables)
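Put together, the attribute payload for a non-streaming chat completion might look like the following. Field names match the list above; the values are made up for illustration:

```typescript
// Illustrative span attributes for a non-streaming chat completion.
const spanAttributes = {
  'service.name': 'openai',
  kind: 'llm',
  provider: 'openai',
  model: 'gpt-4',
  messages: [{ role: 'user', content: 'Hello!' }],
  streaming: false,
  inputTokens: 9,          // from the response's usage data
  outputTokens: 12,        // from the response's usage data
  throughput: 210.5,       // chars/second of the completion
  zeroeval: { task: 'customer-support', prompt_version_id: 'pv_abc123' },
};
```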
