
Endpoint

POST /agent/start

Authentication

Requires JWT authentication via the Authorization: Bearer <token> header.

Description

Starts an agent run to process a user prompt. The agent executes in the context of a thread, which can be new or existing. The API supports file uploads, custom models, optimistic creation, and background sandbox initialization.

Key Features:
  • New or Existing Thread: Create a new thread or continue in an existing one
  • File Upload: Upload files that are processed and added to the sandbox
  • Optimistic Mode: Pre-create thread/project IDs on the client for instant UI updates
  • Streaming: Real-time progress via Server-Sent Events (SSE)
  • Billing & Limits: Automatic credit checks and concurrent run limits

Request Format

This endpoint uses multipart/form-data to support file uploads.
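
As a reference for clients without a form library, here is a minimal sketch of how a multipart/form-data body for this endpoint can be assembled with the Python standard library only. The field names match the parameters below; the helper itself is illustrative, not part of any SDK.

```python
import uuid

def encode_multipart(fields, files=()):
    """Build a multipart/form-data body.

    fields: dict of form field name -> string value
    files:  iterable of (field_name, filename, bytes)
    Returns (content_type_header_value, body_bytes).
    """
    boundary = uuid.uuid4().hex
    lines = []
    for name, value in fields.items():
        lines += [
            f"--{boundary}".encode(),
            f'Content-Disposition: form-data; name="{name}"'.encode(),
            b"",
            str(value).encode(),
        ]
    for name, filename, data in files:
        lines += [
            f"--{boundary}".encode(),
            f'Content-Disposition: form-data; name="{name}"; filename="{filename}"'.encode(),
            b"Content-Type: application/octet-stream",
            b"",
            data,
        ]
    lines.append(f"--{boundary}--".encode())
    body = b"\r\n".join(lines) + b"\r\n"
    return f"multipart/form-data; boundary={boundary}", body

# Example: a prompt plus one uploaded file, matching the parameters below.
content_type, body = encode_multipart(
    {"prompt": "Write a Python script to analyze CSV files", "optimistic": "false"},
    files=[("files", "data.csv", b"a,b\n1,2\n")],
)
```

The resulting body and the Content-Type header (including the boundary) are what an HTTP client would send in the POST request.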

Request Parameters

prompt
string
required
User message/task for the agent to execute. Required for new threads.
thread_id
string
UUID of an existing thread to continue, or a new UUID in optimistic mode.
project_id
string
UUID of the project containing the thread. Required in optimistic mode.
agent_id
string
UUID of the agent to use. If not provided, uses the user’s default agent.
model_name
string
Override the agent's default model (e.g., "kortix/basic", "openai/gpt-5-nano-2025-08-07"). Model access depends on subscription tier:
  • Free: kortix/minimax
  • Pro: kortix/basic, openai/gpt-5-nano, anthropic/claude-3.5-sonnet
  • Enterprise: All models
optimistic
string
default:"false"
Set to "true" to enable optimistic mode. Client pre-generates thread_id and project_id for instant UI updates. When enabled, thread_id and project_id must be provided as valid UUIDs.
memory_enabled
string
Set to "true" or "false" to enable/disable memory for this run.
mode
string
Execution mode (e.g., “code”, “chat”). Used for specialized agent behaviors.
files
file[]
Array of files to upload. Files are parsed, uploaded to the project's sandbox, and made available to the agent. Supported formats: text files, code files, PDFs, images, etc.
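
A client can pre-check model access before sending a request. The sketch below encodes the per-tier model lists from the model_name description above; the helper name is illustrative, and the "pro" set is taken literally from the list (not assumed to be cumulative).

```python
# Model availability per tier, as listed under model_name above.
TIER_MODELS = {
    "free": {"kortix/minimax"},
    "pro": {"kortix/basic", "openai/gpt-5-nano", "anthropic/claude-3.5-sonnet"},
    "enterprise": None,  # None = all models
}

def model_allowed(tier: str, model_name: str) -> bool:
    """Return True if the subscription tier includes the model (illustrative helper)."""
    if tier.lower() not in TIER_MODELS:
        return False
    allowed = TIER_MODELS[tier.lower()]
    return allowed is None or model_name in allowed
```

Requesting a model outside the tier produces a 402 MODEL_ACCESS_DENIED error (see Error Responses below), so a check like this avoids a round trip.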

Response

Returns immediately with run metadata. The agent executes in the background.
thread_id
string
UUID of the thread (new or existing)
agent_run_id
string
Unique identifier for this agent run (UUID)
project_id
string
UUID of the project containing the thread
sandbox_id
string
UUID of the sandbox where files are uploaded. Only returned if files were uploaded or sandbox was created.
status
string
Initial run status, always "running"
timing_breakdown
object
Performance timing information (only if X-Emit-Timing: true header set by super admin)

Examples

curl -X POST https://api.kortix.ai/agent/start \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "prompt=Write a Python script to analyze CSV files" \
  -F "agent_id=550e8400-e29b-41d4-a716-446655440000"

Response Example

{
  "thread_id": "770fa622-a4bd-63f6-c938-668877662222",
  "agent_run_id": "880ab733-b5ce-74a7-d049-779988773333",
  "project_id": "990bc844-c6df-85b8-e150-880099884444",
  "sandbox_id": "aa1cd955-d7ea-96c9-f261-991100995555",
  "status": "running"
}

Streaming Progress

Agent runs stream real-time events via Server-Sent Events (SSE):
GET /agent-run/{agent_run_id}/stream
See the Stream Agent Run documentation for details.
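
For orientation, a minimal sketch of decoding SSE text into events: SSE separates events with blank lines, and each payload line starts with "data: ". The JSON payload shape shown in the example stream is an assumption, not a documented schema.

```python
import json

def parse_sse_events(raw: str):
    """Yield decoded JSON payloads from raw SSE text."""
    for chunk in raw.split("\n\n"):
        # An event may span several "data:" lines, joined by newlines.
        data_lines = [line[len("data: "):] for line in chunk.splitlines()
                      if line.startswith("data: ")]
        if data_lines:
            yield json.loads("\n".join(data_lines))

# Example stream text (payload fields are illustrative).
stream = (
    'data: {"type": "status", "status": "running"}\n\n'
    'data: {"type": "status", "status": "completed"}\n\n'
)
events = list(parse_sse_events(stream))
```

A real client would apply the same parsing to the chunked response body of the GET stream endpoint.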

Error Responses

402 Payment Required
error
Billing or limit issues
Insufficient Credits:
{
  "detail": {
    "message": "Billing check failed: Insufficient credits",
    "error_code": "INSUFFICIENT_CREDITS"
  }
}
Model Access Denied:
{
  "detail": {
    "message": "Your current subscription plan does not include access to openai/gpt-5-nano-2025-08-07. Please upgrade your subscription.",
    "tier_name": "free",
    "error_code": "MODEL_ACCESS_DENIED"
  }
}
Concurrent Run Limit:
{
  "detail": {
    "message": "Maximum of 1 concurrent runs. You have 1 running.",
    "running_thread_ids": ["770fa622-a4bd-63f6-c938-668877662222"],
    "running_count": 1,
    "limit": 1,
    "error_code": "AGENT_RUN_LIMIT_EXCEEDED"
  }
}
Thread Limit:
{
  "detail": {
    "message": "Maximum of 10 threads allowed.",
    "current_count": 10,
    "limit": 10,
    "error_code": "THREAD_LIMIT_EXCEEDED"
  }
}
400 Bad Request
error
Invalid request parameters
{
  "detail": "prompt required when creating new thread"
}
Or optimistic mode validation:
{
  "detail": "thread_id and project_id required for optimistic mode"
}
404 Not Found
error
Thread not found (when using existing thread)
{
  "detail": "Thread not found"
}
401 Unauthorized
error
Missing or invalid authentication token
{
  "detail": "Not authenticated"
}
500 Internal Server Error
error
Server error during run startup
{
  "detail": "Failed to start agent: <error details>"
}

Optimistic Mode

Optimistic mode allows instant UI updates by pre-generating IDs on the client:
import { v4 as uuidv4 } from 'uuid';

// Client pre-generates IDs
const threadId = uuidv4();
const projectId = uuidv4();

// Show in UI immediately
showThread(threadId, { status: 'creating' });

// Start run with pre-generated IDs
const run = await client.agents.run({
  prompt: 'Generate a report',
  threadId,
  projectId,
  optimistic: true,
});

// UI already showing thread - just update status
updateThreadStatus(threadId, 'running');
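
The same flow in Python, with a client-side UUID validity check, since the endpoint rejects optimistic requests whose IDs are not valid UUIDs (the form dict mirrors the request parameters above; no SDK is assumed):

```python
import uuid

def optimistic_ids():
    """Pre-generate the thread and project IDs required by optimistic mode."""
    return str(uuid.uuid4()), str(uuid.uuid4())

def is_valid_uuid(value: str) -> bool:
    """Check the UUID format locally before sending the request."""
    try:
        uuid.UUID(value)
        return True
    except ValueError:
        return False

thread_id, project_id = optimistic_ids()

# Multipart form fields for the optimistic request; values are strings.
form = {
    "prompt": "Generate a report",
    "thread_id": thread_id,
    "project_id": project_id,
    "optimistic": "true",
}
```

Validating locally avoids the 400 "thread_id and project_id required for optimistic mode" round trip.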

File Upload Details

File Processing:
  1. Files are parsed on the server
  2. Sandbox is created/resolved for the project
  3. Files are uploaded to sandbox storage
  4. Agent can access files via /workspace directory
File Limits:
  • Maximum file size: Varies by tier
  • Supported formats: Text, code, PDFs, images, documents
File Accessibility:
# Files are available in the sandbox at /workspace
run = client.agents.run(
    prompt="Analyze data.csv and create visualizations",
    files=[open('data.csv', 'rb')]
)

# Agent can access the file:
# - Read: /workspace/data.csv
# - Process: pandas.read_csv('/workspace/data.csv')
# - Output: /workspace/output/charts.png

Concurrent Run Limits

Limits based on subscription tier:
  • Free: 1 concurrent run
  • Pro: 3 concurrent runs
  • Enterprise: 10+ concurrent runs
To check running threads:
try:
    run = client.agents.run(prompt="Task")
except Exception as e:
    if 'AGENT_RUN_LIMIT_EXCEEDED' in str(e):
        # Wait for a run to complete or stop an existing run
        active_runs = client.agents.list_active()
        print(f"Active runs: {[r.thread_id for r in active_runs]}")

Background Execution

The /agent/start endpoint returns immediately while the agent executes in the background:
  1. Immediate Response: Returns agent_run_id, thread_id, status: "running"
  2. Background Task: Agent execution runs asynchronously
  3. Streaming: Progress streamed via SSE to /agent-run/{agent_run_id}/stream
  4. Completion: Status updated to completed, failed, or stopped
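
The lifecycle above implies a simple client loop: keep consuming the stream while the run is "running", and stop on any terminal status from step 4. A tiny helper sketch (names are illustrative):

```python
# Terminal statuses from the completion step above.
TERMINAL_STATUSES = {"completed", "failed", "stopped"}

def next_action(status: str) -> str:
    """Decide what a streaming client should do given the current run status."""
    return "done" if status in TERMINAL_STATUSES else "keep_streaming"
```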

Notes

  • Fast Response: API returns in less than 100ms by using background execution
  • Sandbox Creation: Sandboxes are created on demand: blocking when files are uploaded, in the background otherwise
  • Cache Warming: Credits and user context are prewarmed in background
  • Model Override: Free tier users requesting kortix/basic are auto-downgraded to kortix/minimax
  • Slot Reservation: Concurrent slot is reserved before execution to enforce limits
  • Cleanup: Slots are automatically released on completion, failure, or timeout
