Create Chat

Endpoint

POST /api/create-chat

Create a new chat session from a PDF file. This endpoint processes the PDF, generates vector embeddings, stores them in Pinecone, and creates a chat record in the database.

This endpoint requires authentication via Clerk. Ensure the user is authenticated before making this request.

Authentication

Requires a valid Clerk session. The endpoint extracts the userId from the authentication context.

Request Body

file_key

string

required

The S3 file key for the uploaded PDF. This should be the key returned after uploading a file to S3. Example: "uploads/user123/document-abc123.pdf"

file_name

string

required

The display name for the PDF file. This will be shown to users in the chat interface. Example: "Q4 Financial Report.pdf"
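
Because both fields are required strings, a small client-side guard can catch mistakes before the request is sent. The following is a minimal sketch; `buildCreateChatBody` is a hypothetical helper name introduced here, not something the API provides.

```javascript
// Hypothetical helper (not part of the API): checks the two required
// fields and returns the JSON string to use as the request body.
function buildCreateChatBody(fileKey, fileName) {
  const fields = [
    ['file_key', fileKey],
    ['file_name', fileName],
  ];
  for (const [name, value] of fields) {
    if (typeof value !== 'string' || value.trim() === '') {
      throw new TypeError(`${name} is required and must be a non-empty string`);
    }
  }
  return JSON.stringify({ file_key: fileKey, file_name: fileName });
}
```

The returned string can be passed directly as the `body` option of `fetch`.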

Response

chat_id

number

The unique identifier for the newly created chat session. Use this ID for subsequent chat operations.

Success Response (200)

{
  "chat_id": 42
}

Error Responses

error

string

Error message describing what went wrong

401 Unauthorized

Returned when the user is not authenticated:

{
  "error": "Authentication error"
}

500 Internal Server Error

Returned when PDF processing or database operations fail:

{
  "error": "internal server error"
}
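
One way a client might separate the documented 401 and 500 cases is sketched below. `parseCreateChatResponse` is a hypothetical helper, and the wording of the thrown messages is an assumption made for illustration; only the status codes and the `error` / `chat_id` fields come from this page.

```javascript
// Hypothetical helper: resolves to chat_id on success and throws a
// descriptive Error for the documented failure responses.
async function parseCreateChatResponse(response) {
  let data = {};
  try {
    data = await response.json();
  } catch {
    // Body was not valid JSON; fall through with an empty object.
  }
  if (response.status === 401) {
    // Assumption: the caller reacts by redirecting to sign-in.
    throw new Error(`Not authenticated: ${data.error}`);
  }
  if (!response.ok) {
    throw new Error(`Create chat failed (${response.status}): ${data.error}`);
  }
  return data.chat_id;
}
```

Pass it the object returned by `fetch('/api/create-chat', ...)` and wrap the call in `try`/`catch` to surface the message to the user.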

Example Request

const response = await fetch('/api/create-chat', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    file_key: 'uploads/user123/document-abc123.pdf',
    file_name: 'Q4 Financial Report.pdf'
  })
});

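const data = await response.json();
if (!response.ok) {
  // 401 and 500 responses carry an `error` string instead of `chat_id`
  throw new Error(data.error);
}
console.log('Chat ID:', data.chat_id);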

Example Response

{
  "chat_id": 42
}

How It Works

1. Authentication Check: Verifies the user is authenticated via Clerk
2. PDF Processing: Downloads the PDF from S3 using the file_key
3. Text Extraction: Extracts text content from the PDF
4. Chunking: Splits the text into manageable chunks for embedding
5. Embedding: Generates vector embeddings using OpenAI’s embedding model
6. Vector Storage: Stores embeddings in Pinecone for semantic search
7. Database Record: Creates a chat record in the database with:
   - fileKey: S3 file key
   - pdfName: Display name
   - pdfUrl: Public S3 URL for the PDF
   - userId: Authenticated user’s ID
8. Response: Returns the newly created chat ID
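
The chunking step can be pictured with a short sketch. This page does not say how the endpoint actually splits text, so the fixed-size character window with overlap below is an assumption chosen for illustration, and `chunkText`, `chunkSize`, and `overlap` are hypothetical names.

```javascript
// Illustrative only: fixed-size character chunks with overlap.
// The endpoint's real splitting strategy is not documented here.
function chunkText(text, chunkSize = 1000, overlap = 200) {
  if (chunkSize <= 0 || overlap < 0 || overlap >= chunkSize) {
    throw new RangeError('require chunkSize > 0 and 0 <= overlap < chunkSize');
  }
  const chunks = [];
  const step = chunkSize - overlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    // Stop once this chunk has reached the end of the text.
    if (start + chunkSize >= text.length) break;
  }
  return chunks;
}
```

Overlapping neighbouring chunks keeps a sentence that straddles a boundary retrievable from at least one chunk, at the cost of embedding some text twice.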

The PDF processing and embedding generation may take 10-30 seconds depending on document size. Consider implementing a loading state or webhook for completion notifications.
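
Since the request can stay open for tens of seconds, a client may want to toggle a loading indicator and give up after a generous timeout. The sketch below shows one way to do that with the standard `AbortController`; the `timeoutMs`, `onLoadingChange`, and `fetchImpl` options are hypothetical names introduced for this example, and the 60-second default is an assumption.

```javascript
// Sketch only: the options object is invented for this example and is
// not part of the documented API.
async function createChatWithLoading(body, options = {}) {
  const { timeoutMs = 60000, onLoadingChange = () => {}, fetchImpl = fetch } = options;
  const controller = new AbortController();
  // Abort the request if processing takes longer than timeoutMs.
  const timer = setTimeout(() => controller.abort(), timeoutMs);
  onLoadingChange(true);
  try {
    const response = await fetchImpl('/api/create-chat', {
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify(body),
      signal: controller.signal,
    });
    return await response.json();
  } finally {
    clearTimeout(timer);
    onLoadingChange(false);
  }
}
```

In a UI, `onLoadingChange` would typically set a piece of state that shows or hides a spinner.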

Chat Record Schema

When a chat is created, the following data is stored:

Field	Type	Description
`id`	integer	Unique chat identifier (auto-generated)
`pdfName`	string	Display name of the PDF
`pdfUrl`	string	S3 URL for accessing the PDF
`fileKey`	string	S3 file key
`userId`	string	Clerk user ID (max 255 chars)
`createdAt`	timestamp	Chat creation timestamp
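
To make the schema concrete, here is a sketch of how the caller-supplied values might map onto a record before insertion. It is not the endpoint's actual code: `buildPdfUrl` and `toChatInsert` are hypothetical helpers, the bucket and region parameters are assumptions, and the URL uses S3's virtual-hosted-style format, which may differ from what the server really stores. `id` and `createdAt` are omitted because the table describes them as generated.

```javascript
// Assumption: the stored pdfUrl follows S3's virtual-hosted-style format.
function buildPdfUrl(bucket, region, fileKey) {
  // Encode each path segment so keys containing spaces stay valid in a URL.
  const encodedKey = fileKey.split('/').map(encodeURIComponent).join('/');
  return `https://${bucket}.s3.${region}.amazonaws.com/${encodedKey}`;
}

// Hypothetical mapping from request fields to the columns listed above.
function toChatInsert({ fileKey, fileName, userId, bucket, region }) {
  if (userId.length > 255) {
    throw new RangeError('userId exceeds the 255-character column limit');
  }
  return {
    fileKey,
    pdfName: fileName,
    pdfUrl: buildPdfUrl(bucket, region, fileKey),
    userId,
  };
}
```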

Prerequisites

Before calling this endpoint:

User Authentication: Ensure the user is signed in via Clerk
PDF Upload: Upload the PDF to S3 and obtain the file_key
S3 Configuration: Verify S3 bucket permissions allow the API to read the file
Pinecone Setup: Ensure Pinecone index is configured and accessible

Best Practices

Validate the PDF file before uploading to S3
Use descriptive file_name values for better UX
Store the returned chat_id in your application state
Implement error handling for failed PDF processing
Enforce a file size limit (a maximum of 50 MB per PDF is recommended)
Show upload progress and processing status to users
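
The first and fifth practices above can be combined into a quick pre-upload check. The sketch below tests the file size against the recommended 50 MB limit and looks for the `%PDF-` signature that PDF files start with. `validatePdf` is a hypothetical helper, and a header check alone does not prove the file is a well-formed PDF.

```javascript
const MAX_PDF_BYTES = 50 * 1024 * 1024; // recommended limit from this page

// Hypothetical helper: headerBytes is a Uint8Array holding at least the
// first 5 bytes of the file; sizeInBytes is the full file size.
function validatePdf(headerBytes, sizeInBytes, maxBytes = MAX_PDF_BYTES) {
  if (sizeInBytes === 0) {
    return { ok: false, reason: 'File is empty' };
  }
  if (sizeInBytes > maxBytes) {
    return { ok: false, reason: `File exceeds ${maxBytes} bytes` };
  }
  const signature = String.fromCharCode(...headerBytes.slice(0, 5));
  if (signature !== '%PDF-') {
    return { ok: false, reason: 'File does not start with a PDF signature' };
  }
  return { ok: true };
}
```

In a browser, the header could be read with `new Uint8Array(await file.slice(0, 5).arrayBuffer())` and the size taken from `file.size`.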

For large PDFs, consider implementing a job queue system to handle processing asynchronously and notify users when the chat is ready.

Related Operations

After creating a chat:

Use the returned chat_id to send messages via /api/chat
Retrieve message history via /api/get-messages
Display the PDF using the stored pdfUrl
