Supported Providers
Page Assist includes built-in support for many popular providers.

Cloud Providers
- OpenAI - GPT models (GPT-4, GPT-3.5, etc.)
- Anthropic (Claude) - Claude models
- Google AI - Gemini models
- Groq - Fast inference API
- DeepSeek - DeepSeek models
- Fireworks - Fast inference platform
- Together AI - Open source models
- OpenRouter - Unified API gateway
- Mistral - Mistral AI models
- xAI - Grok models
- Novita AI - AI inference platform
- Hugging Face - Inference API
- Infinigence AI - AI platform
- SiliconFlow - AI inference
- Moonshot - AI platform
- BigModel (Zhipu) - Chinese AI models
- MiniMax - Chinese AI models
- VolcEngine - ByteDance AI
- TencentCloud - Tencent AI
- AlibabaCloud - Alibaba AI
Self-Hosted / Local
- LLaMA.cpp - C++ implementation
- Llamafile - Single-file executables
- vLLM - High-performance inference
- Ollama (secondary instances)
- Vercel AI Gateway - AI proxy service
- Chutes - Custom deployments
Adding a Provider
Select Provider
Choose your provider from the dropdown:
- If listed, select it directly (e.g., “OpenAI”, “Anthropic”, “Groq”)
- If not listed, select “Custom”
Add Models (if required)
Some providers require manual model entry:
- Auto-detected: Ollama, LM Studio, LLaMA.cpp, Llamafile, vLLM
- Manual entry: Most cloud providers
Example model identifiers:
- gpt-4-turbo
- claude-3-5-sonnet-20241022
- llama-3.1-70b-versatile
Provider-Specific Setup
OpenAI
Get API Key
- Visit platform.openai.com
- Sign in or create an account
- Navigate to API Keys
- Create a new API key
Anthropic (Claude)
Get API Key
- Visit console.anthropic.com
- Sign in or create an account
- Navigate to API Keys
- Create a new API key
Groq
Get API Key
- Visit console.groq.com
- Sign in with your account
- Navigate to API Keys
- Create a new API key
LLaMA.cpp Server
Run the llama.cpp server (llama-server), then use its address as the Base URL (typically http://localhost:8080/v1).
vLLM
Start vLLM's OpenAI-compatible server (vllm serve), then use its address as the Base URL (typically http://localhost:8000/v1).
Custom Provider
For any unlisted OpenAI-compatible service:

Enter Details
- Provider Name: Give it a recognizable name
- Base URL: Your API endpoint (must end with /v1)
- API Key: If required by your service
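Since the Base URL must end with /v1, a small normalization step avoids the most common misconfiguration. This is an illustrative sketch, not part of Page Assist; the function name is hypothetical.

```python
# Sketch: normalize a custom provider's Base URL so it ends with /v1,
# as expected for OpenAI-compatible endpoints.
def normalize_base_url(url: str) -> str:
    url = url.strip().rstrip("/")
    if not url.endswith("/v1"):
        url += "/v1"
    return url

print(normalize_base_url("https://my-llm.example.com"))
# -> https://my-llm.example.com/v1
```

Chat requests would then be sent to paths under this base, e.g. `<base>/chat/completions`.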
Model Management
Auto-Detection
The following providers support automatic model detection:
- Ollama (secondary instances)
- LM Studio
- LLaMA.cpp
- Llamafile
- vLLM
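Auto-detection works because these servers expose an OpenAI-compatible /v1/models endpoint. The sketch below shows how such a response can be parsed into a model list; the JSON is a typical example of the response shape, not captured from a real server.

```python
import json

# Typical shape of an OpenAI-compatible GET /v1/models response.
sample = '{"object": "list", "data": [{"id": "llama-3.1-8b", "object": "model"}]}'

def list_model_ids(body: str) -> list[str]:
    """Extract model identifiers from a /v1/models response body."""
    return [m["id"] for m in json.loads(body)["data"]]

print(list_model_ids(sample))  # -> ['llama-3.1-8b']
```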
Manual Model Entry
For most cloud providers, you need to manually add models:
- After saving the provider, click “Add Model”
- Enter the exact model identifier
- Optionally specify if it’s a chat or embedding model
- Save the model
Model Types
Specify the model type for proper functionality:
- Chat Models: For conversations (GPT-4, Claude, etc.)
- Embedding Models: For RAG features (text-embedding-ada-002, etc.)
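The type matters because chat and embedding models are called on different endpoints. A minimal sketch, following the OpenAI API path convention (the function name is illustrative):

```python
# Sketch: the endpoint an OpenAI-compatible client calls for each model type.
def endpoint_for(base_url: str, model_type: str) -> str:
    paths = {"chat": "/chat/completions", "embedding": "/embeddings"}
    return base_url.rstrip("/") + paths[model_type]

print(endpoint_for("https://api.openai.com/v1", "chat"))
# -> https://api.openai.com/v1/chat/completions
```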
Authentication
API Keys
Most cloud providers require API keys.

Custom Headers
Some providers require custom headers:
- In provider settings, expand “Advanced Options”
- Add custom headers as key-value pairs:
Bearer Tokens
For services using bearer tokens:
- Enter the token in the “API Key” field
- Or add a custom header:
  Authorization: Bearer your-token
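Both options produce the same request headers. A sketch of the resulting headers, assuming a standard bearer-token scheme:

```python
# Sketch: headers produced from a key entered in the "API Key" field.
# The custom-header route yields the same Authorization value.
def auth_headers(api_key: str) -> dict[str, str]:
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }

print(auth_headers("your-token"))
```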
Troubleshooting
Connection Failed
- Verify the Base URL is correct and ends with /v1
- Confirm the service is running and reachable from your browser
Models Not Loading
- Check if models need manual entry for your provider
- Verify the API endpoint supports /v1/models for auto-detection
- Try manually adding model names
- Check API key permissions
Authentication Errors
- Verify API key format is correct
- Check for extra spaces in the API key
- Ensure the API key has necessary permissions
- Verify the API key hasn’t expired
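Extra whitespace from copy-paste is the most common cause of authentication errors. A quick sanity-check sketch (illustrative, not part of Page Assist):

```python
# Sketch: flag common copy-paste problems in an API key.
def check_key(key: str) -> list[str]:
    problems = []
    if key != key.strip():
        problems.append("leading/trailing whitespace")
    if any(c.isspace() for c in key.strip()):
        problems.append("embedded whitespace")
    if not key.strip():
        problems.append("empty key")
    return problems

print(check_key(" sk-abc "))  # -> ['leading/trailing whitespace']
```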
Rate Limiting
If you encounter rate limit errors:
- Wait before sending more requests
- Check your provider’s rate limits
- Consider upgrading your plan
- Use a different provider for high-volume usage
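"Wait before sending more requests" is usually implemented as exponential backoff on HTTP 429 responses. A minimal sketch, where `send` is a placeholder for whatever performs the request and returns its status code:

```python
import time

# Sketch: retry with exponential backoff while the server returns 429.
def with_backoff(send, retries: int = 4, base_delay: float = 1.0) -> int:
    for attempt in range(retries):
        status = send()
        if status != 429:
            return status
        time.sleep(base_delay * 2 ** attempt)  # 1s, 2s, 4s, ...
    return status
```

Providers that include a Retry-After header give a better wait time than a fixed schedule; honor it when present.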
Best Practices
- Secure Your API Keys: Never share your API keys publicly
- Monitor Usage: Track API usage to avoid unexpected charges
- Test with Small Models: Start with smaller/cheaper models for testing
- Set Usage Limits: Configure spending limits in your provider’s dashboard
- Use Multiple Providers: Distribute usage across providers for reliability
- Keep Keys Rotated: Regularly rotate API keys for security
Provider Comparison
| Provider | Strengths | Best For |
|---|---|---|
| OpenAI | Most capable models | Complex tasks, reasoning |
| Anthropic | Long context, safety | Document analysis, safe AI |
| Groq | Ultra-fast inference | Speed-critical applications |
| Local (Ollama/LM Studio) | Privacy, no costs | Private data, development |
| Together AI | Open models, good pricing | Open source model access |
| OpenRouter | Access to many models | Model variety, fallbacks |
Next Steps
- Learn about Ollama setup
- Explore LM Studio integration
- Set up Chrome AI
- Configure Knowledge Base features