Answer Engine Overview

Orama’s Answer Engine enables you to create ChatGPT, Perplexity, or SearchGPT-like experiences using Retrieval Augmented Generation (RAG). Combine powerful full-text search with AI-powered chat capabilities to build intelligent conversational interfaces.

What is RAG?

Retrieval Augmented Generation (RAG) is a technique that enhances large language models by providing them with relevant context from your data. Instead of relying solely on the model’s training data, RAG:
  1. Searches your database for relevant documents
  2. Retrieves the most relevant results as context
  3. Generates responses based on both the query and the retrieved context
This approach dramatically improves accuracy, reduces hallucinations, and keeps responses grounded in your actual data.
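The three steps above can be sketched in plain TypeScript. This is an illustrative sketch of the RAG flow, not Orama's implementation: `searchDocs`, `buildContext`, and `generate` are hypothetical stand-ins for the full-text search engine and the LLM call.

```typescript
// Illustrative RAG pipeline. All names here are hypothetical stand-ins,
// not part of the Orama API.
type Doc = { name: string; description: string }

const docs: Doc[] = [
  { name: 'John Doe', description: 'Software engineer' },
  { name: 'Jane Doe', description: 'Product manager' }
]

// 1. Search: a naive keyword match standing in for full-text search
function searchDocs(query: string): Doc[] {
  const q = query.toLowerCase()
  return docs.filter(
    (d) =>
      d.name.toLowerCase().includes(q) ||
      d.description.toLowerCase().includes(q)
  )
}

// 2. Retrieve: format the most relevant results as context for the model
function buildContext(results: Doc[]): string {
  return results.map((d) => `${d.name}: ${d.description}`).join('\n')
}

// 3. Generate: a real system would send query + context to an LLM here
function generate(query: string, context: string): string {
  return `Answering "${query}" using context:\n${context}`
}

const query = 'john'
const answer = generate(query, buildContext(searchDocs(query)))
console.log(answer)
```

Because the model only answers from the retrieved context, responses stay grounded in your data rather than in whatever the model happened to memorize during training.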

Key Features

Streaming Responses

Get real-time, token-by-token responses for a smooth user experience

Source Attribution

Track which documents were used to generate each response

Session Management

Maintain conversation context across multiple interactions

Custom Prompts

Customize system prompts to control AI behavior
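Session management and custom prompts can be pictured as a growing message list that provides context for each new turn. The sketch below is a conceptual illustration, not Orama's internal types or API:

```typescript
// Illustrative sketch of session state: a message history that grows with
// each interaction, so later turns can see earlier ones. Not Orama's internals.
type Message = { role: 'system' | 'user' | 'assistant'; content: string }

class SessionSketch {
  private messages: Message[] = []

  // The custom system prompt anchors the conversation and controls behavior
  constructor(systemPrompt: string) {
    this.messages.push({ role: 'system', content: systemPrompt })
  }

  // Each question and answer is appended to the shared context
  record(question: string, answer: string): void {
    this.messages.push({ role: 'user', content: question })
    this.messages.push({ role: 'assistant', content: answer })
  }

  history(): Message[] {
    return [...this.messages]
  }
}

const sketch = new SessionSketch('You are a helpful assistant.')
sketch.record('Who is John?', 'John Doe is a software engineer.')
sketch.record('And Jane?', 'Jane Doe is a product manager.')
console.log(sketch.history().length) // 1 system message + 2 full turns
```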

Quick Start

Here’s a minimal example to get you started:
import { create, insert, AnswerSession } from '@orama/orama'
import { pluginSecureProxy } from '@orama/plugin-secure-proxy'

// Step 1: Configure the Secure Proxy Plugin
const secureProxy = await pluginSecureProxy({
  apiKey: 'my-api-key',
  defaultProperty: 'embeddings',
  models: {
    // The chat model to use to generate the chat answer
    chat: 'openai/gpt-4o-mini'
  }
})

// Step 2: Create your database with the plugin
const db = await create({
  schema: {
    name: 'string',
    description: 'string'
  },
  plugins: [secureProxy]
})

// Step 3: Insert your data
await insert(db, { name: 'John Doe', description: 'Software engineer' })
await insert(db, { name: 'Jane Doe', description: 'Product manager' })

// Step 4: Create an Answer Session
const session = new AnswerSession(db, {
  systemPrompt: 'You are a helpful assistant that provides information about people.',
  events: {
    onStateChange: (state) => {
      console.log('State updated:', state)
    }
  }
})

// Step 5: Ask questions
const response = await session.ask({
  term: 'john'
})

console.log(response) // AI-generated response about John Doe

How It Works

When you call session.ask(), Orama performs the following steps:
1. Search Phase

Orama searches your database using the provided query term and returns the most relevant documents.

2. Context Building

The retrieved documents are formatted and added to the conversation context.

3. AI Generation

The query and the retrieved context are sent to the configured LLM (via the Secure Proxy) to generate a response.

4. Streaming Response

The response is streamed back token by token, with a state update triggered for each chunk.
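The streaming behavior in step 4 can be illustrated with a plain TypeScript async generator. This is a conceptual sketch, not Orama's internals: `streamTokens` and `consume` are hypothetical names standing in for the LLM stream and the session's event loop.

```typescript
// Conceptual sketch of token-by-token streaming with per-chunk state updates.
// `streamTokens` is a hypothetical stand-in for the LLM response stream.
async function* streamTokens(answer: string): AsyncGenerator<string> {
  for (const word of answer.split(' ')) {
    yield word + ' '
  }
}

async function consume(
  answer: string,
  onStateChange: (partial: string) => void
): Promise<string> {
  let state = ''
  for await (const token of streamTokens(answer)) {
    state += token
    onStateChange(state) // fired once per chunk, like the state updates above
  }
  return state.trim()
}

const final = await consume('John Doe is a software engineer', (s) =>
  console.log('State updated:', s)
)
```

This is why the `onStateChange` event in the Quick Start fires repeatedly: each chunk produces a new, progressively longer state that a UI can render immediately.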

Secure Proxy Plugin

The Answer Engine requires the Secure Proxy Plugin to securely communicate with AI providers like OpenAI. This plugin:
  • Keeps your API keys secure on the server side
  • Handles authentication and rate limiting
  • Supports multiple AI providers and models
  • Is completely free to use
The Secure Proxy Plugin is required for the Answer Engine to work. Make sure to configure it with a valid API key and a chat model.
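Building on the Quick Start configuration above, a fuller plugin setup might also specify an embeddings model for vector search. The `embeddings` model name below is an example assumption, not a confirmed value; check the Secure Proxy documentation for the models your plan supports.

```typescript
import { pluginSecureProxy } from '@orama/plugin-secure-proxy'

const secureProxy = await pluginSecureProxy({
  apiKey: 'my-api-key',          // your Secure Proxy API key
  defaultProperty: 'embeddings', // schema property used for vector search
  models: {
    chat: 'openai/gpt-4o-mini',  // model used to generate chat answers
    // Example embeddings model name; verify against the Secure Proxy docs
    embeddings: 'openai/text-embedding-3-small'
  }
})
```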

Available Chat Models

When configuring the Secure Proxy Plugin, you can choose from various chat models:
  • openai/gpt-4o - Most capable OpenAI model
  • openai/gpt-4o-mini - Fast and cost-effective
  • openai/gpt-4-turbo - Balanced performance
  • openai/gpt-3.5-turbo - Budget-friendly option
Refer to the Secure Proxy documentation for a complete list of supported models.

Next Steps

Configure Sessions

Learn how to customize AnswerSession with system prompts, context, and events

Build Chat UIs

Create interactive chat experiences with streaming responses
