AI Suggestions

Overview

AI Suggestions provide inline completions based on your current context - whether you’re typing a message, writing code, or drafting an email. The system learns from your writing style and remembers your preferences to deliver increasingly personalized suggestions.

Suggestions are powered by a persistent memory system that remembers facts about you, your projects, and your writing style across sessions.

Triggering Suggestions

Shortcut	Action
`Ctrl+Space`	Get AI suggestion for current context

Two modes are available:

Hotkey Mode
Auto Mode

Manual trigger - Press Ctrl+Space when you want a suggestion.

Full control over when suggestions appear
Lower resource usage
Best for focused writing sessions

How It Works

The suggestion system follows a three-step workflow:

1. Memory Search (Mandatory)

Before generating any suggestion, the system always searches your memory to personalize the output:

const getSystemPrompt = (userId: string) => `
You are a SENTENCE COMPLETION engine with PERSISTENT MEMORY.

## MANDATORY WORKFLOW

### STEP 1: ALWAYS SEARCH MEMORY FIRST (REQUIRED)
Before generating ANY text, you MUST call searchMemory.
This is NOT optional.

Call: searchMemory({
  query: "<extract keywords from user input>",
  userId: "${userId}",
  limit: 5
})

Examples:
- User types "My name is" → searchMemory({ query: "name", userId })
- User types "I'm working on" → searchMemory({ query: "project work", userId })
- User types "My favorite" → searchMemory({ query: "favorite preferences", userId })
`;

2. Generate Completion

Using the retrieved memories, the AI completes your sentence naturally:

Includes your original text plus continuation
Uses memory context for personalization (names, preferences, projects)
1-3 sentences maximum
No quotes, no questions - just natural completions

3. Store New Facts

If your input reveals new information, the system stores it for future suggestions:

// User types: "My name is Avinash and I"

// Step 1: Search memory
await searchMemory({ query: "name Avinash", userId, limit: 5 });

// Step 2: Generate completion
// Output: "My name is Avinash and I am a passionate developer
//          working on innovative projects."

// Step 3: Store the fact
await addMemory({
  messages: [{ role: "user", content: "My name is Avinash" }],
  userId
});

Memory-Enhanced Personalization

The suggestion engine remembers:

Personal Details

Your name, role, and profession
Location and time zone
Contact information

Writing Style

Tone preferences (formal/casual)
Verbosity (concise/detailed)
Common phrases and terminology

Projects & Tools

Technologies you work with
Project names and details
Frameworks and libraries

Preferences

Email signatures
Greetings and sign-offs
Domain-specific knowledge

API Implementation

The suggest endpoint uses streaming responses with tool calls:

export async function POST(req: Request) {
  const { messages, model, userId } = await req.json();
  
  const modelMessages = await convertToModelMessages(messages);

  const stream = createUIMessageStream({
    generateId: generateUUID,
    execute: async ({ writer: dataStream }) => {
      const result = streamText({
        model: myProvider.languageModel(model || defaultFastModel),
        system: getSystemPrompt(userId),
        messages: modelMessages,
        tools: {
          addMemory: addMemoryTool,
          searchMemory: searchMemoryTool,
          getAllMemories: getAllMemoriesTool,
        },
        stopWhen: stepCountIs(6),
      });

      dataStream.merge(result.toUIMessageStream({
        sendReasoning: true,
      }));
    },
  });

  return new Response(stream.pipeThrough(new JsonToSseTransformStream()));
}

Memory Tools

Three tools power the memory system:

`searchMemory`

Retrieves relevant facts based on a query:

searchMemory({
  query: "project work",
  userId: "user-123",
  limit: 5
})
// Returns: ["Working on Tabby AI Keyboard", "Uses Next.js and Electron", ...]

`addMemory`

Stores new facts from user input:

addMemory({
  messages: [{ role: "user", content: "I prefer TypeScript over JavaScript" }],
  userId: "user-123"
})
// Stores: User preference for TypeScript

`getAllMemories`

Retrieves all stored memories for a user:

getAllMemories({ userId: "user-123" })
// Returns complete memory graph

Examples

Example 1: Name Completion

Input: "My name is Avinash and I" Process:

Search memory for “name Avinash”
Generate: “My name is Avinash and I am a passionate developer working on innovative projects.”
Store the name fact

Example 2: Project Context

Input: "The project I'm building uses" Process:

Search memory for “project building technology”
Find: User is working on Tabby with Next.js and Electron
Generate: “The project I’m building uses Next.js and Electron for a seamless cross-platform experience.”

Example 3: Email Signature

Input: "Best regards," Process:

Search memory for “email signature name”
Find: User’s name is “Avinash” and role is “Software Engineer”
Generate: “Best regards,\nAvinash\nSoftware Engineer”

Absolute Rules

The suggestion engine follows strict guidelines:

Always call searchMemory before any text output

Always call addMemory when new personal facts are shared

Never skip the memory search step - this breaks personalization

Never ask questions - you’re a completion engine, not a chatbot

Never respond without checking memory first

Performance Optimization

The system uses several optimizations:

Fast models - Uses defaultFastModel for low-latency responses
Limited steps - stopWhen: stepCountIs(6) prevents excessive tool calls
Streaming - Server-sent events (SSE) for immediate feedback
Caching - Memory results are cached during the request

The more you use suggestions, the smarter they become. The memory system continuously learns from your writing patterns.

Privacy & Storage

All memories are stored locally by default:

Supabase vector store - Semantic search over memories
Neo4j knowledge graph (optional) - Visual memory relationships
User-scoped - Memories are isolated per user
Encrypted - Stored with proper authentication

Memories can be viewed and managed from the Brain Panel (Ctrl+Shift+B).

Interview Copilot

AI Assistance

Memory & Brain

Voice Features

Automation

Overview

Triggering Suggestions

How It Works

1. Memory Search (Mandatory)

2. Generate Completion

3. Store New Facts

Memory-Enhanced Personalization

Personal Details

Writing Style

Projects & Tools

Preferences

API Implementation

Memory Tools

`searchMemory`

`addMemory`

`getAllMemories`

Examples

Example 1: Name Completion

Example 2: Project Context

Example 3: Email Signature

Absolute Rules

Performance Optimization

Privacy & Storage

See Also

Build docs developers (and LLMs) love

Interview Copilot

AI Assistance

Memory & Brain

Voice Features

Automation

​Overview

​Triggering Suggestions

​How It Works

​1. Memory Search (Mandatory)

​2. Generate Completion

​3. Store New Facts

​Memory-Enhanced Personalization

Personal Details

Writing Style

Projects & Tools

Preferences

​API Implementation

​Memory Tools

​searchMemory

​addMemory

​getAllMemories

​Examples

​Example 1: Name Completion

​Example 2: Project Context

​Example 3: Email Signature

​Absolute Rules

​Performance Optimization

​Privacy & Storage

​See Also

Build docs developers (and LLMs) love

Overview

Triggering Suggestions

How It Works

1. Memory Search (Mandatory)

2. Generate Completion

3. Store New Facts

Memory-Enhanced Personalization

API Implementation

Memory Tools

`searchMemory`

`addMemory`

`getAllMemories`

Examples

Example 1: Name Completion

Example 2: Project Context

Example 3: Email Signature

Absolute Rules

Performance Optimization

Privacy & Storage

See Also