
Prerequisites

Before creating a knowledge base, ensure you have:
  • A configured embedding provider integration (see Embedding providers)
  • Access to the Iqra AI platform or self-hosted instance
  • Documents to upload (PDF, TXT, or other supported formats)
Self-hosted deployments require Milvus, MongoDB, and Redis to be properly configured. See the deployment guide for infrastructure setup.

Creating a knowledge base

1. Navigate to knowledge bases

From your business dashboard, access the knowledge base management section.
2. Create new knowledge base

Click Create Knowledge Base and provide:
  • Name: A descriptive name for the knowledge base
  • Description: Optional description of the content and purpose
3. Configure chunking strategy

Choose how documents will be split into chunks:

General chunking

Best for uniformly structured content like articles or documentation.
{
  Type: "General",
  Delimiter: "\n\n",       // Split on double newlines
  MaxLength: 1024,         // Maximum chunk size in characters
  Overlap: 50,             // Character overlap between chunks
  Preprocess: {
    ReplaceConsecutive: true,  // Normalize whitespace
    DeleteUrls: false          // Keep URLs in text
  }
}
Start with 1024 character chunks and 50 character overlap. Adjust based on your content structure and retrieval quality.
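The general strategy can be sketched roughly as follows. This is a minimal illustration of delimiter splitting with a max-length window and overlap, not the platform's actual implementation; the function and parameter names are assumptions.

```python
def chunk_general(text, delimiter="\n\n", max_length=1024, overlap=50):
    """Split text on a delimiter, then window oversized pieces with overlap."""
    chunks = []
    for piece in text.split(delimiter):
        piece = piece.strip()
        if not piece:
            continue
        if len(piece) <= max_length:
            chunks.append(piece)
        else:
            # Slide a window across pieces longer than max_length,
            # stepping back `overlap` characters at each boundary.
            step = max_length - overlap
            for start in range(0, len(piece), step):
                chunks.append(piece[start:start + max_length])
    return chunks

doc = "Intro paragraph.\n\n" + "x" * 2000
chunks = chunk_general(doc)
print(len(chunks))  # 4: the intro plus three windows of the long piece
```

Smaller `max_length` values produce more, tighter chunks; the overlap ensures sentences that straddle a window boundary appear in both neighboring chunks.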

Parent-child chunking

Better for complex documents where context is crucial.
{
  Type: "ParentChild",
  Parent: {
    Type: "Paragraph",      // or use custom delimiter
    Delimiter: null,        // null for paragraph-based
    MaxLength: null
  },
  Child: {
    Delimiter: "\n",        // Split on single newline
    MaxLength: 512          // Smaller child chunks
  },
  Preprocess: {
    ReplaceConsecutive: true,
    DeleteUrls: false
  }
}
Parent-child chunking retrieves child chunks but provides parent context to the agent, improving answer quality while maintaining retrieval precision.
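The match-child-return-parent behavior can be modeled in a few lines. This is a toy sketch (substring matching stands in for vector similarity, and all names are hypothetical), but it shows the key idea: the index is built over small child chunks, while retrieval returns the enclosing parent.

```python
def split_parent_child(text, child_delimiter="\n", child_max=512):
    """Parents are paragraphs; children are lines within each parent."""
    parents = [p for p in text.split("\n\n") if p.strip()]
    index = []  # (child_text, parent_id) pairs
    for pid, parent in enumerate(parents):
        for child in parent.split(child_delimiter):
            child = child.strip()
            if child:
                index.append((child[:child_max], pid))
    return parents, index

def retrieve(query_word, parents, index):
    # Toy "search": substring match stands in for vector similarity.
    parent_ids = {pid for child, pid in index if query_word in child}
    return [parents[pid] for pid in sorted(parent_ids)]

text = "Billing rules.\nRefunds take 5 days.\n\nShipping info.\nOrders ship daily."
parents, index = split_parent_child(text)
print(retrieve("Refunds", parents, index))
# Matches one child line, but returns the full parent paragraph
```

The query hits the single child line about refunds, yet the agent receives the whole billing paragraph as context.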
4. Configure embedding

Select your embedding integration and model:
  • Integration: Choose from configured embedding providers
  • Model: Select the embedding model (e.g., text-embedding-004)
  • Vector dimension: Set based on model specifications
Changing the embedding model after documents are indexed requires re-indexing all content. Choose carefully.
5. Configure retrieval strategy

Select how the knowledge base will retrieve relevant chunks:
Vector search:
{
  Type: "VectorSearch",
  TopK: 3,                    // Number of chunks to retrieve
  UseScoreThreshold: true,
  ScoreThreshold: 0.7,        // Minimum similarity score (0-1)
  Rerank: {
    Enabled: true,
    Integration: "rerank-integration-id"
  }
}
Full-text search:
{
  Type: "FullTextSearch",
  TopK: 3,
  Rerank: {
    Enabled: false,
    Integration: null
  }
}
Hybrid search:
{
  Type: "HybridSearch",
  Mode: "WeightedScore",     // or "Rerank"
  Weight: 0.7,                // Vector weight (0-1)
  TopK: 3,
  UseScoreThreshold: true,
  ScoreThreshold: 0.6,
  RerankIntegration: null
}
Hybrid search with 70% vector weight typically provides the best balance between semantic understanding and exact keyword matching.
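In `WeightedScore` mode, the blended score is a simple linear combination. A sketch (the function name and example scores are illustrative, not the platform's internals):

```python
def hybrid_score(vector_score, keyword_score, weight=0.7):
    """WeightedScore mode: blend vector and keyword scores (both in 0-1)."""
    return weight * vector_score + (1 - weight) * keyword_score

candidates = {
    "chunk-a": (0.82, 0.10),  # semantically close, few exact keyword hits
    "chunk-b": (0.55, 0.95),  # strong exact-keyword match, weaker semantics
}
ranked = sorted(candidates, key=lambda c: hybrid_score(*candidates[c]), reverse=True)
print(ranked)  # ['chunk-b', 'chunk-a'] — 0.67 beats 0.604
```

Lowering `weight` shifts ranking toward exact keyword matches; raising it favors semantic similarity.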
6. Save configuration

Review your settings and click Create to initialize the knowledge base.

Uploading documents

Once your knowledge base is created:
1. Add documents

Click Upload Documents and select files from your computer. Supported formats include:
  • PDF (via PDF extractor or Unstructured API)
  • Plain text (.txt)
  • Other formats supported by Unstructured API
2. Processing

Documents are processed asynchronously:
  1. Text extraction
  2. Cleaning and preprocessing
  3. Chunking based on your strategy
  4. Embedding generation
  5. Storage in Milvus and MongoDB
  6. Keyword index creation
Large documents may take several minutes to process.
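The six stages above can be sketched as a single data flow. This is a hypothetical illustration with stub stage functions, not the platform's actual service code, which runs these steps asynchronously:

```python
def ingest(raw_bytes, *, extract, clean, chunk, embed, store_vectors, store_keywords):
    text = extract(raw_bytes)        # 1. text extraction
    text = clean(text)               # 2. cleaning and preprocessing
    chunks = chunk(text)             # 3. chunking per the configured strategy
    vectors = embed(chunks)          # 4. embedding generation
    store_vectors(chunks, vectors)   # 5. storage in the vector + document stores
    store_keywords(chunks)           # 6. keyword index creation
    return len(chunks)

stored = {}
n = ingest(
    b"Alpha.\n\nBeta.",
    extract=lambda b: b.decode("utf-8"),
    clean=str.strip,
    chunk=lambda t: t.split("\n\n"),
    embed=lambda cs: [[len(c)] for c in cs],  # stand-in vectors
    store_vectors=lambda cs, vs: stored.update(vectors=vs),
    store_keywords=lambda cs: stored.update(keywords=cs),
)
print(n)  # 2 chunks indexed
```

Because each stage only depends on the previous stage's output, a failure at any step can be retried without repeating earlier, cheaper stages.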
3. Verify indexing

Once processing completes, verify:
  • Document status shows as Indexed
  • Chunk count matches expectations
  • No error messages in the document details
The system generates embeddings in batches to optimize API usage. Embedding costs are determined by your provider’s pricing model.
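Batching works roughly like this (a minimal sketch with a fake provider; the batch size and function names are assumptions, not the platform's configuration):

```python
def embed_in_batches(texts, embed_batch, batch_size=32):
    """Call the provider once per batch instead of once per chunk."""
    vectors = []
    for i in range(0, len(texts), batch_size):
        vectors.extend(embed_batch(texts[i:i + batch_size]))
    return vectors

calls = []
def fake_provider(batch):
    calls.append(len(batch))
    return [[0.0] * 3 for _ in batch]  # stand-in for real embedding vectors

vecs = embed_in_batches([f"chunk {i}" for i in range(70)], fake_provider)
print(len(vecs), calls)  # 70 vectors produced by just 3 API calls: [32, 32, 6]
```

Seventy chunks cost three provider calls instead of seventy, which matters when the provider bills or rate-limits per request.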

Linking to agents

To enable an agent to use the knowledge base:
1. Open agent configuration

Navigate to your agent’s settings in the Script Builder.
2. Add knowledge base link

In the Knowledge Base section:
  1. Click Link Knowledge Base
  2. Select the knowledge base from the dropdown
  3. Configure search strategy (if different from defaults)
3. Configure search strategy

Optionally override retrieval settings for this agent:
  • TopK: Number of chunks to retrieve per query
  • Search refinement: Additional filtering or boosting
4. Test retrieval

Use the testing interface to verify:
  • Queries return relevant chunks
  • Context quality meets expectations
  • Response latency is acceptable

Managing documents

Updating documents

When updating an existing document:
  1. Upload the new version with the same filename
  2. The system will:
    • Delete old chunks from Milvus and keyword store
    • Reprocess the new content
    • Generate fresh embeddings
    • Update vector and keyword indices
Updating documents with many chunks can be resource-intensive. Consider scheduling updates during low-traffic periods.
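The replace-by-filename flow amounts to purge-then-reingest. A toy sketch with dictionary-backed stores (all names here are illustrative stand-ins for the Milvus and keyword stores):

```python
vector_store, keyword_store = {}, {}

def index_document(filename, chunks):
    vector_store[filename] = [f"vec({c})" for c in chunks]  # stand-in embeddings
    keyword_store[filename] = set(" ".join(chunks).split())

def update_document(filename, new_chunks):
    vector_store.pop(filename, None)      # delete old chunks from the vector store
    keyword_store.pop(filename, None)     # delete keyword index entries
    index_document(filename, new_chunks)  # reprocess and generate fresh embeddings

index_document("faq.txt", ["old answer"])
update_document("faq.txt", ["new answer", "second chunk"])
print(len(vector_store["faq.txt"]))  # 2 — old chunks are gone, new ones indexed
```

Purging before reinserting guarantees no stale chunks from the old version linger in either index.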

Deleting documents

To remove a document:
  1. Select the document in the knowledge base
  2. Click Delete
  3. Confirm deletion
The system will:
  • Remove all chunks from Milvus
  • Delete keyword index entries
  • Clean up metadata in MongoDB

Disabling documents

Temporarily disable documents without deleting them:
  1. Toggle the document’s Enabled status
  2. Disabled chunks are excluded from retrieval
  3. Re-enable anytime to restore access

Performance optimization

Chunk size tuning

Optimal chunk size depends on your use case:
  • Small chunks (256-512 chars): Better precision, may lack context
  • Medium chunks (512-1024 chars): Balanced approach, recommended default
  • Large chunks (1024-2048 chars): More context, may dilute relevance
Use parent-child chunking if you need both precision and context without compromising either.

Overlap configuration

Chunk overlap prevents information loss at boundaries:
  • No overlap (0): Faster processing, risk of split concepts
  • Low overlap (20-50 chars): Minimal redundancy, good for most content
  • High overlap (100-200 chars): Maximum continuity, increased storage
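The boundary-loss problem is easy to demonstrate. In this sketch (window sizes are illustrative), a phrase that straddles a chunk boundary is unrecoverable without overlap but survives intact with it:

```python
def windows(text, size=40, overlap=10):
    """Fixed-size character windows with a configurable overlap."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]

text = "A" * 35 + "refund policy" + "B" * 35
no_overlap = windows(text, size=40, overlap=0)
with_overlap = windows(text, size=40, overlap=15)
print(any("refund policy" in w for w in no_overlap))    # False — phrase split at a boundary
print(any("refund policy" in w for w in with_overlap))  # True — overlap keeps it whole
```

A query for "refund policy" would miss every chunk in the no-overlap case, even though the phrase exists in the source document.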

Collection management

Milvus collections are automatically managed:
  • Loaded into memory when agents query them
  • Released after a configurable idle period (default: 1 hour)
  • Reloaded on-demand with minimal latency
Frequently accessed knowledge bases remain in memory, while rarely used ones are unloaded to conserve resources.
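The load-on-query, release-after-idle lifecycle can be modeled as a small cache. This is a toy sketch (class and method names are assumptions, not the platform's code), tracking last-access times and evicting collections idle past the threshold:

```python
import time

class CollectionCache:
    """Toy model of load-on-query / release-after-idle collection management."""
    def __init__(self, idle_seconds=3600):
        self.idle_seconds = idle_seconds
        self.loaded = {}  # collection name -> last access timestamp

    def query(self, name, now=None):
        now = time.time() if now is None else now
        # On a cache miss, the collection would be loaded into memory here
        # (Milvus exposes load/release operations for this purpose).
        self.loaded[name] = now

    def evict_idle(self, now=None):
        now = time.time() if now is None else now
        stale = [n for n, t in self.loaded.items() if now - t > self.idle_seconds]
        for name in stale:
            del self.loaded[name]  # release the collection from memory

cache = CollectionCache(idle_seconds=3600)
cache.query("faq", now=0)
cache.query("policies", now=3000)
cache.evict_idle(now=4000)   # "faq" has been idle 4000s > 3600s, so it is released
print(sorted(cache.loaded))  # ['policies']
```

Each query refreshes the timestamp, so busy knowledge bases stay resident while idle ones are released.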

Monitoring and maintenance

Health checks

Regularly verify:
  • All documents show Indexed status
  • No failed embedding generation jobs
  • Milvus collections are accessible
  • Embedding cache hit rate is reasonable

Updating embeddings

If you change embedding providers:
  1. Update the knowledge base configuration
  2. Trigger full re-indexing of all documents
  3. Monitor progress in the processing queue
  4. Verify retrieval quality with test queries
Re-indexing generates new embeddings for all chunks and may incur significant API costs. Test with a subset first.

Next steps

Embedding providers

Configure and optimize embedding integrations

Retrieval strategies

Learn about retrieval configuration and optimization
