DeenPAL requires configuration before use, including API keys for the language model and optional customization of retrieval parameters.

Environment Variables Setup

1. Create .env File

Create a .env file in the project root directory:
touch .env

2. Add API Key

Open the .env file and add your OpenRouter API key:
.env
OPENAI_API_KEY="your_openrouter_api_key_here"
DeenPAL uses OpenRouter to access the DeepSeek model. The ChatOpenAI class from LangChain expects the OPENAI_API_KEY environment variable.
Keep your .env file secure and never commit it to version control. Add it to .gitignore.
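Once `load_dotenv()` (from python-dotenv, the usual pattern in LangChain projects) has populated the environment, it helps to fail fast with a clear message if the key is missing. Here is a minimal, stdlib-only validation helper; `require_env` is a hypothetical name, not part of DeenPAL's codebase:

```python
import os

def require_env(name: str) -> str:
    """Return the value of an environment variable, or fail with a clear message."""
    value = os.getenv(name)
    if not value:
        raise RuntimeError(
            f"{name} not found. Add it to the .env file in the project root."
        )
    return value

# After python-dotenv's load_dotenv() has populated os.environ:
# api_key = require_env("OPENAI_API_KEY")
```

A check like this turns a cryptic 401 at query time into an actionable error at startup.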

API Key Configuration

Using OpenRouter API

DeenPAL is configured to use DeepSeek via the OpenRouter API by default:
chains.py
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    model="deepseek/deepseek-chat-v3-0324:free",
    base_url="https://openrouter.ai/api/v1"
)
To use this configuration:
  1. Sign up at OpenRouter
  2. Get your API key from the dashboard
  3. Add it to your .env file as OPENAI_API_KEY
The deepseek-chat-v3-0324:free model tier provides free access through OpenRouter.
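The snippet above relies on `ChatOpenAI` picking up `OPENAI_API_KEY` from the environment automatically. If you prefer the wiring to be explicit, the equivalent configuration can also pass the key directly (a sketch; DeenPAL's `chains.py` uses the implicit form):

```python
import os
from langchain_openai import ChatOpenAI

# Equivalent to the chains.py snippet, with the key passed explicitly.
# If api_key is omitted, ChatOpenAI reads OPENAI_API_KEY from the environment.
llm = ChatOpenAI(
    model="deepseek/deepseek-chat-v3-0324:free",
    base_url="https://openrouter.ai/api/v1",
    api_key=os.getenv("OPENAI_API_KEY"),
)
```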

Alternative LLM Configurations

You can modify chains.py to use different LLM providers:
import os
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    model="gpt-4",
    api_key=os.getenv("OPENAI_API_KEY")
)
When changing LLM providers, ensure you update both the import statement and the environment variable name in your .env file.
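One way to keep provider switching tidy is a small helper that maps a provider name to `ChatOpenAI` keyword arguments. This is a hypothetical pattern for illustration, not part of DeenPAL's `chains.py`:

```python
def llm_kwargs(provider: str) -> dict:
    """Map a provider name to ChatOpenAI keyword arguments.

    Hypothetical helper: DeenPAL hardcodes the OpenRouter configuration
    in chains.py; this just illustrates one way to switch providers.
    """
    if provider == "openrouter":
        return {
            "model": "deepseek/deepseek-chat-v3-0324:free",
            "base_url": "https://openrouter.ai/api/v1",
        }
    if provider == "openai":
        return {"model": "gpt-4"}
    raise ValueError(f"Unknown provider: {provider}")

# Usage (requires langchain-openai installed):
# llm = ChatOpenAI(**llm_kwargs("openrouter"))
```

Centralizing the mapping means a provider change touches one function rather than scattered literals.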

Retriever Parameters

The retrieval system can be customized in chains.py to adjust how Hadiths are retrieved:

Current Configuration

chains.py
retriever = db.as_retriever(
    search_type="mmr",  # Use Maximal Marginal Relevance
    search_kwargs={"k": 4, "fetch_k": 10}
)

Parameter Explanations

Search Type: MMR (Maximal Marginal Relevance)

Why MMR? MMR returns diverse, relevant results instead of redundant near-duplicates. The project author tested both similarity-based and MMR retrieval and found that MMR:
  • Increases diversity in retrieved Hadiths
  • Maintains relevance to the query
  • Helps the LLM provide different but related Hadiths
  • Avoids redundant responses

k Parameter (Number of Results)

  • Default: 4
  • Purpose: Number of diverse results returned to the LLM
  • Range: 1-10 recommended
Adjusting k:
search_kwargs={"k": 6, "fetch_k": 10}  # Return 6 diverse results
  • Lower k (1-3): Faster responses, more focused answers
  • Higher k (5-10): More comprehensive answers, broader context

fetch_k Parameter (Candidate Pool)

  • Default: 10
  • Purpose: Number of candidates fetched before MMR selection
  • Range: Should be ≥ k
Adjusting fetch_k:
search_kwargs={"k": 4, "fetch_k": 20}  # Fetch 20, select 4 diverse
  • Lower fetch_k: Faster retrieval, less diversity
  • Higher fetch_k: More diversity options, slower retrieval
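To build intuition for how k and fetch_k interact, here is a toy greedy MMR selection over plain Python lists. This is a simplified, stdlib-only sketch of the idea, not LangChain's actual implementation (which operates on embedding vectors inside the vector store):

```python
def mmr_select(query_sim, pairwise_sim, k, lambda_mult=0.5):
    """Greedy MMR: pick k candidates balancing relevance against redundancy.

    query_sim[i]       -- similarity of candidate i to the query (relevance)
    pairwise_sim[i][j] -- similarity between candidates i and j (redundancy)
    """
    candidates = list(range(len(query_sim)))
    # Seed with the single most relevant candidate.
    first = max(candidates, key=lambda i: query_sim[i])
    selected = [first]
    candidates.remove(first)
    while candidates and len(selected) < k:
        def mmr_score(i):
            redundancy = max(pairwise_sim[i][j] for j in selected)
            return lambda_mult * query_sim[i] - (1 - lambda_mult) * redundancy
        best = max(candidates, key=mmr_score)
        selected.append(best)
        candidates.remove(best)
    return selected

# Four candidates fetched (as with fetch_k=4); 0 and 1 are near-duplicates.
query_sim = [0.9, 0.88, 0.7, 0.5]
pairwise_sim = [
    [1.0, 0.95, 0.1, 0.1],
    [0.95, 1.0, 0.1, 0.1],
    [0.1, 0.1, 1.0, 0.1],
    [0.1, 0.1, 0.1, 1.0],
]
print(mmr_select(query_sim, pairwise_sim, k=2))  # → [0, 2]: near-duplicate 1 is skipped
```

Plain similarity search would return candidates 0 and 1 (the two highest scores); MMR penalizes candidate 1 for overlapping with 0 and picks 2 instead, which is exactly the diversity effect described above.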

Alternative Search Types

You can also use similarity-based search:
retriever = db.as_retriever(
    search_type="similarity",
    search_kwargs={"k": 4}
)
The project author found that the similarity_score_threshold search type caused the chatbot to return redundant Hadiths. MMR is recommended for better results.

Custom Prompts

To customize how the chatbot responds, modify the prompts in prompts.py:

Default QA System Prompt

prompts.py
qa_system_prompt = (
    "You are an Islamic religious assistant for accurately retrieving hadiths..."
    "{context}"
)

Customizing the Prompt

Create or modify prompts.py to adjust the chatbot’s behavior:
prompts.py
from langchain_core.prompts import ChatPromptTemplate

# Custom system prompt
qa_system_prompt = (
    "You are a knowledgeable Islamic scholar assistant. "
    "Use the following Hadiths to answer questions accurately and respectfully.\n\n"
    "Hadiths:\n{context}\n\n"
    "Instructions:\n"
    "1. Cite each Hadith with its source and number\n"
    "2. Provide clear explanations\n"
    "3. Be respectful and humble in tone\n"
    "4. If unsure, acknowledge uncertainty"
)

# Create prompt template
qa_prompt = ChatPromptTemplate.from_messages(
    [
        ("system", qa_system_prompt),
        ("human", "{input}"),
    ]
)
The {context} placeholder is automatically filled with retrieved Hadiths, and {input} contains the user’s question.
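You can sanity-check a template's placeholders without running the full chain. Here is a stdlib-only sketch using str.format, which behaves like LangChain's templating for simple {name} placeholders; the sample Hadith text is illustrative only:

```python
# A trimmed version of the custom system prompt above (illustrative only).
qa_system_prompt = (
    "You are a knowledgeable Islamic scholar assistant. "
    "Use the following Hadiths to answer questions accurately and respectfully.\n\n"
    "Hadiths:\n{context}\n\n"
)

# Simulate what the retrieval chain does: fill {context} with retrieved text.
sample_context = "Sahih al-Bukhari 1: Actions are judged by intentions."
rendered = qa_system_prompt.format(context=sample_context)
print("{context}" in rendered)  # → False: the placeholder has been filled
```

A quick render like this catches typos such as an unmatched brace or a misspelled placeholder name before they surface as runtime errors in the chain.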

Configuration Best Practices

Security

  • Never hardcode API keys in source files
  • Use .env for all sensitive configuration
  • Add .env to .gitignore
  • Rotate API keys periodically

Performance Tuning

For faster responses:
search_kwargs={"k": 2, "fetch_k": 5}
For more comprehensive answers:
search_kwargs={"k": 6, "fetch_k": 15}
Balanced (default):
search_kwargs={"k": 4, "fetch_k": 10}

Verifying Configuration

Test your configuration by running a simple query:
streamlit run app.py
If the app starts without errors and responds to queries, your configuration is correct.

Common Configuration Issues

API Key Not Found:
Error: OPENAI_API_KEY not found
Solution: Ensure .env exists in the project root and contains the correct key.

Invalid API Key:
Error: 401 Unauthorized
Solution: Verify your API key is correct and active.

Model Not Found:
Error: Model not found
Solution: Check that the model name matches your provider's available models.

Next Steps

With configuration complete, you’re ready to:
  1. Run the chatbot
  2. Test queries and evaluate responses
  3. Fine-tune retrieval parameters based on results
