Installation

System Requirements

Before installing, ensure your system meets these requirements:

Python Version

Python 3.10 or 3.11 (recommended)

RAM

Minimum 8GB RAM (16GB recommended for large candidate pools)

Storage

~2GB free space for dependencies and embedding models

Operating System

Windows, macOS, or Linux

Python 3.12+ is not yet officially supported by all dependencies. Stick with Python 3.10 or 3.11 for best compatibility.

Installation Methods

pip (Recommended)
Poetry
Docker

Option 1: Install with pip

This is the simplest method and uses pip, Python's package manager.

Clone the Repository

git clone https://github.com/YOUR_USERNAME/rag-recruitment-assistant.git
cd rag-recruitment-assistant

Create Virtual Environment

Linux/macOS:

python3 -m venv venv
source venv/bin/activate

Windows:

python -m venv venv
venv\Scripts\activate

Install Dependencies

pip install -r requirements.txt

This will install all required packages:

langchain - LLM application framework
langchain-community - Community integrations
langchain-google-genai - Gemini LLM integration
langchain-huggingface - HuggingFace embeddings
sentence-transformers - Neural embedding models
faiss-cpu - Vector similarity search
pypdf - PDF processing
reportlab - PDF generation
pandas==2.2.2 - Data manipulation
matplotlib - Static visualizations
plotly - Interactive visualizations

Option 2: Install with Poetry

For production environments or teams using dependency management.

Install Poetry

curl -sSL https://install.python-poetry.org | python3 -

Initialize Project

git clone https://github.com/YOUR_USERNAME/rag-recruitment-assistant.git
cd rag-recruitment-assistant
poetry install

Activate Environment

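poetry shell

On Poetry 2.x, where the shell command is no longer bundled by default, run commands through poetry run instead (for example, poetry run python test_setup.py).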

Option 3: Docker Container

For isolated, reproducible deployments.

Pull Image

docker pull your-registry/rag-recruitment:latest

Run Container

docker run -it \
  -e GOOGLE_API_KEY="your-api-key" \
  -v "$(pwd)/data:/app/data" \
  your-registry/rag-recruitment:latest

Dependencies Breakdown

Here’s what gets installed from requirements.txt:

langchain
langchain-community
langchain-google-genai
langchain-huggingface
sentence-transformers
faiss-cpu
pypdf
reportlab
pandas==2.2.2
matplotlib
plotly

Core Dependencies

Package	Version	Purpose
`langchain`	latest	Framework for building LLM applications
`langchain-google-genai`	latest	Gemini 1.5 Flash integration
`langchain-huggingface`	latest	HuggingFace embeddings wrapper
`sentence-transformers`	latest	Pre-trained embedding models
`faiss-cpu`	latest	Vector similarity search engine
`pypdf`	latest	PDF document parsing
`pandas`	2.2.2	Structured data manipulation

GPU Acceleration: If you have a CUDA-compatible GPU, replace faiss-cpu with faiss-gpu for faster vector search on large candidate databases.

API Key Setup

The system requires a Google API key to use Gemini 1.5 Flash. Follow these steps to obtain and configure it:

Step 1: Get Your API Key

Visit Google AI Studio

Navigate to Google AI Studio

Create API Key

Click “Create API Key” and select a Google Cloud project (or create a new one)

Copy the Key

Copy the generated API key - you’ll use it in the next step

Google AI Studio offers a free tier for Gemini 1.5 Flash (60 queries per minute at the time of writing; check the current limits in Google AI Studio). It is well suited to testing and small-scale deployments.

Step 2: Configure Environment Variable

Set the GOOGLE_API_KEY environment variable. The commands below are for Linux/macOS shells (bash or zsh):

# Temporary (current session only)
export GOOGLE_API_KEY="your_api_key_here"

# Permanent (add to ~/.bashrc or ~/.zshrc)
echo 'export GOOGLE_API_KEY="your_api_key_here"' >> ~/.bashrc
source ~/.bashrc
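
The security notes in this guide mention .env files. If you prefer to keep the key in such a file rather than in your shell profile, the following is a minimal standard-library sketch of loading it. Whether the project itself reads a .env file is not stated here, so treat parse_env_lines, load_env_file, and the .env location as assumptions made for illustration; the python-dotenv package provides a more complete implementation.

```python
import os


def parse_env_lines(lines):
    """Parse KEY=VALUE lines, ignoring blanks, comments, and malformed lines."""
    values = {}
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        if key.startswith("export "):
            # Tolerate shell-style "export KEY=value" lines.
            key = key[len("export "):].strip()
        if key:
            # Drop surrounding whitespace and a layer of quotes.
            values[key] = value.strip().strip("\"'")
    return values


def load_env_file(path=".env", environ=None):
    """Copy values from the file into environ without overriding existing keys."""
    environ = os.environ if environ is None else environ
    try:
        with open(path, encoding="utf-8") as handle:
            values = parse_env_lines(handle)
    except FileNotFoundError:
        return {}  # A missing file is not an error; rely on the real environment.
    added = {key: val for key, val in values.items() if key not in environ}
    environ.update(added)
    return added


if __name__ == "__main__":
    added = load_env_file()
    print("Loaded from .env:", sorted(added))  # Names only, never the values.
```

Values already present in the environment win over the file, so an exported GOOGLE_API_KEY is never silently replaced.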

Security Best Practices:

If you keep the key in a .env file, never commit it to version control (add .env to .gitignore)
Use different API keys for development and production
Rotate keys periodically
Monitor API usage in Google Cloud Console

Verification Steps

After installation, verify everything is working correctly:

1. Check Python Version

python --version
# Expected: Python 3.10.x or 3.11.x
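
The same check can be scripted, which is useful at the top of a setup script or in CI. This sketch assumes the supported range is exactly Python 3.10 and 3.11, as stated in the system requirements; the helper name is_supported_python is illustrative and not part of the project.

```python
import sys

# Supported (major, minor) pairs, taken from the System Requirements section.
SUPPORTED = {(3, 10), (3, 11)}


def is_supported_python(version_info=None):
    """Return True if the given (or running) interpreter version is supported."""
    info = sys.version_info if version_info is None else version_info
    return (info[0], info[1]) in SUPPORTED


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    status = "supported" if is_supported_python() else "untested with this project"
    print(f"Python {current}: {status}")
```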

2. Verify Dependencies

pip list | grep -E "langchain|faiss|sentence-transformers"

Example output (your version numbers will likely differ, since most packages are not pinned):

faiss-cpu                 1.7.4
langchain                 0.1.0
langchain-community       0.0.13
langchain-google-genai    0.0.5
langchain-huggingface     0.0.1
sentence-transformers     2.2.2
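
The grep command above is not available in a default Windows shell. A cross-platform alternative is to query package metadata from Python. The sketch below uses the standard importlib.metadata module; the REQUIRED list mirrors requirements.txt, and installed_versions is a name chosen for this example.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as listed in requirements.txt.
REQUIRED = [
    "langchain",
    "langchain-community",
    "langchain-google-genai",
    "langchain-huggingface",
    "sentence-transformers",
    "faiss-cpu",
    "pypdf",
    "reportlab",
    "pandas",
    "matplotlib",
    "plotly",
]


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions(REQUIRED).items():
        print(f"{name:<26}{ver or 'NOT INSTALLED'}")
```

If you swapped faiss-cpu for faiss-gpu, change that entry in the list accordingly.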

3. Test API Connection

Create a test script test_setup.py:

import os
from langchain_google_genai import ChatGoogleGenerativeAI
from langchain_huggingface import HuggingFaceEmbeddings

# Test 1: API Key
print("[1/3] Checking API key...")
if not os.getenv("GOOGLE_API_KEY"):
    raise ValueError("❌ GOOGLE_API_KEY not found in environment variables")
print("✓ API key configured")

# Test 2: LLM Connection
print("\n[2/3] Testing Gemini connection...")
try:
    llm = ChatGoogleGenerativeAI(
        model="gemini-1.5-flash",
        temperature=0
    )
    response = llm.invoke("Hello, respond with just 'OK' if you can read this")
    print(f"✓ Gemini response: {response.content}")
except Exception as e:
    print(f"❌ Gemini connection failed: {e}")
    raise

# Test 3: Embeddings
print("\n[3/3] Testing embeddings model...")
try:
    embeddings = HuggingFaceEmbeddings()
    test_vec = embeddings.embed_query("test sentence")
    print(f"✓ Embeddings working (dimension: {len(test_vec)})")
except Exception as e:
    print(f"❌ Embeddings failed: {e}")
    raise

print("\n✅ All systems operational!")

Run the test:

python test_setup.py

Expected Output:

[1/3] Checking API key...
✓ API key configured

[2/3] Testing Gemini connection...
✓ Gemini response: OK

[3/3] Testing embeddings model...
✓ Embeddings working (dimension: 768)

✅ All systems operational!

4. Verify FAISS Installation

Run the following in a Python session, or save it to a file and run it with python:

import faiss
import numpy as np

# Create a simple vector index
dimension = 128
index = faiss.IndexFlatL2(dimension)

# Add some random vectors
vectors = np.random.random((10, dimension)).astype('float32')
index.add(vectors)

print(f"✓ FAISS working - indexed {index.ntotal} vectors")
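
For intuition about what this check exercises: IndexFlatL2 performs an exhaustive search and ranks stored vectors by squared Euclidean distance to the query. The dependency-free sketch below mimics that behavior on plain Python lists. It is an illustration only (flat_l2_search is a made-up name), not how FAISS is implemented, and it is far slower.

```python
def flat_l2_search(vectors, query, k):
    """Return the k nearest stored vectors as (squared_distance, index) pairs."""
    scored = []
    for idx, vec in enumerate(vectors):
        if len(vec) != len(query):
            raise ValueError("all vectors must share the query's dimension")
        # Squared Euclidean distance: sum of squared coordinate differences.
        dist = sum((a - b) ** 2 for a, b in zip(vec, query))
        scored.append((dist, idx))
    scored.sort()  # Smallest distance first; ties go to the lower index.
    return scored[:k]


if __name__ == "__main__":
    stored = [[0.0, 0.0], [1.0, 1.0], [3.0, 4.0]]
    print(flat_l2_search(stored, [1.0, 0.0], k=2))
```

The dimension check reflects a real constraint: every vector added to or searched against an index must have the dimension the index was created with.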

Troubleshooting

ImportError: No module named 'faiss'

Solution: Install the CPU version explicitly:

pip uninstall faiss faiss-cpu faiss-gpu
pip install faiss-cpu

If you have a GPU:

pip install faiss-gpu

SSL Certificate Errors with HuggingFace

Solution: Update certificates or disable SSL verification (not recommended for production):

pip install --upgrade certifi

Or set the following environment variable (this is the option that disables verification, so avoid it in production):

export CURL_CA_BUNDLE=""

Google API Key Not Recognized

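Solution: Check whether Python can see the key. If you used the permanent method, open a new terminal first so that your shell profile is reloaded:

import os
print(os.getenv("GOOGLE_API_KEY"))  # Prints your key, or None if it is unset

If this prints None, the variable is not set in the environment Python was started from. Re-run the export command in that terminal, then start a new Python session.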
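
Printing the full key can leave it in terminal scrollback or logs. As an alternative, the sketch below reports only whether the key is set, plus a short prefix for telling keys apart. The helper describe_secret is a hypothetical name introduced for this example.

```python
import os


def describe_secret(value, visible=4):
    """Summarize a secret for logging without revealing it in full."""
    if not value:
        return "not set"
    if len(value) <= visible * 2:
        # Too short to show a prefix without exposing most of the secret.
        return f"set ({len(value)} chars, hidden)"
    return f"set ({len(value)} chars, starts with {value[:visible]}...)"


if __name__ == "__main__":
    print("GOOGLE_API_KEY:", describe_secret(os.getenv("GOOGLE_API_KEY")))
```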

Out of Memory Errors

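Solution: Reduce the batch size or use a smaller embedding model. The model below produces 384-dimensional vectors rather than the 768 reported in the verification step, so rebuild any existing FAISS index after switching:

from langchain_huggingface import HuggingFaceEmbeddings

embeddings = HuggingFaceEmbeddings(
    model_name="sentence-transformers/all-MiniLM-L6-v2"  # Smaller model
)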
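
Batch size can also be reduced on the caller's side by embedding documents in small groups, which caps how many texts are handed to the model per call. A minimal sketch follows; batched and embed_in_batches are hypothetical helpers, and in this project embed_fn would presumably be embeddings.embed_documents.

```python
def batched(items, size):
    """Yield consecutive slices of items with at most size elements each."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def embed_in_batches(texts, embed_fn, batch_size=16):
    """Embed texts group by group; embed_fn maps a list of texts to a list of vectors."""
    vectors = []
    for group in batched(texts, batch_size):
        vectors.extend(embed_fn(group))
    return vectors


if __name__ == "__main__":
    # Stand-in embedder for demonstration: one number per text (its length).
    def fake_embed(group):
        return [[float(len(text))] for text in group]

    print(embed_in_batches(["a", "bb", "ccc"], fake_embed, batch_size=2))
```

A smaller batch_size trades throughput for a lower peak memory footprint; the default of 16 here is an arbitrary starting point, not a project setting.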

Pandas Version Conflicts

Solution: The project pins pandas==2.2.2 for stability. If another package has pulled in a different version, reinstall the pinned one:

pip install "pandas==2.2.2"

Optional: GPU Acceleration

For production deployments with large candidate databases (1000+ resumes), GPU acceleration significantly improves performance:

Install CUDA Toolkit

Download from NVIDIA CUDA Downloads

Install GPU Version of FAISS

pip uninstall faiss-cpu
pip install faiss-gpu

Verify GPU Detection

import faiss
print(f"GPUs detected: {faiss.get_num_gpus()}")
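
If the same code has to run on machines with and without a GPU, the detection step can be wrapped so that it degrades gracefully. This is a sketch under assumptions: detect_faiss_gpus is a made-up helper, and it treats a missing faiss package or a build without get_num_gpus as zero GPUs.

```python
def detect_faiss_gpus():
    """Return the number of GPUs FAISS reports, or 0 if FAISS or GPU support is absent."""
    try:
        import faiss
    except ImportError:
        return 0  # FAISS is not installed at all.
    get_num_gpus = getattr(faiss, "get_num_gpus", None)
    return int(get_num_gpus()) if callable(get_num_gpus) else 0


if __name__ == "__main__":
    count = detect_faiss_gpus()
    backend = "GPU" if count > 0 else "CPU"
    print(f"FAISS backend: {backend} ({count} GPU(s) detected)")
```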

GPU acceleration can provide a 10-100x speedup for vector search on large datasets. If you run the project in Google Colab, the free environment already includes GPU support, so a local GPU setup is unnecessary for most users.

Next Steps

You’re all set! Here’s what to do next:

Run the Quickstart

Try the system with sample data in 5 minutes

Architecture Guide

Understand how components work together

Configuration

Customize models, prompts, and retrieval settings

API Reference

Explore available functions and classes

Installation Complete! You now have a working local environment for the RAG Recruitment Assistant.
