System Requirements
Before installing, ensure your system meets these requirements:Python Version
Python 3.10 or 3.11 (recommended)
RAM
Minimum 8GB RAM (16GB recommended for large candidate pools)
Storage
~2GB free space for dependencies and embedding models
Operating System
Windows, macOS, or Linux
Installation Methods
- pip (Recommended)
- Poetry
- Docker
Option 1: Install with pip
The simplest installation method using Python’s package manager.Install Dependencies
langchain- LLM application frameworklangchain-community- Community integrationslangchain-google-genai- Gemini LLM integrationlangchain-huggingface- HuggingFace embeddingssentence-transformers- Neural embedding modelsfaiss-cpu- Vector similarity searchpypdf- PDF processingreportlab- PDF generationpandas==2.2.2- Data manipulationmatplotlib- Static visualizationsplotly- Interactive visualizations
Dependencies Breakdown
Here’s what gets installed fromrequirements.txt:
View requirements.txt
View requirements.txt
Core Dependencies
| Package | Version | Purpose |
|---|---|---|
langchain | latest | Framework for building LLM applications |
langchain-google-genai | latest | Gemini 1.5 Flash integration |
langchain-huggingface | latest | HuggingFace embeddings wrapper |
sentence-transformers | latest | Pre-trained embedding models |
faiss-cpu | latest | Vector similarity search engine |
pypdf | latest | PDF document parsing |
pandas | 2.2.2 | Structured data manipulation |
GPU Acceleration: If you have a CUDA-compatible GPU, replace
faiss-cpu with faiss-gpu for faster vector search on large candidate databases.API Key Setup
The system requires a Google API key to use Gemini 1.5 Flash. Follow these steps to obtain and configure it:Step 1: Get Your API Key
Visit Google AI Studio
Navigate to Google AI Studio
Google AI Studio offers a generous free tier with 60 queries per minute for Gemini 1.5 Flash. Perfect for testing and small-scale deployments.
Step 2: Configure Environment Variable
Set theGOOGLE_API_KEY environment variable in your system:
Verification Steps
After installation, verify everything is working correctly:1. Check Python Version
2. Verify Dependencies
3. Test API Connection
Create a test scripttest_setup.py:
test_setup.py
4. Verify FAISS Installation
Troubleshooting
ImportError: No module named 'faiss'
ImportError: No module named 'faiss'
Solution: Install the CPU version explicitly:If you have a GPU:
SSL Certificate Errors with HuggingFace
SSL Certificate Errors with HuggingFace
Solution: Update certificates or disable SSL verification (not recommended for production):Or set environment variable:
Google API Key Not Recognized
Google API Key Not Recognized
Solution: Ensure the key is properly set and restart your terminal:If it returns
None, the variable isn’t set. Re-run the export command and restart your Python session.Out of Memory Errors
Out of Memory Errors
Solution: Reduce batch size or use a smaller embedding model:
Pandas Version Conflicts
Pandas Version Conflicts
Solution: The project specifies
pandas==2.2.2 for stability. If you encounter issues:Optional: GPU Acceleration
For production deployments with large candidate databases (1000+ resumes), GPU acceleration significantly improves performance:Install CUDA Toolkit
Download from NVIDIA CUDA Downloads
GPU acceleration provides 10-100x speedup for vector search on large datasets. However, the free Google Colab environment already includes GPU support, making it unnecessary for most users.
Next Steps
You’re all set! Here’s what to do next:Run the Quickstart
Try the system with sample data in 5 minutes
Architecture Guide
Understand how components work together
Configuration
Customize models, prompts, and retrieval settings
API Reference
Explore available functions and classes
Installation Complete! You now have a working local environment for the RAG Recruitment Assistant.