Overview
PentAGI integrates with multiple search engines to gather intelligence during penetration testing. Search engine configuration is optional, but enabling it significantly improves PentAGI’s ability to research vulnerabilities, exploit techniques, and security tools. You can enable one or more search providers.
DuckDuckGo
DuckDuckGo provides privacy-focused web search without requiring an API key.
- Enable DuckDuckGo search engine integration
Features
- No API key required
- Privacy-focused search
- No rate limits
- Free to use
Example Configuration
.env
Google Custom Search
Google Custom Search provides comprehensive web search results with programmable access.
- Your Google API key from Google Cloud Console
- Custom Search Engine ID from Programmable Search Engine
- Language restriction code (optional)
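To illustrate how the three values above fit together, here is a minimal Python sketch that builds a request URL for Google's Custom Search JSON API. It is not PentAGI's own code; the function and argument names are hypothetical, and the placeholder key and engine ID must be replaced with your own values.

```python
from urllib.parse import urlencode

# Public endpoint of the Custom Search JSON API.
GOOGLE_CSE_ENDPOINT = "https://www.googleapis.com/customsearch/v1"


def build_google_search_url(api_key, cx, query, lr=None):
    """Return a Custom Search JSON API URL for the given query.

    The argument names are illustrative, not PentAGI setting names.
    """
    params = {"key": api_key, "cx": cx, "q": query}
    if lr:  # the language restriction is optional, e.g. "lang_en"
        params["lr"] = lr
    return GOOGLE_CSE_ENDPOINT + "?" + urlencode(params)


if __name__ == "__main__":
    # Placeholder credentials; substitute your own API key and CX ID.
    print(build_google_search_url("YOUR_API_KEY", "YOUR_CX_ID",
                                  "apache path traversal", lr="lang_en"))
```

The language restriction is only added when it is set, which mirrors its optional status in the configuration.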
Setup Instructions
- Create a Google Cloud Project at console.cloud.google.com
- Enable Custom Search API in your project
- Create API credentials (API Key)
- Create a Custom Search Engine at programmablesearchengine.google.com
- Get your Search Engine ID (CX key)
Example Configuration
.env
Tavily
Tavily provides AI-optimized search specifically designed for LLM applications.
- Your Tavily API key from tavily.com
Features
- Optimized for AI/LLM use cases
- Clean, structured results
- Fast response times
- Focused on developer documentation and technical content
Setup Instructions
- Visit tavily.com
- Sign up for an account
- Generate an API key from your dashboard
- Add the key to your .env file
Example Configuration
.env
Traversaal
Traversaal provides advanced web search with AI-powered result ranking.
- Your Traversaal API key from traversaal.ai
Features
- AI-powered result ranking
- Developer-focused search
- Technical documentation prioritization
- Clean JSON responses
Setup Instructions
- Visit traversaal.ai
- Create an account
- Generate an API key
- Add the key to your .env file
Example Configuration
.env
Perplexity
Perplexity combines search with AI-powered answer generation for comprehensive results.
- Your Perplexity API key from perplexity.ai
- Perplexity model to use for search. Available models:
  - sonar: Standard model
  - sonar-pro: Enhanced model with better accuracy
- Context size for search results. Available sizes:
  - low: Minimal context, faster responses
  - medium: Balanced context and speed
  - high: Maximum context, slower responses
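Because the model and context size each accept only a fixed set of values, a typo can be caught before PentAGI starts. The following Python sketch checks both values against the options listed above. The function name is hypothetical, and this is not how PentAGI itself validates its configuration.

```python
# Allowed values as listed in this section.
ALLOWED_MODELS = {"sonar", "sonar-pro"}
ALLOWED_CONTEXT_SIZES = {"low", "medium", "high"}


def validate_perplexity_settings(model, context_size):
    """Return a list of problems; an empty list means both values are valid."""
    problems = []
    if model not in ALLOWED_MODELS:
        problems.append(
            f"unknown model {model!r}; expected one of {sorted(ALLOWED_MODELS)}"
        )
    if context_size not in ALLOWED_CONTEXT_SIZES:
        problems.append(
            f"unknown context size {context_size!r}; "
            f"expected one of {sorted(ALLOWED_CONTEXT_SIZES)}"
        )
    return problems


if __name__ == "__main__":
    for problem in validate_perplexity_settings("sonar-pro", "huge"):
        print(problem)
```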
Features
- AI-powered answer synthesis
- Multiple model options
- Configurable context size
- Real-time web search combined with AI analysis
Setup Instructions
- Visit perplexity.ai
- Sign up for API access
- Generate an API key from your account settings
- Add the key to your .env file
Example Configuration
.env
Searxng
Searxng is a privacy-respecting metasearch engine that aggregates results from multiple sources.
- URL of your Searxng instance
- Search categories to query (comma-separated). Available categories:
  - general: General web search
  - it: IT and technology
  - science: Scientific content
  - news: News articles
  - social media: Social media content
- Language code for search results
- Safe search level. Values:
  - 0: Disabled
  - 1: Moderate
  - 2: Strict
- Time range for search results. Available ranges:
  - day: Last 24 hours
  - week: Last week
  - month: Last month
  - year: Last year
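The options above correspond closely to the query parameters of the Searxng search endpoint. The Python sketch below shows how an instance URL, categories, language, safe search level, and time range could be combined into one request URL. The function name and the example instance URL are hypothetical, PentAGI may build its requests differently, and the `format=json` parameter only works on instances where JSON output is enabled.

```python
from urllib.parse import urlencode

# Allowed values as listed in this section.
ALLOWED_SAFESEARCH = {0, 1, 2}
ALLOWED_TIME_RANGES = {"day", "week", "month", "year"}


def build_searxng_url(base_url, query, categories=("general",),
                      language=None, safesearch=0, time_range=None):
    """Return a search URL for a Searxng instance (illustrative helper)."""
    if safesearch not in ALLOWED_SAFESEARCH:
        raise ValueError(f"safesearch must be 0, 1, or 2, got {safesearch!r}")
    if time_range is not None and time_range not in ALLOWED_TIME_RANGES:
        raise ValueError(f"unsupported time range: {time_range!r}")

    params = {
        "q": query,
        "categories": ",".join(categories),  # comma-separated list
        "safesearch": safesearch,
        "format": "json",
    }
    # Optional settings are only sent when they are set.
    if language:
        params["language"] = language
    if time_range:
        params["time_range"] = time_range
    return base_url.rstrip("/") + "/search?" + urlencode(params)


if __name__ == "__main__":
    print(build_searxng_url("https://searx.example.org", "cve-2021-44228",
                            categories=("general", "it"), language="en",
                            safesearch=1, time_range="month"))
```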
Features
- Privacy-respecting metasearch
- Aggregates results from 70+ search engines
- Self-hosted option for complete control
- No tracking or profiling
- Highly configurable
Setup Instructions
Option 1: Use a Public Instance
- Find a public Searxng instance at searx.space
- Add the instance URL to your .env file
Option 2: Self-Host an Instance
- Follow the Searxng installation guide
- Deploy using Docker
- Add your instance URL to your .env file
Example Configuration
.env
Complete Example Configuration
Here’s a complete example with all search engines configured:
.env
Search Engine Selection Strategy
PentAGI automatically selects the most appropriate search engine based on:
- Availability: Only configured search engines are used
- Query Type: Different engines excel at different query types
- Rate Limits: Automatically rotates between engines if rate limited
- Response Quality: Prioritizes engines with better historical results
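The four criteria above can be pictured as a filter followed by a ranking step. The Python sketch below is one possible reading, not PentAGI's actual selection logic: the `Engine` fields, the query-type tags, and the quality scores are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Engine:
    name: str
    configured: bool                   # availability: key or URL is set
    rate_limited: bool = False         # currently throttled by the provider
    quality: float = 0.0               # hypothetical score from past results
    good_for: frozenset = frozenset()  # hypothetical query-type tags


def pick_engine(engines, query_type):
    """Return the preferred engine for a query type, or None if none is usable."""
    # Availability and rate limits decide which engines are eligible at all.
    candidates = [e for e in engines if e.configured and not e.rate_limited]
    if not candidates:
        return None
    # Among eligible engines, prefer a query-type match, then the quality score.
    return max(candidates, key=lambda e: (query_type in e.good_for, e.quality))


if __name__ == "__main__":
    engines = [
        Engine("duckduckgo", True, quality=0.6),
        Engine("tavily", True, quality=0.8, good_for=frozenset({"docs"})),
        Engine("google", False, quality=0.9),
    ]
    print(pick_engine(engines, "docs").name)
```

Ranking on a tuple keeps the priorities explicit: a query-type match outranks the quality score, and an engine that is unconfigured or rate limited is never considered.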
Recommended Configurations
Minimal Setup:
Proxy Configuration
All search engines support routing through a proxy:
- Global HTTP proxy URL for all search engines and external systems
The proxy is particularly useful for:
- Network isolation in secure environments
- Routing through VPNs
- Corporate proxy requirements
- Testing from specific geographic locations
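As an illustration of what a global proxy setting does to outbound requests, the Python sketch below builds a standard-library opener that sends both HTTP and HTTPS traffic through a single proxy URL. It is not PentAGI's implementation, and the function name and the local proxy address are hypothetical.

```python
import urllib.request


def build_proxied_opener(proxy_url):
    """Return an opener that sends HTTP and HTTPS requests through proxy_url."""
    handler = urllib.request.ProxyHandler(
        {"http": proxy_url, "https": proxy_url}
    )
    return urllib.request.build_opener(handler)


if __name__ == "__main__":
    # Hypothetical local proxy, for example an intercepting proxy on port 8080.
    opener = build_proxied_opener("http://127.0.0.1:8080")
    # opener.open("https://example.com") would now be routed via the proxy.
    print(type(opener).__name__)
```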
Next Steps
- LLM Providers: Configure OpenAI, Anthropic, Gemini, and more
- Security Settings: Configure SSL, authentication, and secrets
- Observability: Set up monitoring and analytics