Firecrawl Projects
A comprehensive collection of Python automation tools built on top of Firecrawl CLI, designed for web scraping, competitive research, documentation extraction, and content monitoring.
Overview
Firecrawl Projects provides five powerful command-line tools that leverage the Firecrawl CLI to automate common web research and monitoring tasks. Each tool is designed to be run independently or chained together for complex workflows.
Deep Research
Search and synthesize information from multiple web sources
Competitor Analysis
Compare and analyze competitor websites automatically
Documentation Scraper
Extract entire documentation sites into local markdown
Lead Extractor
Find contact information from company websites
Content Monitor
Track website changes over time with diff detection
Prerequisites
Tools
1. Deep Research
Comprehensive topic research that searches the web and synthesizes findings.
Searches the web for your topic, scrapes the top results, and generates a detailed markdown research report.
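The report-generation step can be sketched as follows. This is a minimal illustration, not the tool's actual code: the `Source` structure, the `build_report` function, and the report layout are all assumptions made for this example, and the search and scrape steps are taken as already done.

```python
from dataclasses import dataclass


@dataclass
class Source:
    """One scraped search result (hypothetical structure for this sketch)."""
    title: str
    url: str
    markdown: str


def build_report(topic: str, sources: list[Source], excerpt_chars: int = 300) -> str:
    """Assemble a markdown report with one numbered section per source."""
    lines = [f"# Research Report: {topic}", "", f"Sources reviewed: {len(sources)}", ""]
    for i, src in enumerate(sources, start=1):
        # Collapse whitespace so the excerpt reads as a single paragraph.
        excerpt = " ".join(src.markdown.split())[:excerpt_chars]
        lines += [f"## {i}. {src.title}", f"Source: {src.url}", "", excerpt, ""]
    return "\n".join(lines)
```

A real synthesis step would likely summarize across sources rather than list excerpts; the sketch only shows how scraped results might be collected into a single markdown file.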
2. Competitor Analysis
Analyze and compare competitor websites automatically.
What It Does
- Maps website structure of each competitor
- Categorizes pages (pricing, features, about, blog, docs)
- Scrapes key pages for content analysis
- Generates comparison report
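The page-categorization step might look something like the sketch below. The keyword table and the function names are hypothetical; the tool's real rules are not shown in this document, so treat this as one plausible way to bucket mapped URLs into the listed categories.

```python
from urllib.parse import urlparse

# Hypothetical keyword table (assumption for this sketch).
CATEGORY_KEYWORDS = {
    "pricing": ("pricing", "plans"),
    "features": ("features", "product"),
    "about": ("about", "company", "team"),
    "blog": ("blog", "news"),
    "docs": ("docs", "documentation", "guide"),
}


def categorize(url: str) -> str:
    """Return the first category whose keyword appears in a path segment."""
    segments = [s for s in urlparse(url).path.lower().split("/") if s]
    for category, keywords in CATEGORY_KEYWORDS.items():
        if any(kw in seg for seg in segments for kw in keywords):
            return category
    return "other"


def group_by_category(urls: list[str]) -> dict[str, list[str]]:
    """Group a competitor's mapped URLs by category, preserving input order."""
    groups: dict[str, list[str]] = {}
    for url in urls:
        groups.setdefault(categorize(url), []).append(url)
    return groups
```

Matching on path segments rather than the whole URL avoids classifying a page by its domain name, at the cost of missing sites that use unconventional paths.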
3. Documentation Scraper
Extract and organize documentation from any website into local markdown files.
Features
- Discovers all documentation pages
- Filters to relevant URLs
- Converts to clean markdown
- Creates organized index
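The filtering and indexing steps could be sketched as below. The `/docs` path prefix, the flat filename scheme, and all function names are assumptions for illustration; the actual tool may filter and organize files differently.

```python
from urllib.parse import urlparse


def filter_doc_urls(urls: list[str], prefix: str = "/docs") -> list[str]:
    """Keep unique URLs whose path is under `prefix`, preserving order."""
    seen, kept = set(), []
    for url in urls:
        path = urlparse(url).path.rstrip("/")
        if (path == prefix or path.startswith(prefix + "/")) and path not in seen:
            seen.add(path)
            kept.append(url)
    return kept


def url_to_filename(url: str) -> str:
    """Map a page URL to a flat markdown filename (slashes become hyphens)."""
    path = urlparse(url).path.strip("/")
    return (path.replace("/", "-") or "index") + ".md"


def build_index(urls: list[str]) -> str:
    """Build a markdown index linking each page path to its local file."""
    lines = ["# Documentation Index", ""]
    lines += [f"- [{urlparse(u).path}]({url_to_filename(u)})" for u in urls]
    return "\n".join(lines)
```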
4. Lead Extractor
Extract business contact information from company websites.
What It Extracts
- Email addresses
- Phone numbers
- Social media links (LinkedIn, Twitter, Facebook)
- Company name and description
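As a rough illustration of how the first three items might be pulled from scraped page text, the sketch below uses regular expressions. The patterns are deliberately simple assumptions, not the tool's own; real-world phone and email formats vary more than these cover.

```python
import re

# Simplified patterns (assumptions for this sketch, not exhaustive).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")
SOCIAL_RE = re.compile(
    r"https?://(?:www\.)?(?:linkedin\.com|twitter\.com|facebook\.com)/[^\s)\"'<>]+",
    re.IGNORECASE,
)


def extract_leads(text: str) -> dict[str, list[str]]:
    """Return de-duplicated, sorted emails, phone numbers, and social links."""
    return {
        "emails": sorted(set(EMAIL_RE.findall(text))),
        "phones": sorted({m.group().strip() for m in PHONE_RE.finditer(text)}),
        "social": sorted(set(SOCIAL_RE.findall(text))),
    }
```

Company name and description usually come from page metadata or headings rather than pattern matching, so they are left out of this sketch.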
5. Content Monitor
Track changes on websites over time with automatic diff detection.
Features
- Monitor pages for content changes
- Save snapshots over time
- Show diffs when changes detected
- Maintain complete history
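The snapshot-and-diff cycle behind these features can be sketched with the standard library. The storage layout (one JSON history file per URL, named by a hash of the URL) and the function names are assumptions for this example; only the idea of keeping snapshots and showing diffs comes from the description above.

```python
import difflib
import hashlib
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Optional


def content_hash(text: str) -> str:
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def diff_text(old: str, new: str) -> str:
    """Unified diff between two snapshots; empty string when identical."""
    return "\n".join(difflib.unified_diff(
        old.splitlines(), new.splitlines(),
        fromfile="previous", tofile="current", lineterm="",
    ))


def check(url: str, new_text: str, store: Path) -> Optional[str]:
    """Append a snapshot when content changed and return the diff, else None.

    The first check for a URL only records a baseline and returns None.
    """
    store.mkdir(parents=True, exist_ok=True)
    history_file = store / f"{content_hash(url)[:16]}.json"  # hypothetical layout
    history = []
    if history_file.exists():
        history = json.loads(history_file.read_text(encoding="utf-8"))
    previous = history[-1]["content"] if history else None
    if previous == new_text:
        return None
    history.append({
        "taken_at": datetime.now(timezone.utc).isoformat(),
        "hash": content_hash(new_text),
        "content": new_text,
    })
    history_file.write_text(json.dumps(history, indent=2), encoding="utf-8")
    return diff_text(previous, new_text) if previous is not None else None
```

Appending rather than overwriting keeps the complete history, so any two snapshots can be diffed later.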
Installation
Common Patterns
Chaining Tools
Combine tools for powerful workflows:
Scheduled Monitoring
Windows Task Scheduler:
- Open Task Scheduler
- Create Basic Task
- Set trigger (e.g., daily at 9 AM)
- Action: Start a program
- Program: python
- Arguments: content_monitor.py check
- Start in: C:\Users\ulilj\Firecrawl Projects\tools
Batch Lead Extraction
Output Formats
- Markdown
- JSON
- CSV
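To illustrate how one set of results could be rendered in each of the three formats, here is a small sketch. The function names and the flat list-of-dicts input are assumptions; the tools' actual output schemas are not shown in this document.

```python
import csv
import io
import json


def to_json(rows: list[dict[str, str]]) -> str:
    return json.dumps(rows, indent=2)


def to_csv(rows: list[dict[str, str]]) -> str:
    """Render rows as CSV text, using the first row's keys as the header."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def to_markdown(rows: list[dict[str, str]]) -> str:
    """Render rows as a markdown table."""
    headers = list(rows[0].keys())
    lines = [
        "| " + " | ".join(headers) + " |",
        "| " + " | ".join("---" for _ in headers) + " |",
    ]
    for row in rows:
        lines.append("| " + " | ".join(str(row[h]) for h in headers) + " |")
    return "\n".join(lines)
```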
Firecrawl CLI Integration
All tools use the Firecrawl CLI with these key features:
--only-main-content
Extracts only the main content, removing navigation and ads
--format markdown
Returns clean markdown instead of HTML
--wait-for
Waits for JavaScript to render before scraping
Rate Limiting
Respects API rate limits automatically
Example CLI Calls
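One way a Python tool might assemble and run a scrape call is sketched below. It uses only the `firecrawl scrape` command and the flags named on this page. The helper names, the assumption that `--wait-for` takes a value in milliseconds, and the assumption that the CLI writes the scraped content to standard output are not confirmed by this document.

```python
import subprocess
from typing import Optional


def build_scrape_command(url: str, only_main_content: bool = True,
                         fmt: str = "markdown",
                         wait_for_ms: Optional[int] = None) -> list[str]:
    """Build the argument list for a `firecrawl scrape` call."""
    cmd = ["firecrawl", "scrape", url]
    if only_main_content:
        cmd.append("--only-main-content")
    if fmt:
        cmd += ["--format", fmt]
    if wait_for_ms is not None:
        # Assumption: --wait-for takes a duration in milliseconds.
        cmd += ["--wait-for", str(wait_for_ms)]
    return cmd


def run_scrape(url: str, **options) -> str:
    """Run the CLI and return its stdout.

    Requires `firecrawl` on PATH and a configured API key.
    Assumption: the scraped content is written to stdout.
    """
    result = subprocess.run(build_scrape_command(url, **options),
                            capture_output=True, text=True, check=True)
    return result.stdout
```

Passing the command as a list instead of a shell string avoids quoting problems with URLs that contain `&` or `?`.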
Tips & Best Practices
Respect Rate Limits
Space out large scraping jobs. The CLI handles rate limiting but be considerate.
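A simple way to space out a large job is to pause between items. The `spaced` helper below is a hypothetical sketch, and the two-second default is an arbitrary choice rather than a documented Firecrawl limit.

```python
import time
from typing import Callable, Iterable, Iterator


def spaced(items: Iterable[str], delay_seconds: float = 2.0,
           sleep: Callable[[float], None] = time.sleep) -> Iterator[str]:
    """Yield items one at a time, pausing between them (not before the first)."""
    for i, item in enumerate(items):
        if i:
            sleep(delay_seconds)
        yield item
```

A loop such as `for url in spaced(urls): ...` then processes one URL per interval; the `sleep` parameter exists only so the delay can be replaced in tests.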
Troubleshooting
Firecrawl CLI not found
Use the full path to the firecrawl executable, or add its directory to PATH in Windows:
- System Properties → Environment Variables
- Edit PATH
- Add:
C:\Users\ulilj\Firecrawl Projects\firecrawl-cli\.venv\Scripts
API key errors
Reconfigure the API key in the Firecrawl CLI, then verify that the CLI accepts it.
No results returned
- Check if the URL is accessible
- Try scraping manually first: firecrawl scrape URL
- Increase wait time for JS-heavy sites
- Verify API quota hasn’t been exceeded
Content Monitor not detecting changes
- Ensure the page content actually changed
- Some sites use dynamic timestamps that always show as changed
- Check the .content_monitor/ directory for stored snapshots
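For pages with dynamic timestamps, one possible workaround is to mask volatile text before comparing snapshots. The patterns and function names below are hypothetical examples, not part of the Content Monitor tool, and would need tuning for each site.

```python
import hashlib
import re

# Hypothetical patterns for text that changes on every page load.
VOLATILE_PATTERNS = [
    # ISO-style date and time, e.g. 2024-01-01T10:00:00Z
    re.compile(r"\d{4}-\d{2}-\d{2}[T ]\d{2}:\d{2}(?::\d{2})?(?:\.\d+)?(?:Z|[+-]\d{2}:?\d{2})?"),
    # Clock times, e.g. 9:05 AM
    re.compile(r"\b\d{1,2}:\d{2}(?::\d{2})?\s?(?:AM|PM)\b", re.IGNORECASE),
    # Relative times, e.g. "5 minutes ago"
    re.compile(r"\b\d+\s+(?:second|minute|hour|day)s?\s+ago\b", re.IGNORECASE),
]


def normalize(text: str) -> str:
    """Replace volatile timestamps with a fixed placeholder."""
    for pattern in VOLATILE_PATTERNS:
        text = pattern.sub("<time>", text)
    return text


def stable_hash(text: str) -> str:
    """Hash of the normalized text, unaffected by masked timestamps."""
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()
```

Comparing `stable_hash` values instead of raw content means a page is reported as changed only when something other than the masked timestamps differs.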
Use Cases
Market Research
Track competitor pricing, features, and messaging changes over time
Sales Intelligence
Extract contact information for lead generation and outreach
Documentation Archive
Backup documentation before major version updates
Content Strategy
Analyze competitor content and identify gaps in your own
SEO Research
Research what content ranks well for your target keywords
Due Diligence
Gather comprehensive information about companies for investment research
Get Started with Firecrawl
Sign up for Firecrawl API access to use these tools