Common Issues
Installation Problems
Playwright Not Installed
Missing Dependencies
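A quick way to narrow down an import failure is to check which packages are actually importable in the active environment. The sketch below uses only the standard library; the package list is an assumption and should be adjusted to whatever your graphs import.

```python
import importlib.util


def find_missing(packages):
    """Return the top-level package names that cannot be imported here."""
    return [name for name in packages if importlib.util.find_spec(name) is None]


# Assumed list for illustration; replace with the packages your graphs use.
REQUIRED = ["scrapegraphai", "playwright"]

if __name__ == "__main__":
    missing = find_missing(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
        print("Install them with pip. Playwright also needs its browser")
        print("binaries, which `playwright install` downloads.")
    else:
        print("All required packages are importable.")
```

Run it with the same interpreter that runs your scraper, since a package installed in a different virtual environment will still show up as missing.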
Graph Execution Errors
Empty Results
Problem: Graph executes but returns empty or null results.
Node Execution Failures
LLM Issues
API Key Errors
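API key errors are often caused by a variable that is unset or empty in the shell that launches the script. A minimal sketch that fails early with a readable message is shown here; the environment variable name and the model string are assumptions, so substitute the ones your provider expects.

```python
import os


def llm_config_from_env(env_var="OPENAI_API_KEY", model="openai/gpt-4o-mini"):
    """Build the `llm` part of a graph config, failing early if the key is absent.

    Both default arguments are illustrative assumptions.
    """
    api_key = os.environ.get(env_var, "").strip()
    if not api_key:
        raise RuntimeError(
            f"{env_var} is not set or is empty; export it before running the graph."
        )
    return {"api_key": api_key, "model": model}
```

Calling this before building the graph turns an opaque provider error into a message that names the missing variable.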
Rate Limits
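One common way to ride out provider rate limits is to retry with exponential backoff. The helper below is a generic sketch, not part of the library: `retry_with_backoff` and its parameters are hypothetical names, and you would normally narrow `retry_on` to the rate-limit exception your LLM client raises.

```python
import random
import time


def retry_with_backoff(func, max_attempts=5, base_delay=1.0, max_delay=30.0,
                       retry_on=(Exception,), sleep=time.sleep):
    """Call func() and retry on failure, doubling the wait after each attempt."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            delay = min(max_delay, base_delay * 2 ** (attempt - 1))
            # Up to 10% random jitter so parallel workers do not retry in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))
```

A call such as `retry_with_backoff(graph.run)` would wrap a whole graph run, assuming `graph` is an already constructed graph object.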
Token Limit Exceeded
Reduce Input Size
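If the page text is too large for the model's context window, splitting it into overlapping chunks is one way to reduce input size. This is a character-based sketch with a hypothetical helper name; actual token counts depend on the model's tokenizer, so pick `max_chars` conservatively.

```python
def split_text(text, max_chars=4000, overlap=200):
    """Split text into chunks of at most max_chars, overlapping by `overlap` chars."""
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    if not 0 <= overlap < max_chars:
        raise ValueError("overlap must be >= 0 and smaller than max_chars")
    chunks = []
    step = max_chars - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + max_chars])
        if start + max_chars >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks
```

The overlap keeps sentences that straddle a boundary visible in both neighbouring chunks, at the cost of sending a little duplicate text.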
Scraping Issues
Timeout Errors
JavaScript-Heavy Sites
Problem: Content not loading because JavaScript isn’t executed.
Anti-Scraping Measures
Problem: Website blocks or detects the scraper.
Custom Node Issues
Input Expression Errors
Output Not Updating State
Problem: Node executes but state doesn’t contain expected keys.
Debugging Techniques
Enable Logging
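A minimal sketch of turning on detailed output, combining Python's standard `logging` module with the `verbose` graph option mentioned in the FAQ. The logger name `scraper_debug` is a hypothetical choice, and the config shown contains only the key relevant here.

```python
import logging

# force=True replaces any handlers configured earlier (Python 3.8+).
logging.basicConfig(
    level=logging.DEBUG,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
    force=True,
)

# Third-party HTTP libraries can be noisy at DEBUG level; quiet them if needed.
logging.getLogger("urllib3").setLevel(logging.WARNING)

logger = logging.getLogger("scraper_debug")  # hypothetical logger name

# Partial graph config: merge this key into your own configuration.
graph_config = {"verbose": True}

logger.debug("Graph config keys: %s", sorted(graph_config))
```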
Inspect State at Each Node
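One way to inspect state is to wrap each step so that it reports which keys it added. The sketch below assumes state is a plain dict passed from one step to the next; `summarize_state` and `traced` are hypothetical helpers, not library functions.

```python
def summarize_state(state, max_len=60):
    """Return a short type-and-value description for each key in the state."""
    summary = {}
    for key, value in state.items():
        text = repr(value)
        if len(text) > max_len:
            text = text[:max_len] + "..."
        summary[key] = f"{type(value).__name__}: {text}"
    return summary


def traced(name, execute):
    """Wrap an execute(state) callable and print the keys it adds to the state."""
    def wrapper(state):
        before = set(state)
        new_state = execute(state)
        added = sorted(set(new_state) - before)
        print(f"[{name}] added keys: {added}")
        return new_state
    return wrapper
```

If a step reports no added keys, the problem is usually in that step's output mapping rather than further downstream.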
Test Nodes in Isolation
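To test a node in isolation, call its execute step directly with a hand-built state instead of running the whole graph. The class below is a hypothetical stand-in for a custom node, assuming the node reads from and writes to a dict-like state; a real node would follow the library's own base class.

```python
class WordCountNode:
    """Hypothetical custom node: reads `doc` from the state, writes `word_count`."""

    def execute(self, state):
        state["word_count"] = len(state["doc"].split())
        return state


def test_word_count_node():
    # Build only the keys this node needs, with small, known values.
    state = {"doc": "three short words"}
    result = WordCountNode().execute(state)
    assert result["word_count"] == 3
    assert result["doc"] == "three short words"  # input key is left untouched
```

Because the state is built by hand, a failure here points at the node itself rather than at fetching or parsing.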
Use Try-Except Blocks
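A small wrapper can log the full traceback and return a fallback value instead of crashing a batch job. This is a generic sketch; `run_safely` is a hypothetical helper name.

```python
import logging


def run_safely(run, default=None):
    """Call run() and return its result; on any exception, log it and return default."""
    try:
        return run()
    except Exception:
        # logging.exception records the message together with the traceback.
        logging.exception("Graph execution failed")
        return default
```

For example, `result = run_safely(graph.run, default={})` keeps a loop over many URLs going after one failure, assuming `graph` is an already constructed graph object.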
Performance Optimization
Slow Execution
Optimization Strategies
1. Use Faster Models
2. Reduce Chunk Size
3. Skip Unnecessary Nodes
4. Parallelize Multiple Scrapes
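Parallelizing multiple scrapes can be sketched with a thread pool, since most of the time is spent waiting on the network and the LLM. `scrape_all` is a hypothetical helper, and `scrape_one` stands for whatever function runs a single scrape in your code.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def scrape_all(urls, scrape_one, max_workers=4):
    """Run scrape_one(url) for each URL concurrently.

    Returns a dict mapping each URL to its result, or to the exception it raised.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(scrape_one, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:
                results[url] = exc  # keep going; the caller decides how to handle it
    return results
```

Keep `max_workers` low enough to stay within your provider's rate limits.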
Memory Issues
Problem: High memory usage with large documents.
Getting Help
Before Asking for Help
Check Documentation
Review the official documentation and examples.
Search Issues
Check GitHub Issues for similar problems.
Community Resources
- Discord: Join the community
- GitHub Discussions: Ask questions
- GitHub Issues: Report bugs
- Documentation: docs.scrapegraphai.com
FAQ
Why is my scraper returning null or empty results?
Check that:
- Your prompt is clear and specific
- The URL is accessible and contains the expected content
- JavaScript has time to load (increase timeout)
- You’re using verbose: True to see what’s happening
How do I scrape JavaScript-heavy websites?
Use the FetchNode with appropriate wait conditions:
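The following is a sketch of what such a configuration might look like. The `loader_kwargs`, `load_state` and `timeout` keys, and the `FetchNode` constructor arguments, are assumptions to verify against the version you have installed. The three load-state names are the ones Playwright itself defines.

```python
# Load states defined by Playwright; "networkidle" waits for network activity to settle.
PLAYWRIGHT_LOAD_STATES = {"load", "domcontentloaded", "networkidle"}


def js_fetch_config(headless=True, load_state="networkidle", timeout=60):
    """Build a node config for JavaScript-heavy pages (key names are assumptions)."""
    if load_state not in PLAYWRIGHT_LOAD_STATES:
        raise ValueError(f"load_state must be one of {sorted(PLAYWRIGHT_LOAD_STATES)}")
    return {
        "headless": headless,
        "loader_kwargs": {"load_state": load_state, "timeout": timeout},
    }


def build_fetch_node(node_config):
    """Create a FetchNode; the import is lazy so the config can be built without the package."""
    from scrapegraphai.nodes import FetchNode  # assumed import path
    return FetchNode(input="url | local_dir", output=["doc"], node_config=node_config)
```

If content is still missing, raising the timeout is usually the first thing to try before changing the load state.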
Can I use local LLMs instead of OpenAI?
Yes! Use Ollama or other local models:
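A minimal sketch, assuming Ollama is serving on its default port (11434) and the model has already been pulled. The model name and the `base_url` key are assumptions; check them against your installed version.

```python
def local_graph_config(model="ollama/llama3", base_url="http://localhost:11434"):
    """Build a graph config that points at a local Ollama server (assumed key names)."""
    return {
        "llm": {"model": model, "base_url": base_url},
        "verbose": True,
    }


def run_local_scrape(prompt, url):
    """Run a scrape against the local model; requires scrapegraphai and a running Ollama."""
    from scrapegraphai.graphs import SmartScraperGraph  # lazy import
    graph = SmartScraperGraph(prompt=prompt, source=url, config=local_graph_config())
    return graph.run()
```

Local models tend to have smaller context windows than hosted ones, so reducing input size matters more here.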
How do I handle CAPTCHAs?
CAPTCHAs typically require manual solving. Consider:
- Using authenticated sessions (cookies)
- Using the storage_state option to persist auth
- Third-party CAPTCHA solving services
- Checking if the site offers an API
My graph is slow. How can I speed it up?
- Use faster models (for example, gpt-3.5-turbo instead of gpt-4)
- Reduce chunk sizes
- Remove unnecessary nodes
- Parallelize multiple scrapes
- Use caching for repeated scrapes
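Caching repeated scrapes can be as simple as remembering results by URL and prompt. The sketch below is an in-memory cache that lasts for the lifetime of the process; `make_cached` is a hypothetical helper, and `scrape` stands for your own function that runs one scrape.

```python
def make_cached(scrape):
    """Wrap scrape(url, prompt) so identical calls reuse the first result."""
    cache = {}

    def cached(url, prompt):
        key = (url, prompt)
        if key not in cache:
            cache[key] = scrape(url, prompt)  # only runs on a cache miss
        return cache[key]

    return cached
```

This only helps when the page content is stable between calls; for pages that change often, add an expiry time or skip caching.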
Next Steps
- Review custom graphs documentation
- Learn about integrations
- Understand telemetry and privacy
