Overview
Universal Novel Scraper includes a Cloudflare bypass mechanism that uses Electron’s built-in Chromium browser to behave like a real user. When enabled, the scraper detects Cloudflare challenges and waits for you to solve them manually.
How It Works
The Cloudflare bypass system operates in several stages:
Detection Phase
When the scraper loads a chapter page, it automatically checks for Cloudflare indicators:
- Page title keywords (“just a moment”, “cloudflare”, etc.)
- Cloudflare DOM elements (challenge widgets, turnstile)
- URLs containing “cloudflare”
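The three checks above can be sketched as a single predicate. This is a minimal illustration, not the scraper's actual code: the `PageSnapshot` shape, the `looksLikeCloudflare` name, and the exact keyword list are assumptions, and the DOM check is reduced to a boolean that the caller would compute inside the page.

```typescript
// Hypothetical snapshot of a loaded page; field names are illustrative only.
interface PageSnapshot {
  title: string;
  url: string;
  // True if a challenge widget or Turnstile element was found in the DOM.
  hasChallengeElement: boolean;
}

// Keywords taken from the list above; the real list may contain more entries.
const TITLE_KEYWORDS = ["just a moment", "cloudflare"];

function looksLikeCloudflare(page: PageSnapshot): boolean {
  const title = page.title.toLowerCase();
  // Check 1: page title keywords (case-insensitive).
  const titleHit = TITLE_KEYWORDS.some((kw) => title.includes(kw));
  // Check 3: URL contains "cloudflare".
  const urlHit = page.url.toLowerCase().includes("cloudflare");
  // Check 2 (DOM elements) is supplied by the caller as a boolean.
  return titleHit || page.hasChallengeElement || urlHit;
}
```

Any single indicator is treated as a positive here; whether the real implementation combines them the same way is not stated in this page.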
Browser Window Display
When Cloudflare is detected and bypass is enabled, the scraper:
- Shows the hidden browser window
- Brings it to focus
- Displays a status message: “🛡️ Manual solve required”
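As a rough sketch of this step: Electron's `BrowserWindow` exposes `show()` and `focus()`, so the logic can be written against a small interface that a real window satisfies. The function name `promptManualSolve`, the `RevealableWindow` interface, and the `reportStatus` callback are hypothetical and only illustrate the sequence described above.

```typescript
// Minimal shape of the window methods used here. Electron's BrowserWindow
// provides show() and focus(), so a real window instance can be passed in.
interface RevealableWindow {
  show(): void;
  focus(): void;
}

// Status text quoted from the list above.
const MANUAL_SOLVE_STATUS = "🛡️ Manual solve required";

// Reveals the hidden window and reports the status message.
// Returns false (and does nothing) when bypass is disabled.
function promptManualSolve(
  win: RevealableWindow,
  bypassEnabled: boolean,
  reportStatus: (message: string) => void,
): boolean {
  if (!bypassEnabled) return false;
  win.show();   // make the hidden browser window visible
  win.focus();  // bring it to the foreground
  reportStatus(MANUAL_SOLVE_STATUS);
  return true;
}
```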
When to Enable Cloudflare Bypass
✅ You SHOULD enable it when:
- You see “Just a moment…” loading pages that never finish
- The site shows “Verify you are human” challenges
- Chapters fail to scrape with timeout errors
- The download console shows Cloudflare detection messages
❌ You DON'T need it when:
- Chapters are scraping successfully without errors
- The site loads normally in a regular browser
- You’re getting content extraction errors (this is a different issue)
- Scraping is just slow (use rate limiting instead)
How to Use
Enable Before Starting
Configure Job Settings
Before clicking “Start Scrape”, check the “Enable Cloudflare Bypass” checkbox.
Start Scraping
Click “Start Scrape”. If Cloudflare is detected:
- A browser window will appear automatically
- You’ll see the Cloudflare challenge page
- Solve any CAPTCHAs or checkboxes if present
- Wait for the page to load the actual content
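While you solve the challenge, the scraper has to wait for it to clear. One way to picture that wait is a polling loop with a deadline. The sketch below assumes the 60-second limit and the 2-second pause after solving mentioned under Troubleshooting; the function name, the polling interval, and the `isChallengePresent` callback are illustrative assumptions, not the scraper's real API.

```typescript
// Hypothetical helper: resolves true once the challenge is gone,
// or false if it is still present when the timeout expires.
async function waitForChallengeCleared(
  isChallengePresent: () => Promise<boolean>,
  timeoutMs = 60000, // 60-second limit from the Troubleshooting section
  pollMs = 500,      // assumed polling interval
  settleMs = 2000,   // 2-second pause after solving, before extraction
): Promise<boolean> {
  const sleep = (ms: number) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms));
  const deadline = Date.now() + timeoutMs;

  while (Date.now() < deadline) {
    if (!(await isChallengePresent())) {
      await sleep(settleMs); // let the real page finish loading
      return true;
    }
    await sleep(pollMs);
  }
  return false; // still blocked when the deadline passed
}
```

A caller would extract chapter content only when this resolves to `true`, and report a timeout otherwise.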
Optional: Show Browser Window
You can enable “Show Browser” mode to watch the scraping process in real time. This is useful for debugging provider scripts or for understanding why a site isn’t working.
Rate Limiting
When Cloudflare bypass is enabled, the scraper automatically uses longer delays between requests to appear more human:

| Mode | Delay Between Chapters |
|---|---|
| Normal | 100-500 ms |
| Cloudflare Bypass | 1500-4000 ms (1.5-4 seconds) |
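A randomized delay of this kind could be picked as follows. This is a sketch under assumptions: the ranges come from the table, but the mode names, the `pickDelayMs` function, and the uniform distribution are illustrative choices rather than the scraper's documented behavior.

```typescript
type ScrapeMode = "normal" | "cloudflareBypass";

// Inclusive [min, max] delay ranges in milliseconds, taken from the table.
const DELAY_RANGES_MS: Record<ScrapeMode, [number, number]> = {
  normal: [100, 500],
  cloudflareBypass: [1500, 4000],
};

// Picks a whole-millisecond delay uniformly within the mode's range.
// `random` is injectable so the function can be tested deterministically.
function pickDelayMs(
  mode: ScrapeMode,
  random: () => number = Math.random,
): number {
  const [min, max] = DELAY_RANGES_MS[mode];
  return Math.floor(min + random() * (max - min + 1));
}
```

The scraper would then sleep for `pickDelayMs(mode)` milliseconds before requesting the next chapter.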
Troubleshooting
Browser window never appears
Possible causes:
- Cloudflare bypass is not enabled
- The site doesn’t use Cloudflare (check browser console for errors)
- Electron window is hidden behind other windows (check your taskbar)
Solution:
- Stop the scrape
- Enable “Show Browser” toggle
- Restart the scrape and watch the browser window
Challenge keeps timing out
This happens when the challenge is not solved within 60 seconds.
Solution:
- Some challenges require multiple steps
- Click the checkbox immediately when it appears
- Solve any CAPTCHA puzzles quickly
- If you are using a VPN, try disconnecting (Cloudflare is stricter with VPN IPs)
Challenge solved but scraping still fails
The scraper waits 2 seconds after solving before extracting content.
Possible issues:
- The site uses additional JavaScript protections
- Content extraction selectors are incorrect (provider issue)
- The page requires authentication/login
Solution:
- Enable “Show Browser” to see what’s actually loading
- Check if the provider script is up to date
- See if the site requires a login (not currently supported)
Getting banned after multiple chapters
Even with Cloudflare bypass, aggressive scraping can trigger rate limits.
Solution:
- The random delays (1.5-4s) should prevent this in most cases
- Try scraping in smaller batches (pause and resume)
- Some sites have per-IP daily limits (wait 24 hours)
Technical Details
User Agent Spoofing
The scraper uses a realistic Chrome user agent.
Referrer Headers
Some provider scripts send a Google referrer to appear more legitimate.
Best Practices
Use Sparingly
Only enable Cloudflare bypass when necessary. Normal mode is 3-8x faster.
Be Patient
Wait the full 60 seconds if the challenge is complex. Don’t spam-click.
One Session
You usually only solve the challenge once at the start. Later chapters work automatically.
Respect Sites
Don’t abuse the bypass. Scrape responsibly and respect site terms of service.
Next Steps
Provider System
Learn how to customize scraping for specific websites
Scraping Flow
Understand the complete scraping pipeline
