What is BlackWeb?
BlackWeb aggregates and consolidates hundreds of public blocklists from trusted sources worldwide, creating a unified, optimized blocklist specifically formatted for Squid-Cache proxy servers. This allows network administrators to easily implement comprehensive web filtering across their infrastructure.BlackWeb is NOT a blacklist service itself. It does not independently verify domains. Its purpose is to consolidate and reformat public blacklist sources to make them compatible with Squid-Cache.
Key Statistics
| ACL | Blocked Domains | File Size |
|---|---|---|
| blackweb.txt | 4,772,375 | 118.8 MB |
What Does BlackWeb Block?
BlackWeb consolidates blocklists targeting various categories of unwanted content:- Malware & Phishing: Malicious domains, phishing sites, and fraud attempts
- Trackers & Spyware: Advertising trackers, analytics, and surveillance software
- Adult Content: Pornography and explicit material
- Social Networks: Optional blocking of social media platforms
- Downloads & Warez: Piracy and illegal download sites
- Drugs & Weapons: Illegal marketplaces and related content
- Cryptocurrency Mining: Browser-based cryptominers
- Spam & Bots: Known spam domains and bot networks
Key Features
Massive Coverage
Nearly 4.8 million blocked domains from over 100 curated public sources
Squid-Optimized
Pre-formatted for Squid-Cache with optimized domain structure
Regular Updates
Automated update process with DNS validation and debugging
False Positive Filtering
Built-in allowlists for essential services (Google, Yahoo, GitHub, etc.)
How It Works
BlackWeb follows a comprehensive processing pipeline:- Collection: Downloads blocklists from 100+ public sources
- Extraction: Captures domains using regex pattern matching
- Normalization: Converts to Squid-Cache format (
.domain.com) - Deduplication: Removes overlapping subdomains (
.sub.example.comis removed if.example.comexists) - TLD Validation: Verifies domains have valid top-level domains
- Punycode Processing: Converts international domains to ASCII-compatible format
- DNS Lookup: Validates that domains actually exist (two-step verification)
- Filtering: Removes government domains (.gov, .mil) and allowlisted entries
- Optimization: Final sorting and compression
Source Transparency
BlackWeb aggregates data from over 100 trusted public sources, including:- Malware Intelligence: abuse.ch, OpenPhish, CriticalPathSecurity, cert.pl
- Ad & Tracker Lists: EasyList, AdGuard, Disconnect.me, Steven Black
- Security Researchers: Mitchell Krogza, hagezi, Firebog
- Academic Sources: Université Toulouse 1 Capitole
- Community Projects: Pi-hole, Ultimate Hosts Blacklist, hBlock
View Full Source List
View Full Source List
See the complete list of sources in the README.md SOURCES section.
Important Considerations
Domain Removal Requests
If your domain appears in BlackWeb and you believe this is an error:- BlackWeb consolidates public sources and does not independently blacklist domains
- Use the
checksources.shtool to identify which upstream source(s) list your domain - Contact the maintainer of that source list to request removal
- Once removed from the upstream source, it will automatically disappear from BlackWeb in the next update
Use Cases
BlackWeb is ideal for:- Corporate Networks: Enforce acceptable use policies and protect against malware
- Educational Institutions: Filter inappropriate content and prevent malware infections
- ISPs: Provide customer protection and reduce malicious traffic
- Home Networks: Protect family members from harmful content
- Research: Analyze malicious domain patterns and DGA detection
Community Recognition
BlackWeb is referenced and used by:- Wikipedia - Blacklist (computing)
- OSINT Framework - Domain Blacklists section
- Zeltser - Free Blocklists of Suspected Malicious IPs and URLs
- Secrepo - Samples of Security Related Data
- Multiple security research papers and blog posts
