Overview
Adapt continuously monitors your website for broken links and connectivity issues, helping you maintain a healthy site structure. The system tracks HTTP status codes, response times, and connection errors for every crawled URL.How It Works
Automatic Status Code Tracking
Every crawled page is checked for:- 404 Errors: Missing pages that need to be fixed or redirected
- 5xx Server Errors: Backend failures that affect user experience
- Timeouts: Pages that take too long to respond (default 30s timeout)
- Connection Failures: DNS errors, SSL issues, and network problems
- Redirect Chains: Multiple redirects that slow down page loads
Status Code Classification
Adapt categorises responses into:- 3xx redirects (tracked separately for performance analysis)
- 4xx client errors (404, 403, 401, etc.)
- 5xx server errors (500, 502, 503, etc.)
Task-Level Error Tracking
Each crawl task stores detailed error information:Accessing Broken Link Data
Via Dashboard
The dashboard displays real-time broken link counts:- Failed Tasks Counter: Shows total broken links in a job
- Task Status Filter: Filter to view only failed tasks
- Error Details: Click any failed task to see the specific error
Via API
Response Format
Common Error Types
404 Not Found
Cause: Page deleted or URL typo Solution: Create 301 redirect to current page or remove linkConnection Timeout
Cause: Page takes longer than 30 seconds to respond Solution: Optimise server response time or increase cache TTLSSL Certificate Errors
Cause: Expired or invalid SSL certificate Solution: Renew certificate and ensure proper chain configurationDNS Resolution Failures
Cause: Domain doesn’t resolve or DNS propagation issues Solution: Check DNS records and wait for propagationLink Discovery
Broken links are discovered through multiple sources:Sitemap URLs
All URLs listed insitemap.xml are automatically crawled and checked.
Discovered Links
Whenfind_links is enabled, Adapt extracts links from:
- Header navigation:
<header>elements - Footer navigation:
<footer>elements - Body content: Main content area
Hidden Link Filtering
Adapt automatically skips hidden elements to focus on user-facing links:display: noneorvisibility: hiddenstylesaria-hidden="true"attributes- Common hiding classes:
.hidden,.d-none,.sr-only
Retry Logic
Temporary failures are automatically retried:- Transient Errors: Network issues, timeouts
- Retry Count: Up to 3 attempts per URL
- Exponential Backoff: Increasing delays between retries
- Status Tracking: Each retry is logged separately
Permanent errors like 404s are not retried to avoid wasting resources.
Performance Impact
Broken link detection has minimal overhead:- No Extra Requests: Uses existing crawl data
- Efficient Storage: Only failed tasks store error details
- Real-time Updates: Status tracked as pages are crawled
Best Practices
Regular Monitoring
Schedule recurring crawls to catch broken links before users do
Fix at Source
Update links in your CMS rather than relying on redirects
Monitor External Links
Check links to external sites that might change or break
Review After Migrations
Always run a crawl after moving or restructuring content
Related Features
- Performance Monitoring - Track slow pages alongside broken links
- Scheduled Crawls - Automate regular link checking
- Cache Warming - Ensure working pages are cached