Best Practices

Overview

This guide covers proven strategies for monitoring website health, optimising performance, and maintaining high availability with Adapt.

Scheduling Strategy

Choose the Right Interval

Adapt supports recurring crawls at 6, 12, 24, or 48-hour intervals:

6-Hour Interval

Best for:

High-traffic production sites
E-commerce platforms
Sites with frequent content updates

Considerations:

Uses more daily page quota
Provides rapid issue detection
Keeps cache consistently warm

12-Hour Interval

Best for:

Business websites
Marketing sites with regular updates
SaaS application landing pages

Considerations:

Balanced quota usage
Twice-daily health checks
Good cache coverage

24-Hour Interval

Best for:

Corporate websites
Portfolio sites
Documentation sites
Blogs

Considerations:

Efficient quota usage
Daily health monitoring
Standard recommendation

48-Hour Interval

Best for:

Low-traffic sites
Archive sites
Development environments

Considerations:

Minimal quota usage
Less frequent updates
Lower cache coverage

Start with 24-hour intervals and adjust based on your site’s update frequency and traffic patterns.

Cache Warming Strategy

When to Warm Cache

After Publishing: Run a crawl immediately after deploying new content
Before Traffic Spikes: Warm cache before expected high-traffic events
After Cache Purges: Re-warm after manual cache clearing
Regular Maintenance: Schedule recurring crawls to keep cache fresh

Priority-Based Warming

Connect Google Analytics to enable priority-based cache warming:

Connect Analytics

Link your Google Analytics property in Organisation Settings.

Automatic Prioritisation

Adapt automatically prioritises high-traffic pages when warming cache.

Verify Coverage

Check job results to confirm your most-visited pages show cache HITs.

Without Analytics, Adapt warms the homepage first, then processes pages in discovery order.

Crawl Configuration

Sitemap vs Link Crawling

Use both methods for comprehensive coverage:

{
  "domain": "example.com",
  "options": {
    "use_sitemap": true,
    "find_links": true,
    "max_pages": 0,
    "concurrency": 20
  }
}

Method	Pros	Cons
Sitemap	Fast, comprehensive, respects your structure	Requires sitemap.xml
Link Crawling	Finds unlisted pages, validates internal links	Slower, may miss isolated pages

Enable both sitemap and link crawling to find all pages and validate all internal links.

Setting Max Pages

Production Sites

Set max_pages: 0 (unlimited) for complete site coverage:

{
  "max_pages": 0
}

Testing & Development

Use a limit for initial testing:

{
  "max_pages": 50
}

Large Sites (1000+ Pages)

Consider multiple focused crawls:

// Crawl 1: Homepage and main sections
{
  "domain": "example.com",
  "max_pages": 500
}

// Crawl 2: Blog archive
{
  "domain": "example.com",
  "max_pages": 500
}

Concurrency Settings

Adjust concurrency based on your hosting:

Hosting Type	Recommended Concurrency
Shared hosting	5-10
VPS / Cloud	20-30
Dedicated server	30-50
CDN (Cloudflare, etc.)	50-100

Higher concurrency speeds up crawls but increases server load. Start conservative and increase if your server handles it well.

Monitoring & Alerts

Set Up Slack Notifications

Install Slack App

Install the Adapt Slack app from your workspace settings.

Authorise Notifications

Grant permission for Adapt to send you direct messages.

Automatic Alerts

Receive notifications when:

Jobs complete
Jobs fail
Broken links are detected
Performance degrades

Monitor Usage Limits

Check usage regularly to avoid hitting limits:

curl https://adapt.app.goodnative.co/v1/usage \
  -H "Authorization: Bearer YOUR_JWT_TOKEN"

Set a calendar reminder to review usage weekly, or build automated alerts using the API.

Multi-Organisation Workflows

Organise by Client or Project

Agency Use Case

Create one organisation per client:

Client A → Organisation “Client A”
Client B → Organisation “Client B”
Internal → Organisation “Agency Internal”

Benefits:

Isolated data and limits
Easy client handoff
Clear billing separation

Multi-Site Business

Create organisations by environment or brand:

Production sites → Organisation “Production”
Staging sites → Organisation “Staging”
Partner sites → Organisation “Partners”

Benefits:

Environment isolation
Separate quota pools
Different team access

Team Management

Assign Multiple Admins

Always have at least 2 admins per organisation:

curl -X PATCH https://adapt.app.goodnative.co/v1/organisations/members/user_456 \
  -H "Authorization: Bearer YOUR_JWT_TOKEN" \
  -d '{"role": "admin"}'

Use Member Role for Limited Access

Grant “member” role to team members who need to view results but not manage settings.

Performance Optimisation

Identify Bottlenecks

Run Baseline Crawl

Create a job to establish baseline performance metrics.

Review Slow Pages

Export slow pages report and analyse common patterns:

Large images
Slow database queries
Third-party scripts
Cache misses

Implement Fixes

Address issues starting with highest-traffic pages.

Verify Improvements

Run another crawl and compare response times.

Cache Optimisation

Maximise cache hit ratio:

Enable Cache Warming: Run crawls after publishing
Monitor Hit Ratio: Aim for 80%+ cache hits
Fix Cache Misses: Investigate pages with consistent MISS status
Warm High-Traffic Pages: Prioritise pages with most visitors

Pages showing DYNAMIC cache status are expected — these are typically authenticated pages, search results, or personalised content.

Broken Link Management

Weekly Review Process

Export Broken Links

Download the broken links report from your latest crawl.

Categorise Issues

Group broken links by:

Internal vs external
High-traffic vs low-traffic
Critical vs non-critical

Fix High-Priority Issues

Address broken links on high-traffic pages first.

Update Links

Fix broken internal links by updating references.

Verify Fixes

Run another crawl to confirm all links resolve.

Proactive Prevention

Pre-Delete Checks

Before deleting pages, search your content for internal links to that page and update or remove them.

Link Validation

Run a crawl in staging before deploying to production.

Regular Audits

Schedule quarterly reviews of external links, as third-party sites change frequently.

API Integration Patterns

CI/CD Integration

Trigger crawls automatically after deployments:

# GitHub Actions example
- name: Trigger Adapt Crawl
  run: |
    curl -X POST https://adapt.app.goodnative.co/v1/jobs \
      -H "Authorization: Bearer ${{ secrets.ADAPT_API_KEY }}" \
      -H "Content-Type: application/json" \
      -d '{
        "domain": "example.com",
        "options": {"use_sitemap": true, "find_links": true}
      }'

Automated Reporting

Build custom reports using the API:

#!/bin/bash
# Weekly broken links report

JOB_ID=$(curl -s https://adapt.app.goodnative.co/v1/jobs?limit=1 \
  -H "Authorization: Bearer $TOKEN" | jq -r '.data.jobs[0].id')

curl "https://adapt.app.goodnative.co/v1/jobs/$JOB_ID/export?type=broken-links" \
  -H "Authorization: Bearer $TOKEN" > broken-links-report.json

# Send to Slack, email, or dashboard

Monitoring Scripts

Track performance trends:

import requests
import json
from datetime import datetime

def get_job_metrics(api_key):
    headers = {"Authorization": f"Bearer {api_key}"}
    response = requests.get(
        "https://adapt.app.goodnative.co/v1/jobs?limit=10",
        headers=headers
    )
    
    jobs = response.json()["data"]["jobs"]
    for job in jobs:
        print(f"Job {job['id']}: {job['progress']['percentage']}% complete")
        print(f"Avg response time: {job.get('stats', {}).get('avg_response_time')}ms")

get_job_metrics("your_api_key_here")

Security Best Practices

Use API Keys for Integrations

Create scoped API keys instead of using JWT tokens in automation:

curl -X POST https://adapt.app.goodnative.co/v1/auth/api-keys \
  -H "Authorization: Bearer YOUR_JWT_TOKEN" \
  -d '{"name": "CI/CD Pipeline", "scopes": ["jobs:read", "jobs:create"]}'

Rotate Keys Regularly

Rotate API keys quarterly and immediately after team member departures.

Restrict Member Access

Use the “member” role for users who only need to view results, reserving “admin” for users who manage settings and billing.

Troubleshooting Common Issues

Job Takes Too Long

Solutions:

Reduce concurrency to respect server rate limits
Set a max_pages limit for testing
Check if robots.txt specifies a high crawl-delay
Verify your hosting can handle the load

High Cache Miss Rate

Solutions:

Run crawls more frequently to keep cache warm
Check CDN settings for cache TTL
Verify pages are cacheable (not authenticated)
Review cache-control headers

Many 404 Errors

Solutions:

Check sitemap.xml for outdated URLs
Review recent content deletions
Validate internal link updates
Check for broken external links

Hitting Usage Limits

Solutions:

Reduce crawl frequency
Set max_pages limits
Upgrade to a higher plan
Stagger crawls across multiple days

Get Started

Core Features

Integrations

Guides

Best Practices

Overview

Scheduling Strategy

Choose the Right Interval

Cache Warming Strategy

When to Warm Cache

Priority-Based Warming

Crawl Configuration

Sitemap vs Link Crawling

Setting Max Pages

Concurrency Settings

Monitoring & Alerts

Set Up Slack Notifications

Monitor Usage Limits

Multi-Organisation Workflows

Organise by Client or Project

Team Management

Performance Optimisation

Identify Bottlenecks

Cache Optimisation

Broken Link Management

Weekly Review Process

Proactive Prevention

API Integration Patterns

CI/CD Integration

Automated Reporting

Monitoring Scripts

Security Best Practices

Troubleshooting Common Issues

Build docs developers (and LLMs) love

Get Started

Core Features

Integrations

Guides

​Overview

​Scheduling Strategy

​Choose the Right Interval

​Cache Warming Strategy

​When to Warm Cache

​Priority-Based Warming

​Crawl Configuration

​Sitemap vs Link Crawling

​Setting Max Pages

​Concurrency Settings

​Monitoring & Alerts

​Set Up Slack Notifications

​Monitor Usage Limits

​Multi-Organisation Workflows

​Organise by Client or Project

​Team Management

​Performance Optimisation

​Identify Bottlenecks

​Cache Optimisation

​Broken Link Management

​Weekly Review Process

​Proactive Prevention

​API Integration Patterns

​CI/CD Integration

​Automated Reporting

​Monitoring Scripts

​Security Best Practices

​Troubleshooting Common Issues

​Related Resources

Build docs developers (and LLMs) love

Overview

Scheduling Strategy

Choose the Right Interval

Cache Warming Strategy

When to Warm Cache

Priority-Based Warming

Crawl Configuration

Sitemap vs Link Crawling

Setting Max Pages

Concurrency Settings

Monitoring & Alerts

Set Up Slack Notifications

Monitor Usage Limits

Multi-Organisation Workflows

Organise by Client or Project

Team Management

Performance Optimisation

Identify Bottlenecks

Cache Optimisation

Broken Link Management

Weekly Review Process

Proactive Prevention

API Integration Patterns

CI/CD Integration

Automated Reporting

Monitoring Scripts

Security Best Practices

Troubleshooting Common Issues

Related Resources