Overview
Technical SEO rules ensure search engines can crawl, index, and understand your website properly. These rules cover infrastructure, crawlability, and server configuration.
Sitemap Rules
Rule: crawl/sitemap-valid
What it checks:
- XML sitemap exists and is accessible
- Valid XML format
- URLs are absolute (not relative)
- No errors in sitemap structure
- Sitemap is discoverable in robots.txt
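The format checks above can be approximated offline. The following is a minimal sketch, not the rule's actual implementation: `validate_sitemap` is a hypothetical helper that parses a sitemap string with the standard library, confirms the root element is a sitemap `urlset` or `sitemapindex`, and flags any `<loc>` value that is not an absolute http(s) URL.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def validate_sitemap(xml_text: str) -> list[str]:
    """Return a list of problems found in a sitemap document (empty if none)."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        return [f"invalid XML: {exc}"]

    if root.tag not in (f"{SITEMAP_NS}urlset", f"{SITEMAP_NS}sitemapindex"):
        return [f"unexpected root element: {root.tag}"]

    problems = []
    for loc in root.iter(f"{SITEMAP_NS}loc"):
        url = (loc.text or "").strip()
        parts = urlparse(url)
        # An absolute URL needs both a scheme and a host.
        if parts.scheme not in ("http", "https") or not parts.netloc:
            problems.append(f"URL is not absolute: {url!r}")
    return problems
```

An empty list means the document passed these particular checks; it does not prove the sitemap is reachable or referenced from robots.txt.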
Common Issues & Fixes
Issue: 8 sitemaps return unknown format errors
Fix:
- Generate a valid XML sitemap
- Serve it at /sitemap.xml only
- Remove or redirect all other sitemap URLs
- Submit to Google Search Console
For large sites (>50,000 URLs), use a sitemap index file that references multiple smaller sitemaps.
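A sitemap index is itself a small XML file listing the child sitemaps. As a rough sketch (the function name `build_sitemap_index` and the example.com URLs are illustrative assumptions), one could be generated like this:

```python
import xml.etree.ElementTree as ET

SITEMAP_XMLNS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap_index(sitemap_urls: list[str]) -> str:
    """Build a sitemap index document that references each child sitemap URL."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_XMLNS)
    for url in sitemap_urls:
        entry = ET.SubElement(root, "sitemap")
        ET.SubElement(entry, "loc").text = url
    # Serialize as UTF-8 bytes with an XML declaration, then decode to str.
    return ET.tostring(root, encoding="utf-8", xml_declaration=True).decode("utf-8")


if __name__ == "__main__":
    print(build_sitemap_index([
        "https://example.com/sitemap-pages.xml",
        "https://example.com/sitemap-posts.xml",
    ]))
```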
Rule: crawl/sitemap-size
What it checks:
- Sitemap file size under 50MB (uncompressed)
- No more than 50,000 URLs per sitemap
- Proper compression (gzip recommended)
How to Handle Large Sitemaps
- Split by content type (pages, posts, products)
- Keep each sitemap under 50,000 URLs
- Compress with gzip to reduce bandwidth
- Update lastmod dates when content changes
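The splitting and compression steps can be sketched as follows. This is an illustration under simple assumptions: URLs are held in memory as a list, `lastmod` is omitted for brevity, and the helper names are hypothetical. The 50,000 URL and 50MB limits are the ones stated above.

```python
import gzip
from xml.sax.saxutils import escape

MAX_URLS = 50_000
MAX_BYTES = 50 * 1024 * 1024  # 50MB, measured before compression


def chunk_urls(urls: list[str], size: int = MAX_URLS):
    """Yield consecutive slices of at most `size` URLs."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


def render_sitemap(urls: list[str]) -> bytes:
    """Render one uncompressed sitemap document as UTF-8 bytes."""
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ]
    lines += [f"  <url><loc>{escape(u)}</loc></url>" for u in urls]
    lines.append("</urlset>")
    return "\n".join(lines).encode("utf-8")


def build_compressed_sitemaps(urls: list[str], size: int = MAX_URLS) -> list[bytes]:
    """Split URLs into sitemaps, enforce the size limit, and gzip each one."""
    blobs = []
    for chunk in chunk_urls(urls, size):
        raw = render_sitemap(chunk)
        if len(raw) > MAX_BYTES:
            raise ValueError("sitemap exceeds 50MB uncompressed; use smaller chunks")
        blobs.append(gzip.compress(raw))
    return blobs
```

Each returned blob would be written to its own `.xml.gz` file and listed in the sitemap index.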
Robots.txt
Rule: crawl/robots-txt
What it checks:
- robots.txt exists at /robots.txt
- Valid syntax
- Sitemap reference included
- No accidental disallows
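Two of these checks, the sitemap reference and accidental disallows, can be tried out with Python's standard `urllib.robotparser`. The sketch below assumes the robots.txt body is already available as a string and that you supply your own list of important paths; `audit_robots` is a hypothetical helper, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser


def audit_robots(robots_text: str, important_paths: list[str], agent: str = "*") -> dict:
    """Report which important paths are disallowed and which sitemaps are declared."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    blocked = [p for p in important_paths if not parser.can_fetch(agent, p)]
    # site_maps() returns None when no Sitemap line is present.
    sitemaps = parser.site_maps() or []
    return {"blocked": blocked, "sitemaps": sitemaps}


if __name__ == "__main__":
    robots = (
        "User-agent: *\n"
        "Disallow: /admin/\n"
        "Disallow: /blog/\n"
        "Sitemap: https://example.com/sitemap.xml\n"
    )
    print(audit_robots(robots, ["/", "/blog/post-1", "/products/"]))
```

A non-empty `blocked` list for pages you want indexed points to an accidental disallow; an empty `sitemaps` list means the sitemap reference is missing.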
Common Issues
Redirects & Status Codes
Rule: links/redirect-chains
What it checks:
- No redirect chains (A → B → C)
- Redirects use 301 (permanent) or 302 (temporary) correctly
- No redirect loops
- Minimal redirect hops
Fix Redirect Chains
- Audit all redirects with a crawler
- Update redirects to point directly to final destination
- Use 301 for permanent moves, 302 for temporary
- Avoid meta refresh redirects (use server-side 301/302)
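Pointing every redirect directly at its final destination amounts to following each chain to the end. A minimal sketch, assuming the crawl has already produced a source-to-target mapping (the function names and the `max_hops` default are illustrative):

```python
def resolve_redirect(url: str, redirects: dict[str, str], max_hops: int = 10) -> tuple[str, int]:
    """Follow redirects from `url`; return the final destination and hop count."""
    seen = {url}
    hops = 0
    while url in redirects:
        url = redirects[url]
        hops += 1
        if url in seen:
            raise ValueError(f"redirect loop detected at {url}")
        if hops > max_hops:
            raise ValueError("too many redirect hops")
        seen.add(url)
    return url, hops


def flatten_redirects(redirects: dict[str, str]) -> dict[str, str]:
    """Rewrite every redirect so it points straight at its final destination."""
    return {src: resolve_redirect(src, redirects)[0] for src in redirects}


if __name__ == "__main__":
    chain = {"/a": "/b", "/b": "/c"}
    print(resolve_redirect("/a", chain))  # ('/c', 2)
    print(flatten_redirects(chain))       # {'/a': '/c', '/b': '/c'}
```

Any source whose hop count is greater than 1 is part of a chain, and a `ValueError` marks a loop that needs manual attention.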
Rule: links/broken-links
What it checks:
- No 404 errors on internal links
- No 500 server errors
- All linked resources are accessible
Common HTTP Status Codes
| Code | Meaning | Impact |
|---|---|---|
| 200 | OK | ✅ Page loads successfully |
| 301 | Permanent Redirect | ⚠️ Should point to final URL |
| 302 | Temporary Redirect | ⚠️ Okay for short-term moves |
| 404 | Not Found | ❌ Broken link, bad UX |
| 410 | Gone (Permanent) | ⚠️ Better than 404 for removed content |
| 500 | Server Error | ❌ Critical issue, fix immediately |
| 503 | Service Unavailable | ❌ Temporary outage |
URL Structure
Rule: url/length
What it checks:
- URLs under 100 characters (optimal)
- No excessive parameters
- Clean, readable structure
URL Best Practices
- Use hyphens (not underscores) to separate words
- Keep URLs under 100 characters when possible
- Use lowercase letters only
- Include target keywords
- Avoid special characters and spaces
- Use semantic hierarchy: /category/subcategory/page
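Several of these practices are mechanical enough to lint. The sketch below is one possible interpretation, not the rule's real logic: `check_url` is a hypothetical helper that checks length against the whole URL and applies the character rules to the path only.

```python
import re
from urllib.parse import urlparse


def check_url(url: str, max_length: int = 100) -> list[str]:
    """Return the best-practice violations found in a URL (empty if none)."""
    issues = []
    path = urlparse(url).path
    if len(url) > max_length:
        issues.append(f"longer than {max_length} characters")
    if "_" in path:
        issues.append("uses underscores instead of hyphens")
    if path != path.lower():
        issues.append("contains uppercase letters")
    # Anything outside letters, digits, and - . _ ~ / counts as special here,
    # which includes raw spaces and percent-encoded characters.
    if re.search(r"[^A-Za-z0-9\-._~/]", path):
        issues.append("contains spaces or special characters")
    return issues


if __name__ == "__main__":
    print(check_url("https://example.com/blog/technical-seo-guide"))
    print(check_url("https://example.com/Blog/My_Post"))
```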
Rule: url/parameters
What it checks:
- Minimal use of query parameters for content
- Tracking parameters don’t create duplicate content
- Proper use of canonical tags with parameters
Handling URL Parameters
Google Search Console: URL Parameters Tool
Tell Google how to handle parameters:
- sort, filter, page: Changes content → Crawl every URL
- utm_source, sessionid: Doesn’t change content → Ignore parameter
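The same split between content-changing and tracking parameters can be applied when computing the URL for a canonical tag. A small sketch, assuming `utm_source` and `sessionid` are the only parameters to ignore (extend `TRACKING_PARAMS` for your own site; the helper name is hypothetical):

```python
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse

# Parameters that do not change page content (assumed list, taken from the examples above).
TRACKING_PARAMS = {"utm_source", "sessionid"}


def canonical_url(url: str) -> str:
    """Drop tracking parameters while keeping parameters that change content."""
    parts = urlparse(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in TRACKING_PARAMS
    ]
    return urlunparse(parts._replace(query=urlencode(kept)))


if __name__ == "__main__":
    print(canonical_url("https://example.com/shoes?sort=price&utm_source=news&sessionid=abc"))
    # https://example.com/shoes?sort=price
```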
Crawlability
Rule: crawl/depth
What it checks:
- Important pages are within 3 clicks of homepage
- No orphaned pages (pages with no internal links)
- Proper internal linking structure
Fix Deep Page Issues
Symptoms:
- Important pages not ranking
- Low crawl frequency
- Pages not appearing in search results
Fixes:
- Add links from homepage to important pages
- Create category/hub pages that link to related content
- Use breadcrumbs for navigation
- Add internal links within content
- Create an HTML sitemap
- Fix orphaned pages (add at least one internal link)
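Click depth and orphaned pages both fall out of a breadth-first search over the internal link graph. The sketch below assumes the graph is already available as a dictionary mapping each page to the pages it links to; the function names and the 3-click default are illustrative.

```python
from collections import deque


def click_depths(links: dict[str, list[str]], home: str = "/") -> dict[str, int]:
    """Breadth-first search from the homepage; returns the click depth of each reachable page."""
    depths = {home: 0}
    queue = deque([home])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths


def audit_depth(links: dict[str, list[str]], home: str = "/", max_depth: int = 3):
    """Return (pages deeper than max_depth, known pages unreachable from home)."""
    depths = click_depths(links, home)
    all_pages = set(links) | {t for targets in links.values() for t in targets}
    deep = sorted(p for p, d in depths.items() if d > max_depth)
    orphaned = sorted(all_pages - set(depths))
    return deep, orphaned


if __name__ == "__main__":
    site = {"/": ["/a"], "/a": ["/b"], "/b": ["/c"], "/c": ["/d"], "/d": [], "/lonely": ["/"]}
    print(audit_depth(site))  # (['/d'], ['/lonely'])
```

Pages in the first list are candidates for hub pages or homepage links; pages in the second need at least one internal link.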
Index Status
Rule: crawl/indexability
What it checks:
- Pages aren’t blocked by robots.txt
- No noindex meta tags on important pages
- Pages are accessible to crawlers
- No authentication walls for public content
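The noindex check can be reproduced on a page's HTML with the standard library. This sketch only looks at `<meta name="robots">` and `<meta name="googlebot">` tags; it does not inspect the `X-Robots-Tag` HTTP header, and the class and function names are hypothetical.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from robots meta tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key: (value or "") for key, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            content = attr.get("content", "")
            self.directives.update(d.strip().lower() for d in content.split(","))


def has_noindex(html: str) -> bool:
    """True if the page carries a noindex directive in a robots meta tag."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives
```

Running `has_noindex` over the pages you expect to rank gives a quick list of accidental exclusions.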
Common Indexability Issues
Pages that are typically kept out of the index on purpose:
- Thank you pages
- Search results pages
- Filtering/sorting variants
- Login/logout pages
- Draft/preview content
JavaScript & Rendering
Rule: crawl/rendering
What it checks:
- Content visible without JavaScript
- Critical content server-side rendered
- No infinite scroll issues
- Lazy loading implemented correctly
Fix SPA Rendering Issues
Problem: Thin content (0 words detected)
Likely a Single Page Application (SPA) rendering issue: crawlers see empty content.
Solutions:
- Server-Side Rendering (SSR): Render HTML on server
- Static Site Generation (SSG): Pre-render pages at build time
- Dynamic Rendering: Serve static HTML to bots, JS to users
- Prerendering Service: Use a service such as Prerender.io
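To see what a crawler that does not execute JavaScript receives, count the words in the raw HTML response. The sketch below is an approximation rather than the audit's own word counter: it ignores text inside `script`, `style`, `template`, and `title` elements, and the names are hypothetical.

```python
from html.parser import HTMLParser


class VisibleTextParser(HTMLParser):
    """Counts words in text nodes, skipping elements that are not page content."""

    SKIPPED = {"script", "style", "template", "title"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.words = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.words += len(data.split())


def raw_word_count(html: str) -> int:
    """Word count of the HTML as delivered, before any JavaScript runs."""
    parser = VisibleTextParser()
    parser.feed(html)
    parser.close()
    return parser.words
```

An SPA shell that contains only an empty root `div` and script tags yields 0, which matches the "0 words detected" symptom; the same page served through SSR or SSG should yield its real word count.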
Canonical URL Chains
Rule: core/canonical-chain
What it checks:
- No canonical chains (Page A → Page B → Page C)
- Canonical URLs are directly accessible
- Self-referencing canonicals on original pages
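A canonical chain can be detected by following each page's declared canonical until it reaches a self-referencing page. A minimal sketch, assuming a crawl has produced a mapping from each URL to its declared canonical URL (the function names are illustrative):

```python
def canonical_chain(url: str, canonicals: dict[str, str]) -> list[str]:
    """Follow canonical declarations from `url` until a self-referencing or unknown page."""
    chain = [url]
    while True:
        target = canonicals.get(chain[-1])
        if target is None or target == chain[-1]:
            return chain
        if target in chain:
            raise ValueError(f"canonical loop detected at {target}")
        chain.append(target)


def find_canonical_chains(canonicals: dict[str, str]) -> dict[str, list[str]]:
    """Return pages whose canonical points at a page that itself points elsewhere."""
    flagged = {}
    for url in canonicals:
        chain = canonical_chain(url, canonicals)
        if len(chain) > 2:  # more than one hop: A -> B -> C
            flagged[url] = chain
    return flagged


if __name__ == "__main__":
    declared = {"/a": "/b", "/b": "/c", "/c": "/c"}
    print(find_canonical_chains(declared))  # {'/a': ['/a', '/b', '/c']}
```

For each flagged page, the last entry in the chain is the URL its canonical tag should point to directly.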
Fix Canonical Chains
Related Pages
Core SEO Rules
Title tags, meta descriptions, H1 headings, and Open Graph
Performance Rules
Page speed, compression, caching, and optimization
Running Audits
Learn how to run website audits and identify technical issues
Interpreting Results
Understand health scores and prioritize fixes