
Overview

Web tools enable agents to search the web and retrieve page content. The search tool supports multiple backends with automatic fallback, while the fetch tool extracts readable text from HTML pages.

Tools

web_search

Search the web and return a list of results with titles, URLs, and snippets.
Parameters:
  • query (string, required): Search query
  • max_results (integer, optional): Maximum number of results to return (default 5, max 10)
Example:
result = await web_search(
    query="Python asyncio best practices",
    max_results=5
)
Returns: Formatted list of search results:
Search results for: Python asyncio best practices

1. Asyncio Best Practices Guide
   https://docs.python.org/3/library/asyncio.html
   Learn the recommended patterns for writing async Python code...

2. Real Python - Async IO in Python
   https://realpython.com/async-io-python/
   A complete guide to asynchronous programming in Python...
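Because the tool returns plain text in the fixed layout shown above, an agent can recover structured records from it if needed. A minimal sketch, where `parse_results` is a hypothetical helper (not part of the tool's API) that assumes the numbered-result format above:

```python
import re

def parse_results(text: str) -> list[dict]:
    """Hypothetical parser for the formatted result list above.

    Assumes each entry is a numbered title line, an indented URL line,
    and an indented snippet line.
    """
    pattern = re.compile(
        r"^\s*(\d+)\.\s+(.+)\n\s+(https?://\S+)\n\s+(.+)$",
        re.MULTILINE,
    )
    return [
        {
            "rank": int(m.group(1)),
            "title": m.group(2).strip(),
            "url": m.group(3),
            "snippet": m.group(4).strip(),
        }
        for m in pattern.finditer(text)
    ]

sample = """Search results for: Python asyncio best practices

1. Asyncio Best Practices Guide
   https://docs.python.org/3/library/asyncio.html
   Learn the recommended patterns for writing async Python code...
"""
print(parse_results(sample))
```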

Search Backends

Brave Search (Primary)

When a Brave API key is configured, the tool uses the Brave Search API for fast, ad-free results:
[profiles.default]
brave_api_key = "BSA_xxxxxxxxxxxxxxxxxxxxxxxx"
Get an API key at: https://brave.com/search/api/

DuckDuckGo (Fallback)

If Brave is unavailable or fails, the tool automatically falls back to DuckDuckGo HTML scraping. No API key required.
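The fallback behavior can be sketched as a simple try/except between the two backends. `brave_search` and `duckduckgo_search` below are stand-ins for illustration, not grip's actual function names (the real implementations live in grip/tools/web.py):

```python
# Stand-in backends for illustration only.
def brave_search(query: str) -> list[str]:
    raise RuntimeError("Brave API key not configured")  # simulate failure

def duckduckgo_search(query: str) -> list[str]:
    return [f"DuckDuckGo result for {query!r}"]  # stand-in scraped result

def search_with_fallback(query: str) -> list[str]:
    try:
        return brave_search(query)
    except Exception:
        # Brave unavailable or errored: fall back to DuckDuckGo scraping.
        return duckduckgo_search(query)

print(search_with_fallback("test"))
```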

web_fetch

Fetch a URL and extract its main readable text content.
Parameters:
  • url (string, required): URL to fetch (must start with http:// or https://)
Example:
result = await web_fetch(
    url="https://github.com/grip/readme"
)
Returns: Extracted text content with HTML tags stripped, truncated at 50,000 characters.
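The 50,000-character cap can be pictured with a small sketch. The function name and the truncation marker here are assumptions for illustration; only the limit itself comes from the description above:

```python
MAX_FETCH_CHARS = 50_000  # documented cap on extracted text

def truncate(text: str, limit: int = MAX_FETCH_CHARS) -> str:
    """Illustrative truncation; the marker string is an assumption."""
    if len(text) <= limit:
        return text
    return text[:limit] + "\n[truncated]"

print(len(truncate("x" * 60_000)))
```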

Content Extraction

The web_fetch tool uses a minimal HTML-to-text converter that:
  1. Strips script/style tags: Removes JavaScript, CSS, and other non-readable elements
  2. Preserves structure: Adds line breaks for paragraphs, headings, and list items
  3. Collapses whitespace: Reduces multiple blank lines to single line breaks
  4. Skips navigation: Ignores nav, header, footer elements
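The four steps above can be sketched with the standard library's HTMLParser (which the implementation also builds on). Class and attribute names here are illustrative, not grip's actual code:

```python
from html.parser import HTMLParser
import re

SKIP = {"script", "style", "nav", "header", "footer"}   # non-readable elements
BREAK = {"p", "h1", "h2", "h3", "li", "br"}             # structural line breaks

class TextExtractor(HTMLParser):
    """Illustrative HTML-to-text converter; names are assumptions."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self.depth = 0  # nesting depth inside skipped elements

    def handle_starttag(self, tag, attrs):
        if tag in SKIP:
            self.depth += 1
        elif tag in BREAK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in SKIP and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth == 0:
            self.parts.append(data)

    def text(self):
        raw = "".join(self.parts)
        # Collapse runs of blank lines into single line breaks.
        return re.sub(r"\n\s*\n+", "\n", raw).strip()

p = TextExtractor()
p.feed("<nav>menu</nav><h1>Title</h1><p>Body <script>x()</script>text</p>")
print(p.text())
```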
Handled content types:
  • text/html or application/xhtml: Extracts readable text
  • text/*, application/json, application/xml: Returns raw content
  • Binary files: Reports file type and size
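The content-type dispatch above can be sketched as follows. The function name and return values are illustrative only; the real tool returns the extracted or raw text itself:

```python
def classify_content(content_type: str, body: bytes) -> str:
    """Sketch of the dispatch described above (names are assumptions)."""
    ct = content_type.split(";")[0].strip().lower()
    if ct == "text/html" or ct.startswith("application/xhtml"):
        return "html"    # extract readable text
    if ct.startswith("text/") or ct in ("application/json", "application/xml"):
        return "raw"     # return raw content
    return f"binary ({ct}, {len(body)} bytes)"  # report file type and size

print(classify_content("text/html; charset=utf-8", b"<p>hi</p>"))
```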

Configuration

Timeouts

HTTP requests use the following timeout settings (defined in code):
FETCH_TIMEOUT = httpx.Timeout(
    connect=10.0,   # Connection timeout
    read=30.0,      # Read timeout
    write=5.0,      # Write timeout
    pool=5.0        # Pool timeout
)

User Agent

All requests use a custom user agent to identify the bot:
grip/0.1 (AI Agent; +https://github.com/grip)

Redirects

The fetch tool follows up to 5 redirects automatically.
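Bounded redirect following can be sketched as a loop with a hop limit. `get` below is a stand-in for the HTTP call, returning a status code plus either a Location target or a body (the real tool relies on httpx's built-in redirect handling rather than this loop):

```python
MAX_REDIRECTS = 5  # documented redirect limit

def fetch(url, get, max_redirects=MAX_REDIRECTS):
    """Illustrative redirect loop; `get` returns (status, location_or_body)."""
    for _ in range(max_redirects + 1):
        status, value = get(url)
        if status in (301, 302, 303, 307, 308):
            url = value          # follow the Location target
        else:
            return url, status, value
    raise RuntimeError("Error: Too many redirects")

# Fake server: one redirect, then a 200 response.
pages = {"http://a": (302, "http://b"), "http://b": (200, "body")}
print(fetch("http://a", pages.get))
```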

Error Handling

Search Errors

result = await web_search(query="test")
# If both Brave and DuckDuckGo fail:
# Returns: "Error: Web search failed: [error details]"

Fetch Errors

HTTP errors:
result = await web_fetch(url="https://example.com/404")
# Returns: "Error: HTTP 404 fetching https://example.com/404"
Timeout errors:
result = await web_fetch(url="https://slow-site.com")
# Returns: "Error: Timeout fetching https://slow-site.com"
Invalid URL:
result = await web_fetch(url="not-a-url")
# Returns: "Error: URL must start with http:// or https://"

Best Practices

  1. Use specific queries for better search results
  2. Limit results to avoid excessive API usage (max 10)
  3. Check for errors in returned content before processing
  4. Prefer Brave for production (requires API key but more reliable)
  5. Handle timeouts gracefully for slow sites
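For practice 3, a minimal error check can key off the error formats shown above. This assumes (based on those examples) that all tool errors are returned as strings beginning with "Error:"; `is_tool_error` is a hypothetical helper, not part of the tool's API:

```python
def is_tool_error(result: str) -> bool:
    """Heuristic check based on the documented error formats."""
    return result.startswith("Error:")

print(is_tool_error("Error: Timeout fetching https://slow-site.com"))
```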

Implementation

Defined in grip/tools/web.py. Uses:
  • httpx.AsyncClient for async HTTP requests
  • HTMLParser subclass for text extraction
  • Regex-based DuckDuckGo result parsing
  • Automatic fallback between search backends
