What It Does
Browser Hand is an autonomous web browser agent that interacts with real websites on your behalf. It can navigate to URLs, click buttons and links, fill forms, read page content, take screenshots, and complete multi-step workflows, with mandatory approval for any purchase or payment. This is web automation with safety guardrails. Browser Hand will never spend your money without explicit confirmation.
Key Features
- Multi-step workflows: Search products, fill forms, navigate complex sites
- Purchase approval gate: Mandatory confirmation before any payment
- Session persistence: Cookies and login state maintained across messages
- Visual verification: Screenshot after every action (optional)
- Error recovery: Handles page timeouts and missing elements; stops and notifies you when a CAPTCHA appears
- Cross-platform: Works on Windows, macOS, Linux
Activation
Requirements
Install Python 3
Platforms: macOS, Windows, Linux (Debian/Ubuntu). Alternatively, download from python.org/downloads.
Estimated time: 2-5 minutes
Install Playwright
Configuration Settings
Run the browser without a visible window. Recommended for servers. Set to false for debugging (you’ll see the browser in action).
CRITICAL: Require explicit user confirmation before completing any purchase or payment. Should NEVER be disabled unless you fully trust the workflow.
Maximum number of page navigations allowed per task to prevent runaway browsing:
- 10: Conservative
- 20: Balanced (default)
- 50: Thorough
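The navigation cap can be pictured as a simple per-task counter. The sketch below is illustrative only: the `NavigationBudget` class, its method names, and the exception type are hypothetical and are not taken from Browser Hand itself. Only the default of 20 comes from the settings above.

```python
class NavigationBudgetExceeded(RuntimeError):
    """Raised when a task tries to navigate past its configured cap."""


class NavigationBudget:
    """Hypothetical per-task counter for page navigations."""

    def __init__(self, max_navigations: int = 20) -> None:  # 20 = "Balanced" default
        if max_navigations < 1:
            raise ValueError("max_navigations must be at least 1")
        self.max_navigations = max_navigations
        self.used = 0

    def record(self, url: str) -> int:
        """Count one navigation and return how many are still allowed."""
        if self.used >= self.max_navigations:
            raise NavigationBudgetExceeded(
                f"limit of {self.max_navigations} navigations reached before {url}"
            )
        self.used += 1
        return self.max_navigations - self.used
```

Calling `record()` before every navigation makes a runaway task fail loudly instead of browsing indefinitely.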
How long to wait after clicking or navigating for the page to settle:
- auto: Auto-detect (wait for DOM) - default
- 1: 1 second
- 3: 3 seconds
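One way to read the wait setting is as either a fixed delay or a request to wait on the page itself. The helper below is a minimal sketch under that assumption; the function name `settle_seconds` is hypothetical, and how Browser Hand actually interprets the value is not specified here.

```python
from typing import Optional


def settle_seconds(setting: str) -> Optional[float]:
    """Return a fixed delay in seconds, or None for 'auto' (wait on the DOM)."""
    value = setting.strip().lower()
    if value == "auto":
        return None
    seconds = float(value)  # raises ValueError for anything non-numeric
    if seconds < 0:
        raise ValueError("wait time cannot be negative")
    return seconds
```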
Automatically take a screenshot after every click/navigate for visual verification. Useful for debugging but generates many files.
Required Tools
Browser Hand requires access to these tools (all built-in):
- browser_navigate, browser_click, browser_type, browser_screenshot, browser_read_page, browser_close: Browser automation primitives
- web_search, web_fetch: Fallback for non-interactive research
- memory_store, memory_recall: State persistence
- knowledge_add_entity, knowledge_add_relation, knowledge_query: Workflow tracking
- schedule_create, schedule_list, schedule_delete: Scheduled workflows
- file_write, file_read: Screenshot and data storage
System Prompt Overview
Browser Hand operates in 5 phases:
Understand the Task
Parses user request: What website(s) to visit? What information to find or action to perform? What are success criteria?
Navigate & Observe
Uses browser_navigate to go to target URL. Reads page content to understand layout. Identifies relevant elements (buttons, links, forms, search boxes).
Interact
Uses browser_click for buttons and links (CSS selectors or visible text). Uses browser_type for filling form fields. Uses browser_read_page after each action to see updated state.
MANDATORY Purchase/Payment Approval
CRITICAL RULE: Before completing ANY purchase, payment, or form submission involving money:
- Summarize what you are about to buy/pay for
- Show the total cost
- List all items in cart
- STOP and ask user for explicit confirmation
- Only proceed after receiving clear approval
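The approval rule above can be sketched as a small gate function. This is an illustration under stated assumptions, not Browser Hand's implementation: `CartItem`, `summarize`, `approved`, and the set of accepted replies are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Callable, List


@dataclass(frozen=True)
class CartItem:
    name: str
    unit_price: Decimal
    quantity: int = 1


def summarize(items: List[CartItem]) -> str:
    """List every item in the cart, followed by the total cost."""
    lines = [f"- {i.quantity} x {i.name} @ ${i.unit_price:.2f}" for i in items]
    total = sum((i.unit_price * i.quantity for i in items), Decimal("0"))
    lines.append(f"Total: ${total:.2f}")
    return "\n".join(lines)


APPROVALS = {"yes", "approve", "confirm"}  # hypothetical accepted replies


def approved(items: List[CartItem], ask: Callable[[str], str]) -> bool:
    """Show the summary, then return True only on a clear approval."""
    reply = ask(summarize(items) + "\nProceed with this purchase? (yes/no) ")
    return reply.strip().lower() in APPROVALS
```

Anything other than an explicit approval, including an empty reply, is treated as a refusal. In a terminal session, `approved(items, input)` would prompt the user directly.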
Usage Examples
Product Search
- Navigate to Amazon
- Find the search box and type “USB-C cables”
- Filter results by price (under $20)
- Extract top 3 products with prices and ratings
- Report results (does NOT add to cart)
Price Comparison
- Navigate to each site
- Search for “AirPods Pro”
- Extract prices
- Report comparison table
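Once prices have been read from each page, the last two steps reduce to parsing price strings and sorting. The sketch below assumes prices arrive as text such as "$249.00"; the function names are hypothetical, and any site names passed in are placeholders rather than real results.

```python
import re
from decimal import Decimal
from typing import Dict, List, Tuple

# A digit, then digits or thousands separators, then optional cents.
_PRICE = re.compile(r"\d[\d,]*(?:\.\d{1,2})?")


def parse_price(text: str) -> Decimal:
    """Extract the first price-like number from a string."""
    match = _PRICE.search(text)
    if match is None:
        raise ValueError(f"no price found in {text!r}")
    return Decimal(match.group(0).replace(",", ""))


def comparison_rows(raw: Dict[str, str]) -> List[Tuple[str, Decimal]]:
    """Map site -> raw price text into (site, price) rows, cheapest first."""
    rows = [(site, parse_price(text)) for site, text in raw.items()]
    return sorted(rows, key=lambda row: row[1])


def render(rows: List[Tuple[str, Decimal]]) -> str:
    """Render the rows as a pipe table."""
    header = "| Site | Price |\n|---|---|"
    body = "\n".join(f"| {site} | ${price:.2f} |" for site, price in rows)
    return f"{header}\n{body}"
```

`Decimal` is used instead of `float` so that currency values compare and print exactly.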
Form Filling
- Navigate to the contact page
- Fill each field
- Screenshot before submitting (for verification)
- Submit the form
- Confirm submission success
E-commerce with Approval Gate
- Search for running shoes on Nike.com
- Filter by size 10 and price <$100
- Select a product
- Add to cart
- STOP and report: “Found Nike Revolution 6 ($89.99, size 10). Ready to add to cart. Approve?”
- Wait for your confirmation before proceeding
CSS Selector Reference
Browser Hand uses CSS selectors to identify page elements:
Basic Selectors
| Selector | Description | Example |
|---|---|---|
| #id | By ID | #checkout-btn |
| .class | By class | .add-to-cart |
| tag | By element | button, input |
| [attr=val] | By attribute | [data-testid="submit"] |
Form Selectors
| Selector | Use Case |
|---|---|
| input[type="email"] | Email fields |
| input[type="password"] | Password fields |
| input[type="search"] | Search boxes |
| input[name="q"] | Google/search query |
| textarea | Multi-line text areas |
| select[name="country"] | Dropdown menus |
| button[type="submit"] | Submit buttons |
E-commerce Selectors
| Selector | Use Case |
|---|---|
| .product-price, [data-price] | Product prices |
| .add-to-cart, #add-to-cart | Add to cart buttons |
| .cart-total, .order-total | Cart total |
| .quantity, input[name="quantity"] | Quantity selectors |
| .checkout-btn, #checkout | Checkout buttons |
Common Workflows
Login Pattern
Search Pattern
E-commerce Pattern (with Approval)
Dashboard Metrics
Browser Hand tracks three key metrics:
- Pages Visited: Total pages navigated across all sessions.
- Tasks Completed: Total workflows successfully completed.
- Screenshots: Total screenshots captured.
Metrics are available at http://localhost:4200/hands/browser.
Error Recovery
Element Not Found
Symptoms: “Element not found” error
Recovery:
- Try alternative selector
- Use visible text instead of CSS selector
- Scroll page to bring element into view
- Wait longer for page to load
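The first two recovery steps amount to trying a list of candidate selectors in order. The helper below is a minimal sketch of that idea; `first_match` is a hypothetical name, and the lookup function is passed in so the example does not depend on any particular browser library.

```python
from typing import Callable, Iterable, Optional, TypeVar

T = TypeVar("T")


def first_match(
    candidates: Iterable[str],
    find: Callable[[str], Optional[T]],
) -> Optional[T]:
    """Try each selector in order; return the first element found, else None."""
    for selector in candidates:
        element = find(selector)
        if element is not None:
            return element
    return None
```

With Playwright, `page.query_selector` returns `None` when nothing matches, so a call such as `first_match(["#add-to-cart", ".add-to-cart", "text=Add to cart"], page.query_selector)` would fall back from ID to class to visible text.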
Page Timeout
Symptoms: Page doesn’t load within timeout
Recovery:
- Retry navigation
- Check if URL is correct
- Verify internet connection
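Retrying a navigation is commonly done with a bounded number of attempts and a growing delay between them. The sketch below assumes that approach; the attempt count, the delays, and the name `retry_navigation` are illustrative choices, not documented Browser Hand behavior. It catches Python's built-in `TimeoutError`; code that calls Playwright directly would catch `playwright.sync_api.TimeoutError` instead.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_navigation(
    navigate: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call navigate() until it succeeds or the attempts run out."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return navigate()
        except TimeoutError:
            if attempt == attempts:
                raise  # out of retries: surface the timeout to the caller
            sleep(base_delay * 2 ** (attempt - 1))  # wait 1s, 2s, 4s, ...
    raise AssertionError("unreachable")
```

The `sleep` parameter exists so the delay can be replaced in tests or by a scheduler.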
CAPTCHA
Symptoms: CAPTCHA challenge appears
Recovery: Browser Hand cannot solve CAPTCHAs. It will inform you and stop. You must solve the CAPTCHA manually.
Login Required
Symptoms: Page redirects to login
Recovery: Browser Hand will ask for credentials. It never stores passwords in memory after the session ends.
Cookie Consent
Security Rules
Best Practices
Advanced Configuration
Custom Browser Path
Edit ~/.openfang/hands/browser.toml to use a specific Chromium/Chrome binary:
Proxy Configuration
User Agent Override
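Since Browser Hand runs on Playwright, the three overrides in this section plausibly end up as Playwright launch and context options. The sketch below shows that mapping only; the function and its parameter names are hypothetical, and the actual keys in browser.toml are not shown here. The option names in the returned dictionaries (`headless`, `executable_path`, `proxy`, `user_agent`) are Playwright's own.

```python
from typing import Any, Dict, Optional, Tuple


def playwright_options(
    headless: bool = True,
    browser_path: Optional[str] = None,
    proxy_server: Optional[str] = None,
    user_agent: Optional[str] = None,
) -> Tuple[Dict[str, Any], Dict[str, Any]]:
    """Build (launch_kwargs, context_kwargs) from hypothetical settings."""
    launch: Dict[str, Any] = {"headless": headless}
    if browser_path:
        launch["executable_path"] = browser_path  # custom Chromium/Chrome binary
    if proxy_server:
        launch["proxy"] = {"server": proxy_server}
    context: Dict[str, Any] = {}
    if user_agent:
        context["user_agent"] = user_agent
    return launch, context


# With Playwright installed, the result would be used roughly like this:
#   launch, context = playwright_options(proxy_server="http://proxy.example:8080")
#   browser = p.chromium.launch(**launch)
#   page = browser.new_context(**context).new_page()
```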
Playwright Reference
Browser Hand uses Playwright under the hood. Here are the key operations:
Navigation
Clicking
Typing
Reading Content
Screenshots
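The five operations named in this reference can be sketched together with Playwright's synchronous Python API. This is an illustrative example, not the code Browser Hand runs: the function names are hypothetical, the selectors are borrowed from the Form Selectors table, and the target page is assumed to contain a matching search box and submit button.

```python
def search_and_capture(page, url: str, query: str, screenshot_path: str) -> str:
    """Run one search on a Playwright Page and return the visible page text."""
    page.goto(url)                              # Navigation
    page.fill('input[name="q"]', query)         # Typing
    page.click('button[type="submit"]')         # Clicking
    text = page.inner_text("body")              # Reading content
    page.screenshot(path=screenshot_path)       # Screenshots
    return text


def main(url: str) -> None:
    """Drive a real headless browser; requires the playwright package."""
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        page = browser.new_page()
        try:
            print(search_and_capture(page, url, "USB-C cables", "result.png")[:200])
        finally:
            browser.close()
```

`main` is not called automatically. Pass it the URL of a page that actually has the assumed search form, otherwise `fill` will wait for the element and then time out.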
Limitations
Example Session
Next Steps
Lead Hand
Use Browser Hand to enrich leads with web data
Collector Hand
Automate competitive intelligence gathering
