What is agent-native?
agent-native is a macOS CLI tool that exposes native application UI elements to AI agents and automation scripts. Inspired by agent-browser from Vercel Labs, it brings the same level of structured control to desktop apps that agent-browser provides for web pages.Where agent-browser uses Chrome DevTools Protocol (CDP) and the DOM for web automation, agent-native uses macOS Accessibility APIs and the AX tree for native app automation.
Key capabilities
Discovery
List running apps, open applications, and explore their UI structure through the Accessibility tree
Snapshot
Capture interactive elements with persistent references (@n1, @n2) for reliable interaction
Interaction
Click buttons, fill text fields, check toggles, select from dropdowns using simple commands
State inspection
Read text, check element state, get attributes, and verify UI conditions
Core workflow
The typical agent-native workflow follows a simple pattern:When to use agent-native
agent-native is ideal for:- AI agent automation: Give LLMs structured access to desktop apps with JSON output
- GUI testing: Automate macOS app testing through the Accessibility APIs
- Desktop workflows: Chain together actions across multiple native applications
- System administration: Automate System Settings and other macOS utilities
- Accessibility tooling: Build tools that inspect and interact with app UI elements
Agent mode
Every command supports--json output for structured data suitable for LLM tool use:
Next steps
Installation
Install via Homebrew or build from source
Quick start
Complete tutorial from installation to first automation