vercel-labs
Browser automation CLI for AI agents
Summary
This project provides a command-line interface (CLI) for browser automation, designed for use by AI agents. It offers a fast, deterministic, and AI-friendly way to control web browsers, so agents can interact with web pages programmatically. The primary benefit is a streamlined workflow for AI-driven web tasks.
How It Works
The tool employs a client-daemon architecture, featuring a fast Rust CLI that communicates with a Node.js daemon managing the browser instance via Playwright. Its core innovation lies in a snapshot command that generates an accessibility tree with unique, stable refs (e.g., @e1). Subsequent actions like clicking or filling are performed using these refs, ensuring deterministic element selection that is highly compatible with AI parsing and decision-making. It also supports traditional CSS selectors and XPath for broader compatibility.
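The snapshot-then-act flow described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the tool's actual implementation: the snapshot line format (`- role "name" [ref=eN]`) and the `fill` and `click` subcommand names are assumptions inferred from the description, and `parse_refs` and `build_command` are hypothetical helpers. The sketch only builds command lines and does not launch a browser.

```python
import re
import shlex

# Assumed shape of a snapshot: one element per line, with a ref in brackets.
# The real agent-browser output may differ.
SNAPSHOT = '''\
- heading "Example Domain" [ref=e1]
- textbox "Email" [ref=e2]
- button "Submit" [ref=e3]
'''

# Captures the role, the accessible name, and the ref id from one line.
LINE_RE = re.compile(
    r'^\s*-\s+(?P<role>\w+)\s+"(?P<name>[^"]*)".*?\[ref=(?P<ref>e\d+)\]'
)


def parse_refs(snapshot: str) -> dict[tuple[str, str], str]:
    """Map (role, name) pairs to '@eN' refs found in the snapshot text."""
    refs = {}
    for line in snapshot.splitlines():
        match = LINE_RE.match(line)
        if match:
            refs[(match["role"], match["name"])] = "@" + match["ref"]
    return refs


def build_command(action: str, ref: str, *args: str) -> list[str]:
    """Build an argv list for a ref-based action (subcommand names assumed)."""
    return ["agent-browser", action, ref, *args]


refs = parse_refs(SNAPSHOT)
fill_cmd = build_command("fill", refs[("textbox", "Email")], "user@example.com")
click_cmd = build_command("click", refs[("button", "Submit")])

# Print the commands an agent would run next, quoted for a shell.
print(shlex.join(fill_cmd))
print(shlex.join(click_cmd))
```

Because the agent selects elements by ref rather than by guessing a selector, the same snapshot always resolves to the same target, which is the determinism the summary refers to.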
Quick Start & Requirements
Install the CLI with npm install -g agent-browser, then download the default Chromium browser with agent-browser install. On Linux, system dependencies can be installed with agent-browser install --with-deps.
Highlighted Details
Snapshot with refs: lets AI agents reliably identify and interact with elements.
Sessions: isolated browser sessions selected with the --session flag or the AGENT_BROWSER_SESSION environment variable, each with its own cookies and history.
Maintenance & Community
No specific details regarding maintainers, community channels (e.g., Discord, Slack), sponsorships, or roadmap were present in the provided README.
Licensing & Compatibility
Limitations & Caveats
The tool is a CLI, primarily intended for programmatic use by agents or scripts. While it supports multiple browsers via Playwright, Chromium is the default. The README does not detail specific limitations, known bugs, or unsupported platforms beyond the architecture's platform support matrix.