agent-browser-cli  by sleepinginsummer

Browser control CLI for AI agents

Created 2 weeks ago

New!

336 stars

Top 81.8% on SourcePulse

GitHubView on GitHub
Project Summary

Summary

agent-browser-cli provides a command-line interface and Chrome extension to expose real browser sessions to AI agents, addressing the need for precise web interaction while preserving login states and cookies. It benefits AI developers, automation testers, and data scrapers by enabling tasks like page scanning, JS execution, CDP control, and screenshots within agent workflows.

How It Works

A Rust CLI interfaces with a Chrome MV3 extension (tmwd_cdp_bridge) to control the user's active browser via the Chrome DevTools Protocol (CDP). This enables precise page manipulation, including JS execution, DOM reading, cookie management, and screenshots. Unlike Selenium/Playwright, it augments existing sessions, ideal for agents interacting with pre-authenticated or complex web applications.

Quick Start & Requirements

  • Installation: CLI: npm install -g @sleepinsummer/agent-browser-cli. Chrome extension: Manually load from release zip or source (assets/tmwd_cdp_bridge/).
  • Prerequisites: Chrome browser on Windows (incl. WSL), macOS, Linux. WSL requires v2.0.0+, recommended Windows 11 22H2+ with networkingMode=mirrored. Linux needs Chrome/Chromium supporting extension installation.
  • Setup: Follow AI_INSTALL.md for CLI, extension loading, and SKILL.md integration.
  • Links: AI Install: AI_INSTALL.md. Full commands/SOP: SKILL.md.

Highlighted Details

  • Reuses existing Chrome sessions, preserving login state and cookies.
  • Offers tab scanning, JS execution, CDP control, screenshots, and file uploads.
  • Includes page simplification and list identification scripts for agent parsing.
  • Temporarily suppresses native browser alert/confirm/prompt dialogs during script execution.
  • Supports profile labeling for managing multiple Chrome profiles.

Maintenance & Community

Derived from the GenericAgent project. No specific community channels or maintainer details are provided in the README.

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Permissive for commercial use and closed-source integration.

Limitations & Caveats

Currently in beta (0.3.1-beta.1). WSL support requires specific configurations. The Chrome extension needs an active web tab for connection, and its UI auto-hides. Port management between CLI and extension requires careful configuration.

Health Check
Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
5
Star History
336 stars in the last 15 days

Explore Similar Projects

Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Travis Fischer Travis Fischer(Founder of Agentic), and
2 more.

obscura by h4ckf0r0day

4.6%
14k
Lightweight headless browser for AI agents and web scraping
Created 1 month ago
Updated 6 days ago
Feedback? Help us improve.