mcp-browser-use by Saik0s

MCP server for AI-driven browser automation and web research

Created 8 months ago
813 stars

Top 43.6% on SourcePulse

Project Summary

This project provides an AI-driven browser automation server that implements the Model Context Protocol (MCP) for natural language browser control and web research. It targets developers and researchers building AI agents that need to interact with the web, and it works with multiple LLM providers.

How It Works

The server acts as an intermediary that translates natural language commands into browser actions executed through Playwright. It communicates with AI agents over MCP, which lets them perform tasks such as page navigation, form filling, and element interaction. It can also connect to a user's existing Chrome/Chromium instance via the Chrome DevTools Protocol (CDP), so the agent works inside the user's own browser profile and its actions can be observed directly.
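
As a rough illustration of the CDP attachment described above, the sketch below shows how a client might discover the debugging endpoint of a Chrome instance started with --remote-debugging-port and pass it to Playwright. It is not taken from the project's source. The port 9222, the helper names, and the way the page is opened are assumptions made for illustration.

```python
import json
from urllib.request import urlopen


def fetch_version_payload(host: str = "127.0.0.1", port: int = 9222,
                          timeout: float = 2.0) -> str:
    """Read Chrome's /json/version document.

    Chrome must already be running with --remote-debugging-port=<port>.
    """
    with urlopen(f"http://{host}:{port}/json/version", timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def websocket_endpoint(version_payload: str) -> str:
    """Extract the browser-level CDP WebSocket URL from the version document."""
    info = json.loads(version_payload)
    url = info.get("webSocketDebuggerUrl")
    if not url:
        raise ValueError("no webSocketDebuggerUrl; is remote debugging enabled?")
    return url


def open_page_in_existing_browser(endpoint: str, target_url: str) -> str:
    """Attach to the running browser over CDP and return the page title."""
    # Imported lazily so the helpers above work without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.connect_over_cdp(endpoint)
        # Reuse the user's existing context (and profile) when one is exposed.
        context = browser.contexts[0] if browser.contexts else browser.new_context()
        page = context.new_page()
        page.goto(target_url)
        title = page.title()
        page.close()
        return title
```

With a browser listening on the assumed port, open_page_in_existing_browser(websocket_endpoint(fetch_version_payload()), "https://example.com") would open a tab in the user's own browser rather than in a fresh Playwright-managed one.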

Quick Start & Requirements

  • Install with uv sync.
  • Requires Python 3.11+ and Playwright browsers (uv run playwright install).
  • Configuration is primarily via environment variables (e.g., OPENROUTER_API_KEY).
  • See the Claude Desktop config example in the README for setup.
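
For orientation, Claude Desktop reads MCP servers from the mcpServers key of its claude_desktop_config.json file, with a command, args, and env per server. The sketch below generates such an entry. The server name, the launch command, and the placeholder key are hypothetical values chosen for illustration; the README's own example is the authority on the real ones.

```python
import json


def claude_desktop_entry(name, command, args, env):
    """Build the mcpServers fragment that Claude Desktop reads from its config."""
    return {
        "mcpServers": {
            name: {"command": command, "args": list(args), "env": dict(env)}
        }
    }


# Hypothetical values: server name, entry point, and key are placeholders.
config = claude_desktop_entry(
    name="browser-use",
    command="uv",
    args=["run", "mcp-server-browser-use"],  # assumed entry point, check the README
    env={"OPENROUTER_API_KEY": "<your-key>", "MCP_USE_OWN_BROWSER": "true"},
)
config_json = json.dumps(config, indent=2)
print(config_json)
```

Environment variable values are passed as strings, which is why the boolean flag is written as "true" rather than a JSON boolean.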

Highlighted Details

  • Full MCP implementation for AI agent communication.
  • Supports numerous LLM providers including OpenAI, Anthropic, Ollama, and OpenRouter.
  • Offers run_browser_agent for natural language browser control and run_deep_search for multi-step web research.
  • Can connect to a user's Chrome/Chromium instance via CDP (MCP_USE_OWN_BROWSER=true).
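
To make the tool interface more concrete: MCP clients invoke server tools with a JSON-RPC 2.0 tools/call request. The sketch below builds such requests for the two tools named above. The argument names ("task" and "research_task") and the example prompts are assumptions; the real input schema is whatever the server advertises through tools/list.

```python
import json
from itertools import count

# Monotonic request ids, as JSON-RPC expects a distinct id per pending request.
_request_ids = count(1)


def tool_call_request(tool: str, arguments: dict) -> str:
    """Serialize an MCP tools/call request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(message)


# Argument names below are hypothetical; check the server's tool schema.
agent_msg = tool_call_request(
    "run_browser_agent",
    {"task": "Open example.com and report the page title"},
)
search_msg = tool_call_request(
    "run_deep_search",
    {"research_task": "Summarize recent changes to the Playwright API"},
)
print(agent_msg)
print(search_msg)
```

In practice an MCP client library would construct and send these messages; building them by hand is shown here only to expose the wire format.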

Maintenance & Community

No specific community links or notable contributors are mentioned in the README.

Licensing & Compatibility

MIT License. Permissive for commercial use and closed-source linking.

Limitations & Caveats

The project is configured almost entirely through environment variables, which can become cumbersome for complex setups. Connecting to an existing browser via CDP is supported, but managing browser state and avoiding conflicts is left to the user. Vision capabilities (MCP_USE_VISION) work only if the selected LLM supports them.
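
One way to keep environment-driven configuration manageable is to parse it once into a typed object. The sketch below does this for the three variables mentioned on this page. It is not the project's own loader: the accepted truthy spellings, the defaults, and the Settings class are assumptions.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Assumed set of spellings treated as "enabled"; the project may differ.
_TRUTHY = {"1", "true", "yes", "on"}


def _flag(env: Mapping[str, str], name: str, default: bool = False) -> bool:
    """Interpret an environment variable as a boolean switch."""
    raw = env.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in _TRUTHY


@dataclass(frozen=True)
class Settings:
    openrouter_api_key: Optional[str]
    use_own_browser: bool
    use_vision: bool


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Collect the relevant variables into one immutable settings object."""
    return Settings(
        openrouter_api_key=env.get("OPENROUTER_API_KEY") or None,
        use_own_browser=_flag(env, "MCP_USE_OWN_BROWSER"),
        use_vision=_flag(env, "MCP_USE_VISION"),
    )
```

Passing a plain dict instead of os.environ, for example load_settings({"MCP_USE_OWN_BROWSER": "true"}), makes the parsing easy to check in isolation.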

Health Check

  • Last commit: 3 months ago
  • Responsiveness: 1+ week
  • Pull requests (30d): 2
  • Issues (30d): 1
  • Star history: 24 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Luis Capelo (Cofounder of Lightning AI), and 15 more.

stagehand by browserbase

0.5%
17k
AI browser automation framework for production
Created 1 year ago
Updated 1 day ago

Starred by Kevin Hou (Head of Product Engineering at Windsurf), Eric Zhu (Coauthor of AutoGen; Research Scientist at Microsoft Research), and 29 more.

browser-use by browser-use

0.6%
70k
SDK for AI agent browser control
Created 10 months ago
Updated 1 day ago