mcp-browser-use  by Saik0s

MCP server for AI-driven browser automation and web research

created 6 months ago
775 stars

Top 46.0% on sourcepulse

Project Summary

This project is a browser automation server that implements the Model Context Protocol (MCP), letting AI agents control a browser through natural language and carry out web research. It targets developers and researchers building agents that need to interact with the web, and it works with several LLM providers, including OpenAI, Anthropic, Ollama, and OpenRouter.

How It Works

The server acts as an intermediary, translating natural language commands into browser actions via Playwright. It communicates with AI agents over MCP, allowing them to execute tasks such as page navigation, form filling, and element interaction. A key feature is its ability to connect to a user's existing Chrome/Chromium instance via the Chrome DevTools Protocol (CDP), so the user can watch the agent work and the agent can use the user's own browser profile.
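To make the CDP connection step more concrete, here is a minimal sketch of how a client can discover the WebSocket endpoint of a running Chrome/Chromium instance. It relies on the browser's /json/version HTTP endpoint and its webSocketDebuggerUrl field, which are available when the browser is started with --remote-debugging-port. The port 9222 and the function names are assumptions chosen for illustration; this summary does not show how the server itself performs the connection.

```python
import json
from urllib.request import urlopen


def parse_ws_endpoint(version_payload: str) -> str:
    """Extract the browser-level WebSocket URL from a /json/version response body."""
    info = json.loads(version_payload)
    return info["webSocketDebuggerUrl"]


def discover_ws_endpoint(host: str = "127.0.0.1", port: int = 9222) -> str:
    """Ask a running Chrome/Chromium for its CDP WebSocket endpoint.

    Assumes the browser was launched with --remote-debugging-port=<port>.
    """
    with urlopen(f"http://{host}:{port}/json/version", timeout=5) as resp:
        return parse_ws_endpoint(resp.read().decode("utf-8"))
```

Calling discover_ws_endpoint() against a browser started with remote debugging enabled returns a ws:// URL that a CDP client, such as Playwright's connect_over_cdp, can attach to.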

Quick Start & Requirements

  • Install with uv sync.
  • Requires Python 3.11+ and Playwright browsers (uv run playwright install).
  • Configuration is primarily via environment variables (e.g., OPENROUTER_API_KEY).
  • See the project's Claude Desktop config example for setup.
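As a rough illustration of the setup step, the sketch below builds a Claude Desktop configuration entry for an MCP server. The mcpServers key with command, args, and env fields is the shape Claude Desktop expects. The server label, the launch command, and the placeholder values are assumptions; the project's own config example is the authoritative source for the actual entry point.

```python
import json


def build_claude_desktop_config(label: str, command: str, args: list[str], env: dict[str, str]) -> dict:
    """Return a Claude Desktop config fragment registering one MCP server."""
    return {
        "mcpServers": {
            label: {
                "command": command,
                "args": args,
                "env": env,
            }
        }
    }


# Hypothetical values: replace the placeholders with the launch command
# and API key from the project's documentation.
config = build_claude_desktop_config(
    label="browser-use",
    command="uv",
    args=["run", "<server-entry-point>"],
    env={"OPENROUTER_API_KEY": "<your-key>"},
)
print(json.dumps(config, indent=2))
```

Keeping the API key in the env block matches the project's approach of configuring the server through environment variables.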

Highlighted Details

  • Full MCP implementation for AI agent communication.
  • Supports numerous LLM providers including OpenAI, Anthropic, Ollama, and OpenRouter.
  • Offers run_browser_agent for natural language browser control and run_deep_search for multi-step web research.
  • Can connect to a user's Chrome/Chromium instance via CDP (MCP_USE_OWN_BROWSER=true).
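The tools listed above are invoked by an MCP client through a JSON-RPC 2.0 request with the method tools/call. The sketch below shows what such a request might look like for run_browser_agent. The envelope follows the MCP specification, but the argument name task is an assumption, since this summary does not list the tool's input schema.

```python
import json
from itertools import count

_request_ids = count(1)  # simple incrementing JSON-RPC request ids


def tool_call_request(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP tools/call request for the given tool."""
    message = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(message)


# "task" is a hypothetical argument name used only for illustration.
request = tool_call_request(
    "run_browser_agent",
    {"task": "Open example.com and report the page title"},
)
print(request)
```

In practice an MCP client library builds and sends this message; a client can first call tools/list to obtain each tool's real input schema.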

Maintenance & Community

No specific community links or notable contributors are mentioned in the README.

Licensing & Compatibility

MIT License. Permissive for commercial use and closed-source linking.

Limitations & Caveats

The project is configured almost entirely through environment variables, which can become cumbersome for complex setups. Connecting to an existing browser via CDP is supported, but managing browser state and avoiding conflicts is left to the user. Vision capabilities (MCP_USE_VISION) work only if the selected LLM supports image input.
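One way to keep environment-driven configuration manageable is to read the boolean flags in a single place before launching the server. The sketch below does this for the two flags named in this summary. The set of accepted truthy spellings and the helper names are assumptions; the summary only shows the value true, and the server's own parsing may differ.

```python
import os
from typing import Mapping

# Assumed truthy spellings; only "true" appears in the source.
_TRUE_VALUES = {"1", "true", "yes", "on"}


def env_flag(name: str, environ: Mapping[str, str] = os.environ, default: bool = False) -> bool:
    """Interpret an environment variable as a boolean flag."""
    raw = environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in _TRUE_VALUES


def summarize_flags(environ: Mapping[str, str] = os.environ) -> dict[str, bool]:
    """Collect the browser-related flags mentioned in this summary."""
    return {name: env_flag(name, environ) for name in ("MCP_USE_OWN_BROWSER", "MCP_USE_VISION")}
```

Printing summarize_flags() before startup gives a quick check of which optional behaviors are switched on.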

Health Check

  • Last commit: 1 month ago
  • Responsiveness: 1+ week
  • Pull Requests (30d): 1
  • Issues (30d): 0

Star History

  • 295 stars in the last 90 days
