Discover and explore top open-source AI tools and projects—updated daily.
CursorTouchLLM-powered autonomous browser agent for web task automation
Top 98.5% on SourcePulse
Summary
Web-Use is an intelligent, autonomous browser agent leveraging the Chrome DevTools Protocol (CDP) and multiple Large Language Models (LLMs) to automate complex web interactions. It targets developers and power users seeking to streamline tasks like navigation, form filling, smart searching, and file operations, significantly boosting productivity.
How It Works
The agent utilizes CDP for direct browser control and integrates with a diverse range of LLMs for reasoning. Its core innovation lies in constructing a "Semantic Tree" directly from the DOM, providing accurate structural context via CSS selectors and roles, rather than relying on brittle XPaths. It also supports the Web Model Context Protocol (WebMCP) for dynamic discovery and use of website-specific tools.
Quick Start & Requirements
uv sync to install dependencies..env file with necessary API keys (e.g., GOOGLE_API_KEY).uv run main.py.CONTRIBUTING file for development guidelines.Highlighted Details
Maintenance & Community
The project is primarily maintained by Jeomon George and Muhammad Yaseen. Specific community channels (like Discord/Slack) or a public roadmap are not detailed in the README.
Licensing & Compatibility
Licensed under the MIT License, permitting broad use, modification, and distribution, including for commercial purposes and linking within closed-source applications.
Limitations & Caveats
The agent operates with defined max_steps and max_consecutive_failures, indicating potential timeouts or execution limits. Advanced features like use_system_profile may require specific user environment setup. API key management is essential for certain LLM integrations.
2 days ago
Inactive
Skyvern-AI