MCP server for browser automation via Playwright
Top 3.1% on sourcepulse
This project provides a Model Context Protocol (MCP) server that leverages Playwright for browser automation, enabling Large Language Models (LLMs) to interact with web pages. It offers LLM-friendly, deterministic tool application via structured accessibility snapshots, bypassing the need for vision models and screenshots.
How It Works
The server operates by capturing Playwright's accessibility tree snapshots, which are then parsed into a structured format suitable for LLMs. This approach avoids the computational overhead and potential ambiguities of image-based analysis. Users can opt into "Vision Mode" to use screenshots for visual-based interactions, which is beneficial for LLMs that process coordinate-based inputs.
Quick Start & Requirements
npx @playwright/mcp@latest
or via VS Code CLI: code --add-mcp '{"name":"playwright","command":"npx","args":["@playwright/mcp@latest"]}'
mcp/playwright
image, supporting headless chromium.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
18 hours ago
1 day