MCP server for fetching web page content using Playwright
Top 45.3% on sourcepulse
Fetcher MCP provides a server for fetching web page content using Playwright, designed for users who need to handle dynamic JavaScript-rendered content and extract main article text. It offers intelligent content extraction, flexible output formats (HTML/Markdown), and parallel processing capabilities, making it suitable for researchers and developers building content aggregation or analysis tools.
How It Works
Fetcher MCP leverages Playwright to control headless Chromium browsers, enabling it to execute JavaScript and interact with modern web applications. It features an integrated Readability algorithm for intelligent content extraction, stripping away boilerplate like ads and navigation. The server also optimizes bandwidth by blocking non-essential resources and provides robust error handling for reliable operation.
Quick Start & Requirements
npx playwright install chromium
npx -y fetcher-mcp
npx -y fetcher-mcp --debug
Highlighted Details
fetch_urls
tool.Maintenance & Community
No specific contributors, sponsorships, or community links (Discord/Slack) are mentioned in the README.
Licensing & Compatibility
Licensed under the MIT License, permitting commercial use and integration with closed-source projects.
Limitations & Caveats
The README suggests using waitForNavigation: true
and increasing timeouts for anti-crawler mechanisms or slow-loading sites, indicating potential challenges with certain dynamic or protected websites.
1 month ago
1 week