fetcher-mcp by jae-jae

MCP server for fetching web page content using Playwright

Created 11 months ago

991 stars

Top 37.3% on SourcePulse

Project Summary

Fetcher MCP is an MCP server that fetches web page content with Playwright. It is aimed at users who need to handle dynamic, JavaScript-rendered pages and extract the main article text. It offers intelligent content extraction, a choice of output format (HTML or Markdown), and parallel processing, which makes it suitable for researchers and developers building content aggregation or analysis tools.

How It Works

Fetcher MCP uses Playwright to drive a headless Chromium browser, so it can execute JavaScript and load modern web applications before reading the page. An integrated Readability algorithm extracts the main content and strips boilerplate such as ads and navigation. The server also saves bandwidth by blocking non-essential resources and provides error handling for more reliable operation.
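
The resource blocking described above can be illustrated with a small request filter. This is a minimal sketch, not the project's actual code: RouteLike, RequestLike, shouldBlock, and handleRoute are hypothetical names. The two interfaces only mirror the few Playwright Route and Request methods used here, so the sketch runs without Playwright installed.

```typescript
// Structural stand-ins for the parts of Playwright's Route/Request API used here
// (assumption: declared locally so the sketch does not need Playwright installed).
interface RequestLike {
  resourceType(): string;
}

interface RouteLike {
  request(): RequestLike;
  abort(): Promise<void>;
  continue(): Promise<void>;
}

// Resource types the summary lists as blocked: images, stylesheets, fonts, media.
const BLOCKED_RESOURCE_TYPES: ReadonlySet<string> = new Set([
  "image",
  "stylesheet",
  "font",
  "media",
]);

// Pure decision function, kept separate so it can be tested in isolation.
function shouldBlock(resourceType: string): boolean {
  return BLOCKED_RESOURCE_TYPES.has(resourceType);
}

// Abort blocked resource types and let every other request through.
async function handleRoute(route: RouteLike): Promise<void> {
  if (shouldBlock(route.request().resourceType())) {
    await route.abort();
  } else {
    await route.continue();
  }
}
```

With Playwright available, a handler of this shape could be registered through page.route("**/*", handleRoute), so that documents, scripts, and XHR requests still load while heavy assets are skipped.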

Quick Start & Requirements

Install Playwright browsers: npx playwright install chromium
Run the server: npx -y fetcher-mcp
Debug mode: npx -y fetcher-mcp --debug
Configuration for Claude Desktop: See README for macOS/Windows paths.
Requires Node.js and npm/npx.
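
For the Claude Desktop step, the entry below is a hedged sketch of what a server registration typically looks like. It assumes the usual mcpServers layout of Claude Desktop's JSON config and reuses the npx command shown above. The server label "fetcher" and the McpServerEntry type are illustrative choices, and the config file location should still be taken from the README.

```typescript
// Shape of one entry under "mcpServers" (assumption: the usual Claude Desktop layout).
interface McpServerEntry {
  command: string;
  args: string[];
}

// "fetcher" is an arbitrary label chosen for this sketch.
const claudeDesktopConfig: { mcpServers: Record<string, McpServerEntry> } = {
  mcpServers: {
    fetcher: {
      command: "npx",
      args: ["-y", "fetcher-mcp"], // same invocation as "Run the server" above
    },
  },
};

// Print JSON that could be merged into the Claude Desktop config file.
console.log(JSON.stringify(claudeDesktopConfig, null, 2));
```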

Highlighted Details

Supports JavaScript execution via Playwright.
Intelligent content extraction with Readability algorithm.
Parallel fetching of multiple URLs via fetch_urls tool.
Resource optimization by blocking images, stylesheets, fonts, and media.
Configurable parameters for timeouts, content extraction, and output format.
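
The parallel fetching behind a tool like fetch_urls can be sketched as follows. This illustrates the general pattern only, not the project's implementation: fetchAll, FetchOutcome, and the caller-supplied fetchOne function are hypothetical names. Each URL is fetched concurrently, and a failure on one URL is recorded instead of cancelling the rest.

```typescript
// Outcome of fetching one URL: either its content or an error message.
type FetchOutcome =
  | { url: string; ok: true; content: string }
  | { url: string; ok: false; error: string };

// Fetch every URL concurrently. `fetchOne` is a hypothetical per-URL fetcher
// supplied by the caller (for example, one browser page per URL).
async function fetchAll(
  urls: string[],
  fetchOne: (url: string) => Promise<string>,
): Promise<FetchOutcome[]> {
  return Promise.all(
    urls.map(async (url): Promise<FetchOutcome> => {
      try {
        return { url, ok: true, content: await fetchOne(url) };
      } catch (err) {
        // Record the failure so the other URLs still produce results.
        return { url, ok: false, error: String(err) };
      }
    }),
  );
}
```

Catching errors per URL keeps the results in the same order as the input list, which makes it simple to report partial success back to the caller.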

Maintenance & Community

No specific contributors, sponsorships, or community links (Discord/Slack) are mentioned in the README.

Licensing & Compatibility

Licensed under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The README suggests using waitForNavigation: true and increasing timeouts for anti-crawler mechanisms or slow-loading sites, indicating potential challenges with certain dynamic or protected websites.
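
As a rough illustration of that advice, the helper below builds tool arguments for a slow or protected site. Only waitForNavigation is named in this summary. The url and timeout field names, the millisecond unit, and the 60000 default are assumptions made for the sketch and should be checked against the README.

```typescript
// Arguments for a single fetch call (sketch; see the assumptions noted per field).
interface SlowSiteArgs {
  url: string; // assumed parameter name
  waitForNavigation: boolean; // named in the summary
  timeout: number; // assumed parameter name, in milliseconds
}

// Build arguments that wait for navigation and allow a longer timeout.
function argsForSlowSite(url: string, timeoutMs: number = 60000): SlowSiteArgs {
  return { url, waitForNavigation: true, timeout: timeoutMs };
}

console.log(JSON.stringify(argsForSlowSite("https://example.com")));
```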

Explore Similar Projects

latent-browser by jbilcke

UglyFeed by fabriziosalmi

LLMFeeder by jatinkrmalik

wexin-read-mcp by Bwkyd

docmd by docmd-io

mdream by harlan-zw

parsera by raznem

markdown-site by waynesutton

fetch-mcp by zcaceres

sitefetch by egoist

tap4-ai-crawler by 6677-ai

llm-scraper by mishushakov