MCP server for converting files/web content to Markdown
Top 22.8% on sourcepulse
This project provides a Model Context Protocol (MCP) server for converting a wide array of content types, including documents (PDF, DOCX, XLSX, PPTX), images, audio, and web content (YouTube transcripts, Bing searches, general web pages), into Markdown format. It's designed for developers and applications needing to ingest and process diverse data sources into a unified, readable text representation.
How It Works
The server uses a modular architecture in which a distinct tool handles the conversion for each content type. It uses pnpm for dependency management and building, and Node.js for server execution. Its main advantages are broad format support and an MCP-compliant interface, which lets other MCP-aware applications integrate with it directly.
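The one-tool-per-content-type layout described above can be sketched as a small registry that dispatches by tool name. This is a minimal illustration, not the project's actual code: `ConversionTool`, `register`, `convert`, and the tool name `webpage-to-markdown` are all hypothetical, and the stub converter does no real work.

```typescript
// Hypothetical shape of a conversion tool; the real project's types may differ.
type ConversionTool = {
  name: string;
  description: string;
  // Takes a file path or URL and resolves to Markdown text.
  run: (source: string) => Promise<string>;
};

// One entry per content type, keyed by tool name.
const registry = new Map<string, ConversionTool>();

function register(tool: ConversionTool): void {
  if (registry.has(tool.name)) {
    throw new Error(`duplicate tool: ${tool.name}`);
  }
  registry.set(tool.name, tool);
}

// Look up the requested tool and delegate the conversion to it.
async function convert(toolName: string, source: string): Promise<string> {
  const tool = registry.get(toolName);
  if (!tool) {
    throw new Error(`unknown tool: ${toolName}`);
  }
  return tool.run(source);
}

// Stub tool for illustration only: it does not fetch or parse anything.
register({
  name: "webpage-to-markdown",
  description: "Convert a web page to Markdown (stub)",
  run: async (url) => `# Page\n\nSource: ${url}`,
});
```

Keeping each converter behind the same `run` signature means a new content type can be added by registering one more tool, without touching the dispatch logic.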
Quick Start & Requirements
pnpm install
pnpm run build
pnpm start
Requires pnpm. Audio transcription relies on an installed uv instance (most likely Astral's Python package and project manager), with its path configurable via the UV_PATH environment variable.
Highlighted Details
Maintenance & Community
Contributions are welcome via Pull Requests. No specific community channels or maintainer information are detailed in the README.
Licensing & Compatibility
Licensed under the MIT License. This permissive license allows for commercial use and integration into closed-source projects.
Limitations & Caveats
The README implies that audio transcription depends on a correctly installed and configured uv instance, which this repository does not provide. Setup for uv itself is not documented.
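Because uv is not bundled, a caller may want to resolve its location explicitly before attempting audio transcription. The sketch below shows one way to read the UV_PATH variable; only the variable name comes from the README summary. The helper `resolveUvPath`, the `Env` type, and the fallback to a bare `uv` command looked up on the system PATH are assumptions made for illustration.

```typescript
// Hypothetical helper: only the UV_PATH variable name comes from the source.
type Env = Record<string, string | undefined>;

// Return the configured uv location, or fall back to a bare command name
// (assumed here to be "uv", resolved by the shell's PATH lookup).
function resolveUvPath(env: Env, fallback: string = "uv"): string {
  const configured = env["UV_PATH"]?.trim();
  // Treat an unset, empty, or whitespace-only value as "not configured".
  return configured ? configured : fallback;
}
```

In a Node.js process this would typically be called as `resolveUvPath(process.env)`; taking the environment as a parameter keeps the function easy to test.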