markdownify-mcp  by zcaceres

MCP server for converting files/web content to Markdown

Created 9 months ago
2,125 stars

Top 21.2% on SourcePulse

GitHubView on GitHub
Project Summary

This project provides a Model Context Protocol (MCP) server for converting a wide array of content types, including documents (PDF, DOCX, XLSX, PPTX), images, audio, and web content (YouTube transcripts, Bing searches, general web pages), into Markdown format. It's designed for developers and applications needing to ingest and process diverse data sources into a unified, readable text representation.

How It Works

The server leverages a modular architecture, with distinct tools handling the conversion for each content type. It utilizes pnpm for dependency management and building, and Node.js for server execution. The core advantage lies in its comprehensive support for various formats and its MCP-compliant interface, enabling seamless integration with other MCP-aware applications.

Quick Start & Requirements

  • Install dependencies: pnpm install
  • Build the project: pnpm run build
  • Start the server: pnpm start
  • Requires Node.js and pnpm. Audio transcription relies on an installed uv (likely Universal Voice) instance, with its path configurable via the UV_PATH environment variable.

Highlighted Details

  • Supports conversion of PDF, DOCX, XLSX, PPTX, images, audio (with transcription), YouTube transcripts, Bing search results, and general web pages.
  • Provides a mechanism to retrieve existing Markdown files.
  • Designed as an MCP server for integration with other applications.

Maintenance & Community

Contributions are welcome via Pull Requests. No specific community channels or maintainer information are detailed in the README.

Licensing & Compatibility

Licensed under the MIT License. This permissive license allows for commercial use and integration into closed-source projects.

Limitations & Caveats

The README implies that audio transcription functionality is dependent on a correctly installed and configured uv instance, which is not provided by this repository. The setup for uv itself is not detailed.

Health Check
Last Commit

2 weeks ago

Responsiveness

1 day

Pull Requests (30d)
1
Issues (30d)
5
Star History
84 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Elvis Saravia Elvis Saravia(Founder of DAIR.AI), and
20 more.

markitdown by microsoft

6.7%
77k
Python tool for converting files to Markdown for LLM text analysis
Created 10 months ago
Updated 1 week ago
Feedback? Help us improve.