Omnimodal agent framework for ComfyUI and LLMs
Pixelle-MCP is an open-source framework that bridges ComfyUI's visual node-based workflow system with Large Language Models (LLMs) via the Model Context Protocol (MCP). It lets users turn ComfyUI workflows into callable "MCP Tools" with zero coding, so that LLMs can dynamically execute complex multimodal AI generation tasks. The target audience includes AI researchers, developers, and power users who want to integrate generative AI capabilities into LLM-driven applications.
How It Works
The core innovation lies in the "Workflow-as-MCP Tool" solution. ComfyUI workflows are exported in an API format, with special syntax in node titles defining parameters and outputs. The MCP server then converts these exported workflows into MCP-compliant tools that LLMs can discover and invoke. This approach leverages the extensive ComfyUI ecosystem for multimodal generation (text, image, sound, video) and integrates with various LLMs through the LiteLLM framework, offering a flexible and extensible platform.
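The conversion step described above can be sketched roughly as follows. This is a minimal illustration, not Pixelle-MCP's actual implementation: the exact title syntax is not given here, so the `$name.field:description` marker, the `extract_parameters` and `to_tool_spec` helpers, and the sample workflow are all hypothetical. The only parts taken as given are the general shape of a ComfyUI API-format export (node IDs mapping to `class_type`, `inputs`, and `_meta.title`) and the `name`/`inputSchema` fields of an MCP tool description.

```python
import json
import re

# Hypothetical title marker: "$<param>.<field>" optionally followed by
# ":<description>". Assumed for illustration; not the project's documented syntax.
TITLE_PATTERN = re.compile(r"^\$(?P<param>\w+)\.(?P<field>\w+)(?::(?P<desc>.*))?$")

# A tiny made-up workflow in the shape of a ComfyUI API-format export.
WORKFLOW_JSON = """
{
  "3": {
    "class_type": "CLIPTextEncode",
    "inputs": {"text": "a red fox", "clip": ["4", 1]},
    "_meta": {"title": "$prompt.text:Text prompt for the image"}
  },
  "4": {
    "class_type": "CheckpointLoaderSimple",
    "inputs": {"ckpt_name": "model.safetensors"},
    "_meta": {"title": "Load Checkpoint"}
  }
}
"""


def extract_parameters(workflow: dict) -> list[dict]:
    """Collect one parameter per node whose title carries the marker."""
    params = []
    for node_id, node in workflow.items():
        title = node.get("_meta", {}).get("title", "")
        match = TITLE_PATTERN.match(title)
        if match is None:
            continue  # ordinary node, nothing to expose
        field = match["field"]
        params.append({
            "name": match["param"],
            "node_id": node_id,
            "field": field,
            "description": (match["desc"] or "").strip(),
            # The value currently stored in the node doubles as the default.
            "default": node["inputs"].get(field),
        })
    return params


def to_tool_spec(tool_name: str, params: list[dict]) -> dict:
    """Build a simplified MCP-style tool description from the parameters."""
    return {
        "name": tool_name,
        "inputSchema": {
            "type": "object",
            "properties": {
                p["name"]: {"description": p["description"], "default": p["default"]}
                for p in params
            },
        },
    }


workflow = json.loads(WORKFLOW_JSON)
params = extract_parameters(workflow)
tool = to_tool_spec("text_to_image", params)
print(json.dumps(tool, indent=2))
```

Only the node whose title carries the marker becomes a tool parameter; the checkpoint loader keeps its ordinary title and stays internal to the workflow.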
Quick Start & Requirements
Copy config.yml.example to config.yml, and configure the ComfyUI service address and LLM models.
Start the services with Docker (docker compose up -d) or the provided shell scripts (run.sh, run.bat).
Once running, the services are reachable at http://localhost:9003, with the MCP Server at http://localhost:9002/sse.
Highlighted Details
Maintenance & Community
The project is actively developed and welcomes community contributions. Links to Discord and WeChat groups are provided for support and updates.
Licensing & Compatibility
Released under the MIT License, permitting commercial use and closed-source linking.
Limitations & Caveats
Optional parameters in ComfyUI workflows require default values to be set in the node. Fields already connected to other nodes are not parsed as parameters. Tool naming relies on the exported file name.
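A small sketch may make these three caveats more concrete. It assumes only the general shape of a ComfyUI API-format export, where a link to another node is stored as a `[source_node_id, output_index]` pair. The helper names (`is_connected`, `resolve_value`, `tool_name_from_export`) and the sample data are hypothetical and are not part of Pixelle-MCP.

```python
from pathlib import Path


def is_connected(value) -> bool:
    """True if an input holds a link to another node rather than a literal.

    In an API-format export, links appear as [source_node_id, output_index].
    """
    return (
        isinstance(value, list)
        and len(value) == 2
        and isinstance(value[0], str)
        and isinstance(value[1], int)
    )


def resolve_value(inputs: dict, field: str, supplied=None):
    """Pick the value to run with, mirroring the caveats listed above."""
    current = inputs.get(field)
    if is_connected(current):
        # Connected fields are wired by the graph, so they cannot be parameters.
        raise ValueError(f"{field!r} is connected to another node")
    if supplied is not None:
        return supplied
    if current is None:
        # An optional parameter needs a default stored in the node itself.
        raise ValueError(f"{field!r} has no default value set in the node")
    return current


def tool_name_from_export(path: str) -> str:
    """Derive the tool name from the exported file name (without extension)."""
    return Path(path).stem


node_inputs = {"text": "a red fox", "clip": ["4", 1]}

print(tool_name_from_export("workflows/text_to_image.json"))
print(resolve_value(node_inputs, "text"))           # falls back to the node default
print(resolve_value(node_inputs, "text", "a cat"))  # caller-supplied value wins
```

Under these assumptions, renaming the exported file changes the tool name an LLM sees, so descriptive file names are worth choosing before export.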