ComfyUI-IF_AI_tools  by if-ai

ComfyUI nodes for local/API LLMs & LMMs

Created 1 year ago
681 stars

Top 49.7% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides custom nodes for ComfyUI, enabling users to integrate local Large Language Models (LLMs) and Large Multimodal Models (LMMs) directly into their image generation workflows. It targets users seeking to leverage advanced AI capabilities like OCR, RAG, and object detection for prompt enhancement and content creation.

How It Works

The nodes facilitate interaction with various LLM backends, including Ollama, LlamaCPP, LMstudio, TextGen, and Transformers, as well as cloud APIs from providers like OpenAI, Google Gemini, and Anthropic. It supports multimodal inputs and features advanced RAG techniques like nanoGraphRAG and OCR-RAG, alongside object detection with Florence2. Users can also define custom "assistant" characters with specific system prompts and presets.

Quick Start & Requirements

  • Installation: Install via ComfyUI's custom node manager or clone the repository.
  • Prerequisites:
    • Poppler: Required for PDF operations. Install via scoop install poppler (Windows), sudo apt-get install poppler-utils (Debian/Ubuntu), or brew install poppler (macOS).
    • Ollama: Recommended for local LLM execution. Download from ollama.com and run models like ollama run llama3.2-vision.
    • API Keys: Set environment variables (e.g., XAI_API_KEY, GOOGLE_API_KEY) or use a .env file for cloud API access.
  • Setup: Requires ComfyUI and potentially downloading LLM models via Ollama.

Highlighted Details

  • Supports a wide range of local LLM runners and cloud APIs.
  • Integrates novel RAG methods (nanoGraphRAG, OCR-RAG) and object detection (Florence2).
  • Enables creation of custom AI assistants with system prompts and reasoning templates.
  • Offers specialized prompt generation models like llama3_if_ai_sdpromptmkr_q4km.

Maintenance & Community

The project is actively developed, with a stated intention to move prompt generation to a separate repository (ComfyUI-IF_AI_PromptImaGen). Support is available via GitHub starring, YouTube channel (Impact Frames), X (Impact Frames X), Ko-fi, and Patreon.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is marked with a TODO list indicating ongoing development, including bug fixes for the latest ComfyUI versions and improvements to the Graph Visualizer and IF_Assistants nodes. Frontend development for assistants and chat is also planned.

Health Check
Last Commit

3 days ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
52
Star History
15 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.