ComfyUI-IF_AI_tools  by if-ai

ComfyUI nodes for local/API LLMs & LMMs

created 1 year ago
660 stars

Top 51.7% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides custom nodes for ComfyUI, enabling users to integrate local Large Language Models (LLMs) and Large Multimodal Models (LMMs) directly into their image generation workflows. It targets users seeking to leverage advanced AI capabilities like OCR, RAG, and object detection for prompt enhancement and content creation.

How It Works

The nodes facilitate interaction with various LLM backends, including Ollama, LlamaCPP, LMstudio, TextGen, and Transformers, as well as cloud APIs from providers like OpenAI, Google Gemini, and Anthropic. It supports multimodal inputs and features advanced RAG techniques like nanoGraphRAG and OCR-RAG, alongside object detection with Florence2. Users can also define custom "assistant" characters with specific system prompts and presets.

Quick Start & Requirements

  • Installation: Install via ComfyUI's custom node manager or clone the repository.
  • Prerequisites:
    • Poppler: Required for PDF operations. Install via scoop install poppler (Windows), sudo apt-get install poppler-utils (Debian/Ubuntu), or brew install poppler (macOS).
    • Ollama: Recommended for local LLM execution. Download from ollama.com and run models like ollama run llama3.2-vision.
    • API Keys: Set environment variables (e.g., XAI_API_KEY, GOOGLE_API_KEY) or use a .env file for cloud API access.
  • Setup: Requires ComfyUI and potentially downloading LLM models via Ollama.

Highlighted Details

  • Supports a wide range of local LLM runners and cloud APIs.
  • Integrates novel RAG methods (nanoGraphRAG, OCR-RAG) and object detection (Florence2).
  • Enables creation of custom AI assistants with system prompts and reasoning templates.
  • Offers specialized prompt generation models like llama3_if_ai_sdpromptmkr_q4km.

Maintenance & Community

The project is actively developed, with a stated intention to move prompt generation to a separate repository (ComfyUI-IF_AI_PromptImaGen). Support is available via GitHub starring, YouTube channel (Impact Frames), X (Impact Frames X), Ko-fi, and Patreon.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is marked with a TODO list indicating ongoing development, including bug fixes for the latest ComfyUI versions and improvements to the Graph Visualizer and IF_Assistants nodes. Frontend development for assistants and chat is also planned.

Health Check
Last commit

4 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
35 stars in the last 90 days

Explore Similar Projects

Starred by Jared Palmer Jared Palmer(Ex-VP of AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
1 more.

promptable by cfortuner

0%
2k
TS/JS library for building full-stack AI apps
created 2 years ago
updated 2 years ago
Feedback? Help us improve.