ComfyUI extension for advanced prompt/image processing
Top 98.6% on sourcepulse
This ComfyUI extension provides advanced prompt generation and image analysis capabilities, targeting users who want to enhance their AI image creation workflows. It offers nodes for GPT-4 powered text generation, image description via GPT-4 Vision, local LLM integration with Ollama, and sophisticated prompt structuring with dynamic category-based generation.
How It Works
The extension introduces several custom nodes. PromptGenerator
and APNextNode
allow for structured and randomized prompt creation, pulling elements from user-defined JSON files organized into categories. GPT4VisionNode
leverages GPT-4 Vision to analyze images and generate detailed descriptions, with options for output detail and length. GPT4MiniNode
and OllamaNode
provide text generation capabilities using OpenAI's GPT-4 and local Ollama models, respectively, supporting custom base prompts and output formatting. A PGSD3LatentGenerator
is also included for Stable Diffusion 3 latent creation.
Quick Start & Requirements
GPT4VisionNode
and GPT4MiniNode
. Set as an environment variable: OPENAI_API_KEY=sk-your-api-key-here
.OllamaNode
.comfyui_dagthomas/data/next/[CATEGORY_NAME]/
for APNextNode
customization.apntest.json
.Highlighted Details
APNextNode
allows users to define custom categories and fields via JSON files, enabling highly flexible and repeatable prompt construction.OllamaNode
allows integration with local language models, offering an alternative to cloud-based APIs.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
3 months ago
Inactive