Discover and explore top open-source AI tools and projects—updated daily.
dagthomasComfyUI extension for advanced prompt/image processing
Top 95.1% on SourcePulse
This ComfyUI extension provides advanced prompt generation and image analysis capabilities, targeting users who want to enhance their AI image creation workflows. It offers nodes for GPT-4 powered text generation, image description via GPT-4 Vision, local LLM integration with Ollama, and sophisticated prompt structuring with dynamic category-based generation.
How It Works
The extension introduces several custom nodes. PromptGenerator and APNextNode allow for structured and randomized prompt creation, pulling elements from user-defined JSON files organized into categories. GPT4VisionNode leverages GPT-4 Vision to analyze images and generate detailed descriptions, with options for output detail and length. GPT4MiniNode and OllamaNode provide text generation capabilities using OpenAI's GPT-4 and local Ollama models, respectively, supporting custom base prompts and output formatting. A PGSD3LatentGenerator is also included for Stable Diffusion 3 latent creation.
Quick Start & Requirements
GPT4VisionNode and GPT4MiniNode. Set as an environment variable: OPENAI_API_KEY=sk-your-api-key-here.OllamaNode.comfyui_dagthomas/data/next/[CATEGORY_NAME]/ for APNextNode customization.apntest.json.Highlighted Details
APNextNode allows users to define custom categories and fields via JSON files, enabling highly flexible and repeatable prompt construction.OllamaNode allows integration with local language models, offering an alternative to cloud-based APIs.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 week ago
1 day
lucidrains