ComfyUI extension for image captioning and tagging workflows
This repository provides ComfyUI nodes for image captioning and prompt generation, aimed at users of Stable Diffusion and similar generative AI models. It integrates several models, including Joy_caption, MiniCPMv2_6, and Florence-2, and supports batch processing of images.
How It Works
The project offers ComfyUI nodes that use existing models for image analysis and text generation. It supports Joy_caption for detailed image descriptions, MiniCPMv2_6 for prompt generation, and Florence-2 for general captioning and prompt engineering. Because each model is exposed as its own node, users can combine models in one workflow. The stated aim is faster processing and higher-quality output than a single-model setup.
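As a rough illustration of what "a ComfyUI node" means here, the sketch below follows the usual ComfyUI custom-node convention (an INPUT_TYPES classmethod, RETURN_TYPES, FUNCTION, CATEGORY, and a NODE_CLASS_MAPPINGS dictionary). The class name, category, and string-cleaning logic are hypothetical and are not taken from this repository. A real captioning node would call a model where this one only reshapes text.

```python
# Hypothetical node for illustration only; not part of the repository.
class CaptionToPromptSketch:
    """Turns a caption string into a comma-separated prompt.

    Stands in for a node that would normally call a captioning model.
    """

    CATEGORY = "sketch/captioning"   # menu location in ComfyUI (assumed name)
    RETURN_TYPES = ("STRING",)       # one string output socket
    RETURN_NAMES = ("prompt",)
    FUNCTION = "build_prompt"        # method ComfyUI calls when the node runs

    @classmethod
    def INPUT_TYPES(cls):
        # Declares the input sockets/widgets the node exposes.
        return {
            "required": {
                "caption": ("STRING", {"multiline": True, "default": ""}),
                "prefix": ("STRING", {"default": "masterpiece"}),
            }
        }

    def build_prompt(self, caption, prefix):
        # Split the caption on sentence and comma boundaries, drop empties.
        parts = [p.strip() for p in caption.replace(".", ",").split(",")]
        parts = [p for p in parts if p]
        if prefix.strip():
            parts.insert(0, prefix.strip())
        # ComfyUI expects a tuple matching RETURN_TYPES.
        return (", ".join(parts),)


# ComfyUI discovers custom nodes through these module-level mappings.
NODE_CLASS_MAPPINGS = {"CaptionToPromptSketch": CaptionToPromptSketch}
NODE_DISPLAY_NAME_MAPPINGS = {"CaptionToPromptSketch": "Caption to Prompt (sketch)"}
```

In a workflow, the string output of a node like this could be wired into a text-encode node, which is how captioning nodes and prompt nodes are typically chained.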
Quick Start & Requirements
Install the dependencies with python -m pip install -r requirements.txt, or run install_req.bat.
Make sure the transformers library is up-to-date.
Some models must be downloaded manually (models\Joy_caption_alpha, clip/siglip-so400m-patch14-384, LLM/Meta-Llama-3.1-8B-bnb-4bit).
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project relies on external model downloads, some of which require manual intervention. Specific version requirements for dependencies such as transformers are noted, but compatibility with different ComfyUI versions is not detailed.
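Since the models are downloaded by hand and the transformers version matters, a small pre-flight check can catch a missing folder before a workflow fails. The sketch below is one way to do that. The relative paths are the ones listed under Quick Start, but the base directory they hang off is an assumption (the summary does not say), and the helper names are hypothetical.

```python
from importlib import metadata
from pathlib import Path

# Paths as listed in Quick Start, written with forward slashes.
# Which directory they are relative to is an assumption left to the caller.
EXPECTED_MODEL_DIRS = [
    "models/Joy_caption_alpha",
    "clip/siglip-so400m-patch14-384",
    "LLM/Meta-Llama-3.1-8B-bnb-4bit",
]


def missing_model_dirs(base_dir, relative_paths=EXPECTED_MODEL_DIRS):
    """Return the expected model folders that do not exist under base_dir."""
    base = Path(base_dir)
    return [rel for rel in relative_paths if not (base / rel).is_dir()]


def installed_version(package="transformers"):
    """Return the installed version string of a package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def preflight_report(base_dir):
    """Collect human-readable warnings; an empty list means nothing was flagged."""
    warnings = [f"missing model folder: {rel}" for rel in missing_model_dirs(base_dir)]
    if installed_version("transformers") is None:
        warnings.append("transformers is not installed")
    return warnings
```

Calling preflight_report with the ComfyUI models directory (or wherever the folders were placed) returns a list of warnings to print. The check only confirms that transformers is installed, because the exact required version is not stated here.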