taggui by jhc13

Desktop app for image dataset tagging/captioning, targeting generative AI

Created 2 years ago

1,237 stars

Top 31.8% on SourcePulse

Project Summary

TagGUI is a cross-platform desktop application designed for efficient management and captioning of image datasets, primarily targeting users creating data for generative AI models like Stable Diffusion. It offers a keyboard-friendly interface, tag autocompletion, integrated Stable Diffusion token counting, and automatic caption/tag generation using various models.

How It Works

TagGUI operates as a desktop application, allowing users to load directories of images. Tags are managed via associated .txt files, with changes automatically saved. A key feature is its auto-captioning capability, leveraging models like CogVLM, LLaVA, and WD Tagger. Users can select images, choose a captioning model, and provide prompts with template variables for dynamic content insertion. GPU acceleration is supported for NVIDIA hardware, with CPU fallback available.
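The sidecar-file idea described above can be sketched roughly as follows. This is a minimal illustration, not TagGUI's actual code: it assumes the .txt file shares the image's base name and holds comma-separated tags, and the helper names (`tags_path_for`, `load_tags`, `save_tags`, `fill_prompt`) and the `{name}` / `{tags}` template variables are hypothetical stand-ins.

```python
from pathlib import Path


def tags_path_for(image_path: Path) -> Path:
    """Return the sidecar .txt path next to the image (assumed naming scheme)."""
    return image_path.with_suffix(".txt")


def load_tags(image_path: Path, separator: str = ",") -> list[str]:
    """Read tags for an image; a missing sidecar file means no tags yet."""
    path = tags_path_for(image_path)
    if not path.exists():
        return []
    text = path.read_text(encoding="utf-8")
    return [tag.strip() for tag in text.split(separator) if tag.strip()]


def save_tags(image_path: Path, tags: list[str], separator: str = ",") -> None:
    """Write the tags back to the sidecar file, joined by the separator."""
    tags_path_for(image_path).write_text(
        (separator + " ").join(tags), encoding="utf-8"
    )


def fill_prompt(template: str, image_path: Path, tags: list[str]) -> str:
    """Substitute the hypothetical {name} and {tags} variables in a prompt."""
    return template.replace("{name}", image_path.stem).replace(
        "{tags}", ", ".join(tags)
    )
```

Calling `save_tags` after every edit is one simple way to get the "changes automatically saved" behavior; the real application may batch or debounce writes differently.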

Quick Start & Requirements

Installation: Download the latest release for your OS from the releases page, extract it, and run the executable.
Prerequisites:
- Python 3.12 (recommended) or 3.11, when installing manually.
- macOS users must install manually, since no official macOS release is provided.
- Linux users might require libxcb-cursor0, and potentially python3.12-dev or python3.11-dev for CogVLM2.
- An NVIDIA GPU with CUDA support is required for GPU-accelerated captioning; captioning can otherwise fall back to the CPU.
Manual installation: Clone the repository and install the dependencies from requirements.txt.
Links: Releases page: https://github.com/jhc13/taggui/releases
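Before a manual installation, the prerequisites above could be checked with a small preflight script along these lines. It is only a sketch: the function names are hypothetical, and the GPU check assumes the captioning models run on PyTorch, which this summary does not state.

```python
import sys

# Python versions named in the prerequisites (3.12 recommended, 3.11 also listed).
SUPPORTED_VERSIONS = {(3, 11), (3, 12)}


def python_version_supported(version=None) -> bool:
    """Check a (major, minor, ...) tuple; defaults to the running interpreter."""
    major, minor = (version or sys.version_info)[:2]
    return (major, minor) in SUPPORTED_VERSIONS


def cuda_available() -> bool:
    """Report whether PyTorch sees a CUDA device; False if torch is not installed.

    Assumption: GPU-accelerated captioning goes through PyTorch.
    """
    try:
        import torch
    except ImportError:
        return False
    return torch.cuda.is_available()


if __name__ == "__main__":
    print("Python version supported:", python_version_supported())
    print("CUDA available:", cuda_available())
```

A `False` from `cuda_available` does not block captioning according to the summary above; it only means the CPU fallback would be used.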

Highlighted Details

Supports automatic caption/tag generation with models including CogVLM, LLaVA, and WD Tagger.
Features advanced image list filtering with support for tag, caption, name, path, character count, and token count criteria, combinable with logical operators (AND, OR, NOT) and wildcards.
Keyboard-centric interface for rapid tagging, including tag autocompletion and batch operations.
Integrated Stable Diffusion token counter.
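To make the filtering feature above more concrete, here is a rough sketch of combining criteria with AND, OR, and NOT plus wildcard matching. It does not reproduce TagGUI's filter syntax or implementation; `ImageEntry` and every helper below are hypothetical, and wildcards are handled with the standard library's `fnmatch` patterns.

```python
from dataclasses import dataclass, field
from fnmatch import fnmatchcase
from typing import Callable


@dataclass
class ImageEntry:
    """Hypothetical stand-in for one image and its tags."""
    name: str
    tags: list[str] = field(default_factory=list)

    @property
    def caption(self) -> str:
        return ", ".join(self.tags)


Predicate = Callable[[ImageEntry], bool]


def tag(pattern: str) -> Predicate:
    """Match images with at least one tag fitting the wildcard pattern."""
    return lambda image: any(fnmatchcase(t, pattern) for t in image.tags)


def name(pattern: str) -> Predicate:
    """Match images whose file name fits the wildcard pattern."""
    return lambda image: fnmatchcase(image.name, pattern)


def chars_at_most(limit: int) -> Predicate:
    """Match images whose caption has at most `limit` characters."""
    return lambda image: len(image.caption) <= limit


def all_of(*predicates: Predicate) -> Predicate:
    """Logical AND over the given criteria."""
    return lambda image: all(p(image) for p in predicates)


def any_of(*predicates: Predicate) -> Predicate:
    """Logical OR over the given criteria."""
    return lambda image: any(p(image) for p in predicates)


def negate(predicate: Predicate) -> Predicate:
    """Logical NOT of a criterion."""
    return lambda image: not predicate(image)


def filter_images(images: list[ImageEntry], predicate: Predicate) -> list[ImageEntry]:
    """Keep only the images that satisfy the combined predicate."""
    return [image for image in images if predicate(image)]
```

For example, `filter_images(images, all_of(tag("*cat"), negate(tag("indoors"))))` keeps images that have a tag ending in "cat" and no "indoors" tag. A token-count criterion would slot in the same way, given a tokenizer to count with.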

Maintenance & Community

The project appears to be maintained by a single developer, jhc13.
No explicit community links (Discord, Slack) or roadmap are provided in the README.

Licensing & Compatibility

The README does not explicitly state a license. The repository's license file should be consulted for details.

Limitations & Caveats

No official macOS release is provided due to lack of testing hardware.
CogVLM2 may require the Python development headers (python3.12-dev or python3.11-dev) on Linux.
The README gives no performance benchmarks or detailed hardware requirements for auto-captioning, beyond noting that GPU acceleration needs an NVIDIA GPU with CUDA support.

Health Check

Last Commit: 3 months ago
Responsiveness: 1 day
Pull Requests (30d): 0
Issues (30d): 2

Star History

33 stars in the last 30 days

Explore Similar Projects

Comfyui_CXH_joy_caption by StartHua

ComfyUI extension for image captioning and tagging workflows

Created 1 year ago

Updated 11 months ago

Semi-Auto-NovelAI-to-Pixiv by zhulinyv

NovelAI tool with WebUI for batch image generation

Created 1 year ago

Updated 2 months ago

breadboard by cocktailpeanut

Desktop app for browsing and managing AI-generated images

Created 3 years ago

Updated 1 year ago

GPT4V-Image-Captioner by jiayev

Image processing toolbox for image tagging

Created 2 years ago

Updated 1 year ago

DiffusionToolkit by RupertAvery

Image metadata indexer and viewer for AI-generated content

Created 3 years ago

Updated 5 days ago

danbooru-diffusion-prompt-builder by wfjsw

Tool for browsing Danbooru/NovelAI tags and embeddings

Created 3 years ago

Updated 1 year ago

PaddleMIX by PaddlePaddle

Multimodal toolkit for diverse AI tasks

Created 2 years ago

Updated 1 month ago

stable-diffusion-webui-wd14-tagger by kawalain

WebUI extension for image tagging

Created 3 years ago

Updated 2 years ago

Starred by Max Howell (Author of Homebrew), Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), and 1 more.

big-sleep by lucidrains

CLI tool for text-to-image generation

Created 5 years ago

Updated 3 years ago

Starred by Jesse Clark (Cofounder of Marqo) and Patrick von Platen (Author of Hugging Face Diffusers; Research Engineer at Mistral).

recognize-anything by xinyu1205

Image tagging models for common/open-set categories and comprehensive captioning

Created 2 years ago

Updated 10 months ago

a1111-sd-webui-tagcomplete by DominikDoom

Tag autocompletion for Stable Diffusion WebUI

Created 3 years ago

Updated 2 months ago

Starred by Deepak Pathak (Cofounder of Skild AI; Professor at CMU), Travis Fischer (Founder of Agentic), and 8 more.

sygil-webui by Sygil-Dev

Web UI for Stable Diffusion

Created 3 years ago

Updated 1 month ago
