taggui  by jhc13

Desktop app for image dataset tagging/captioning, targeting generative AI

created 2 years ago
1,071 stars

Top 35.9% on sourcepulse

GitHubView on GitHub
Project Summary

TagGUI is a cross-platform desktop application designed for efficient management and captioning of image datasets, primarily targeting users creating data for generative AI models like Stable Diffusion. It offers a keyboard-friendly interface, tag autocompletion, integrated Stable Diffusion token counting, and automatic caption/tag generation using various models.

How It Works

TagGUI operates as a desktop application, allowing users to load directories of images. Tags are managed via associated .txt files, with changes automatically saved. A key feature is its auto-captioning capability, leveraging models like CogVLM, LLaVA, and WD Tagger. Users can select images, choose a captioning model, and provide prompts with template variables for dynamic content insertion. GPU acceleration is supported for NVIDIA hardware, with CPU fallback available.

Quick Start & Requirements

  • Installation: Download the latest release executable for your OS from the releases page. Extract and run.
  • Prerequisites:
    • Python 3.12 (recommended) or 3.11.
    • macOS users may need manual installation.
    • Linux users might require libxcb-cursor0 and potentially python3.12-dev or python3.11-dev for CogVLM2.
    • NVIDIA GPU with CUDA support is required for GPU-accelerated captioning.
  • Setup: Download and extract. Manual installation involves cloning the repo and installing from requirements.txt.
  • Links: Releases page: https://github.com/jhc13/taggui/releases

Highlighted Details

  • Supports automatic caption/tag generation with models including CogVLM, LLaVA, and WD Tagger.
  • Features advanced image list filtering with support for tag, caption, name, path, character count, and token count criteria, combinable with logical operators (AND, OR, NOT) and wildcards.
  • Keyboard-centric interface for rapid tagging, including tag autocompletion and batch operations.
  • Integrated Stable Diffusion token counter.

Maintenance & Community

  • The project appears to be maintained by a single developer, jhc13.
  • No explicit community links (Discord, Slack) or roadmap are provided in the README.

Licensing & Compatibility

  • The README does not explicitly state a license. The repository's license file should be consulted for details.

Limitations & Caveats

  • No official macOS release is provided due to lack of testing hardware.
  • Some advanced model features might require specific Python development headers on Linux.
  • The README does not mention specific performance benchmarks or detailed hardware requirements for auto-captioning beyond GPU necessity.
Health Check
Last commit

2 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
2
Star History
95 stars in the last 90 days

Explore Similar Projects

Starred by Max Howell Max Howell(Author of Homebrew) and Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers).

kohya_ss by bmaltais

0.3%
11k
GUI for Stable Diffusion training scripts
created 2 years ago
updated 1 week ago
Feedback? Help us improve.