taggui  by jhc13

Desktop app for image dataset tagging/captioning, targeting generative AI

Created 2 years ago
1,115 stars

Top 34.3% on SourcePulse

GitHubView on GitHub
Project Summary

TagGUI is a cross-platform desktop application designed for efficient management and captioning of image datasets, primarily targeting users creating data for generative AI models like Stable Diffusion. It offers a keyboard-friendly interface, tag autocompletion, integrated Stable Diffusion token counting, and automatic caption/tag generation using various models.

How It Works

TagGUI operates as a desktop application, allowing users to load directories of images. Tags are managed via associated .txt files, with changes automatically saved. A key feature is its auto-captioning capability, leveraging models like CogVLM, LLaVA, and WD Tagger. Users can select images, choose a captioning model, and provide prompts with template variables for dynamic content insertion. GPU acceleration is supported for NVIDIA hardware, with CPU fallback available.

Quick Start & Requirements

  • Installation: Download the latest release executable for your OS from the releases page. Extract and run.
  • Prerequisites:
    • Python 3.12 (recommended) or 3.11.
    • macOS users may need manual installation.
    • Linux users might require libxcb-cursor0 and potentially python3.12-dev or python3.11-dev for CogVLM2.
    • NVIDIA GPU with CUDA support is required for GPU-accelerated captioning.
  • Setup: Download and extract. Manual installation involves cloning the repo and installing from requirements.txt.
  • Links: Releases page: https://github.com/jhc13/taggui/releases

Highlighted Details

  • Supports automatic caption/tag generation with models including CogVLM, LLaVA, and WD Tagger.
  • Features advanced image list filtering with support for tag, caption, name, path, character count, and token count criteria, combinable with logical operators (AND, OR, NOT) and wildcards.
  • Keyboard-centric interface for rapid tagging, including tag autocompletion and batch operations.
  • Integrated Stable Diffusion token counter.

Maintenance & Community

  • The project appears to be maintained by a single developer, jhc13.
  • No explicit community links (Discord, Slack) or roadmap are provided in the README.

Licensing & Compatibility

  • The README does not explicitly state a license. The repository's license file should be consulted for details.

Limitations & Caveats

  • No official macOS release is provided due to lack of testing hardware.
  • Some advanced model features might require specific Python development headers on Linux.
  • The README does not mention specific performance benchmarks or detailed hardware requirements for auto-captioning beyond GPU necessity.
Health Check
Last Commit

4 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
1
Star History
31 stars in the last 30 days

Explore Similar Projects

Starred by Max Howell Max Howell(Author of Homebrew), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
1 more.

big-sleep by lucidrains

0%
3k
CLI tool for text-to-image generation
Created 4 years ago
Updated 3 years ago
Starred by Deepak Pathak Deepak Pathak(Cofounder of Skild AI; Professor at CMU), Travis Fischer Travis Fischer(Founder of Agentic), and
8 more.

sygil-webui by Sygil-Dev

0.0%
8k
Web UI for Stable Diffusion
Created 3 years ago
Updated 2 months ago
Feedback? Help us improve.