HCP-Diffusion  by IrisRainbowNeko

Universal Stable Diffusion toolbox

created 2 years ago
896 stars

Top 41.3% on sourcepulse

GitHubView on GitHub
Project Summary

HCP-Diffusion is a comprehensive toolbox for diffusion models, targeting researchers and power users who need a flexible and extensible framework for training and experimentation. It simplifies complex workflows by allowing users to define and combine various training techniques, such as LoRA, DreamBooth, and ControlNet, within a single Python configuration file.

How It Works

The framework utilizes the RainbowNeko Engine, which processes Python-based configuration files. This approach allows for direct function and class calls within configurations, enabling inheritance and dynamic instantiation of components. This design promotes extensibility and user-friendliness, simplifying the management of diverse training methodologies and model architectures.

Quick Start & Requirements

  • Install via pip: pip install hcpdiff
  • Initialize configuration: hcpinit
  • Install from source: git clone https://github.com/7eu7d7/HCP-Diffusion.git && cd HCP-Diffusion && pip install -e .
  • Optional: Install xformers for memory reduction and acceleration.
  • Official documentation: 📘English document

Highlighted Details

  • Supports Stable Diffusion 1.5, SDXL, and PixArt, with FLUX and SD3 in development.
  • Offers extensive fine-tuning capabilities including layer-wise LoRA configuration, multi-token prompt-tuning, and custom optimizers/LR schedulers.
  • Integrates with Hugging Face Accelerate, Colossal-AI, and xFormers for training acceleration.
  • Implements DreamArtist++ for controllable one-shot text-to-image generation with a single image.
  • Supports various dataset features like Aspect Ratio Bucketing and multi-source datasets.

Maintenance & Community

Maintained by HCP-Lab at Sun Yat-sen University.

Licensing & Compatibility

The repository does not explicitly state a license in the README. Users should verify licensing for commercial use or closed-source integration.

Limitations & Caveats

Automatic evaluation metrics like FID and CLIP Score are still in development. Support for webdataset is also in development.

Health Check
Last commit

11 hours ago

Responsiveness

1 week

Pull Requests (30d)
2
Issues (30d)
0
Star History
8 stars in the last 90 days

Explore Similar Projects

Starred by Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake) and Travis Fischer Travis Fischer(Founder of Agentic).

lingua by facebookresearch

0.1%
5k
LLM research codebase for training and inference
created 9 months ago
updated 2 weeks ago
Starred by Logan Kilpatrick Logan Kilpatrick(Product Lead on Google AI Studio), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
3 more.

catalyst by catalyst-team

0%
3k
PyTorch framework for accelerated deep learning R&D
created 7 years ago
updated 1 month ago
Feedback? Help us improve.