HCP-Diffusion by IrisRainbowNeko

Universal Stable Diffusion toolbox

Created 2 years ago

908 stars

Top 40.0% on SourcePulse

Project Summary

HCP-Diffusion is a comprehensive toolbox for diffusion models, targeting researchers and power users who need a flexible and extensible framework for training and experimentation. It simplifies complex workflows by allowing users to define and combine various training techniques, such as LoRA, DreamBooth, and ControlNet, within a single Python configuration file.

How It Works

The framework is built on the RainbowNeko Engine, which reads configuration files written in Python. Because a configuration is ordinary Python, it can call functions and classes directly, inherit from other configurations, and instantiate components dynamically. This keeps the framework extensible and makes it easier to manage many training methods and model architectures in one place.
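As a rough illustration of the idea of Python-based configuration with inheritance and deferred instantiation, the sketch below uses only the standard library. It is not the RainbowNeko Engine API: every name in it (LoraLayer, SGD, base_config, lora_config, build) is hypothetical and chosen for the example.

```python
from dataclasses import dataclass
from functools import partial


@dataclass
class LoraLayer:
    """Hypothetical adapter component, not an hcpdiff class."""
    rank: int
    alpha: float = 1.0


@dataclass
class SGD:
    """Hypothetical stand-in for an optimizer."""
    lr: float
    momentum: float = 0.0


def base_config():
    """Base configuration: entries are deferred calls, not live objects."""
    return {
        "optimizer": partial(SGD, lr=1e-4),
        "adapter": partial(LoraLayer, rank=4),
        "steps": 1000,
    }


def lora_config():
    """Child configuration: inherit the base, then override selected entries."""
    cfg = base_config()
    cfg["adapter"] = partial(LoraLayer, rank=16, alpha=8.0)
    cfg["steps"] = 2000
    return cfg


def build(cfg):
    """Instantiate every deferred entry; plain values pass through unchanged."""
    return {k: v() if isinstance(v, partial) else v for k, v in cfg.items()}


built = build(lora_config())
print(built)
```

Keeping entries as deferred calls means a child configuration can swap a component before anything is constructed, which is one plausible reason to write configurations in Python rather than a static format.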

Quick Start & Requirements

Install via pip: pip install hcpdiff
Initialize configuration: hcpinit
Install from source: git clone https://github.com/7eu7d7/HCP-Diffusion.git && cd HCP-Diffusion && pip install -e .
Optional: Install xformers for memory reduction and acceleration.
Official documentation: 📘English document

Highlighted Details

Supports Stable Diffusion 1.5, SDXL, and PixArt, with FLUX and SD3 in development.
Offers extensive fine-tuning capabilities including layer-wise LoRA configuration, multi-token prompt-tuning, and custom optimizers/LR schedulers.
Integrates with Hugging Face Accelerate, Colossal-AI, and xFormers for training acceleration.
Implements DreamArtist++ for controllable one-shot text-to-image generation from a single reference image.
Supports various dataset features like Aspect Ratio Bucketing and multi-source datasets.
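To make the Aspect Ratio Bucketing feature listed above concrete, here is a minimal sketch of the general technique: each image is assigned to the predefined resolution whose aspect ratio is closest to its own, so that a batch can be drawn from a single bucket without heavy cropping. This is not hcpdiff's implementation; the bucket list and the function names (nearest_bucket, group_by_bucket) are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical bucket resolutions (width, height) with roughly equal area.
BUCKETS = [(512, 512), (576, 448), (448, 576), (640, 384), (384, 640)]


def nearest_bucket(width, height, buckets=BUCKETS):
    """Return the bucket whose aspect ratio is closest to the image's."""
    ratio = width / height
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ratio))


def group_by_bucket(sizes):
    """Map each bucket to the indices of the images assigned to it."""
    groups = defaultdict(list)
    for idx, (w, h) in enumerate(sizes):
        groups[nearest_bucket(w, h)].append(idx)
    return dict(groups)


# Image sizes as (width, height); batches would then be drawn per bucket.
sizes = [(1024, 1024), (1920, 1080), (800, 1200), (1000, 1000)]
groups = group_by_bucket(sizes)
print(groups)
```

A real pipeline would also resize and crop each image to its bucket resolution; that step is omitted here.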

Maintenance & Community

Maintained by HCP-Lab at Sun Yat-sen University.

Licensing & Compatibility

The repository does not explicitly state a license in the README. Users should verify licensing for commercial use or closed-source integration.

Limitations & Caveats

Automatic evaluation metrics like FID and CLIP Score are still in development. Support for webdataset is also in development.

Health Check

Last Commit: 1 week ago
Responsiveness: 1 week
Pull Requests (30d): 0
Issues (30d): 0

Star History

0 stars in the last 30 days

Explore Similar Projects

piecewise-rectified-flow by magic-research

PeRFlow: Plug-and-play accelerator for diffusion models (NeurIPS 2024)

Created 1 year ago

Updated 4 months ago

InstaFlow by gnobitab

One-step image generator using Rectified Flow (ICLR 2024)

Created 2 years ago

Updated 1 year ago

Nemotron by NVIDIA-NeMo

Open models for advanced AI workflows

Created 3 months ago

Updated 4 days ago

Starred by Clement Delangue (Cofounder of Hugging Face) and Luis Capelo (Cofounder of Lightning AI).

refiners by finegrain-ai

Microframework for foundation model adaptation using PyTorch

Created 2 years ago

Updated 3 months ago

Starred by Abubakar Abid (Cofounder of Gradio).

Radiata by ddPn08

WebUI for stable diffusion, built on diffusers

Created 2 years ago

Updated 2 years ago

Starred by Patrick von Platen (Author of Hugging Face Diffusers; Research Engineer at Mistral), Hanlin Tang (CTO Neural Networks at Databricks; Cofounder of MosaicML), and 1 more.

diffusion by mosaicml

Diffusion model training code

Created 2 years ago

Updated 1 year ago

In-Context-LoRA by ali-vilab

IC-LoRA: Diffusion Transformer framework for visual generation tasks

Created 1 year ago

Updated 1 year ago

cube-studio by data-infra

Unified cloud-native AI platform for end-to-end ML workflows

Created 1 year ago

Updated 2 months ago

OneTrainer by Nerogar

Stable Diffusion training suite

Created 2 years ago

Updated 4 days ago

Starred by Kevin Hou (Head of Product Engineering at Windsurf) and Chuan Li (Chief Scientific Officer at Lambda).

sd_dreambooth_extension by d8ahazard

Stable Diffusion WebUI extension for Dreambooth training

Created 3 years ago

Updated 3 months ago

Starred by Lyumin Zhang (Author of ControlNet).

kohya-trainer by Linaqruf

Trainer for Stable Diffusion models, adapted for easier use

Created 3 years ago

Updated 1 year ago

Starred by Jiaming Song (Chief Scientist at Luma AI) and Thierry Moreau (Principal Engineer at NVIDIA; Cofounder of OctoAI).

sd-scripts by kohya-ss

Training/generation scripts for Stable Diffusion models

Created 3 years ago

Updated 3 weeks ago
