kohya-trainer by Linaqruf

Trainer for Stable Diffusion models, adapted for easier use

Created 3 years ago

1,913 stars

Top 22.4% on SourcePulse

1 Expert Loves This Project

Lyumin Zhang

Author of ControlNet

Project Summary

This repository provides a collection of Google Colab notebooks for fine-tuning Stable Diffusion models, with a focus on LoRA and Dreambooth training. It targets users who want to customize image generation models without setting up a local training environment, and it offers a streamlined workflow for preparing custom datasets and training models.

How It Works

The project wraps the kohya-ss/sd-scripts library in user-friendly Colab notebooks. It supports several training techniques, including LoRA (Low-Rank Adaptation) and Dreambooth, and exposes features such as aspect ratio bucketing, extended token lengths, and automatic captioning with BLIP and WD14Tagger. The emphasis is on memory efficiency and flexibility: users can fine-tune models with less VRAM and adjust most training parameters.
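To make aspect ratio bucketing more concrete, here is a minimal sketch of the general idea: images are assigned to a small set of fixed resolutions with similar aspect ratios, so each batch can share one shape without cropping everything to a square. This is an illustration only, not the code used by kohya-ss/sd-scripts. The function names, the 64-pixel step, and the 512x512 area budget are all assumptions chosen for the example.

```python
from collections import defaultdict


def make_buckets(max_area=512 * 512, step=64, min_side=256, max_side=1024):
    """Enumerate (width, height) buckets whose area stays within max_area.

    All defaults are illustrative assumptions, not values taken from any tool.
    """
    buckets = set()
    width = min_side
    while width <= max_side:
        # Largest height (a multiple of step) that keeps the area in budget.
        height = min(max_side, (max_area // width) // step * step)
        if height >= min_side:
            buckets.add((width, height))
            buckets.add((height, width))  # mirrored (portrait/landscape) bucket
        width += step
    return sorted(buckets)


def assign_bucket(width, height, buckets):
    """Pick the bucket whose aspect ratio is closest to the image's ratio."""
    ratio = width / height
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ratio))


def group_by_bucket(image_sizes, buckets):
    """Group image names by assigned bucket; image_sizes maps name -> (w, h)."""
    groups = defaultdict(list)
    for name, (w, h) in image_sizes.items():
        groups[assign_bucket(w, h, buckets)].append(name)
    return dict(groups)


if __name__ == "__main__":
    buckets = make_buckets()
    sizes = {"a.png": (1920, 1080), "b.png": (1280, 720), "c.png": (800, 800)}
    for bucket, names in group_by_bucket(sizes, buckets).items():
        print(bucket, names)
```

In this sketch the two 16:9 images land in the same landscape bucket and the square image lands in the 512x512 bucket, so each group could be resized and batched together.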

Quick Start & Requirements

Install/Run: Primarily used within Google Colab notebooks.
Prerequisites: Google Colab environment, GPU (T4 recommended), sufficient Google Drive storage for datasets and models.
Setup: Minimal; mostly a matter of running the notebook cells in order. Documentation and examples are available in the README.

Highlighted Details

Comprehensive support for LoRA and for LyCORIS network types such as LoCon and LoHa.
Advanced dataset preparation tools including image scraping, recursive captioning, and tag management.
Flexible training configurations with support for various optimizers (AdamW, Lion, DAdaptation) and learning rate schedulers.
Integration with Hugging Face Hub for model uploading and management.
Built-in support for Cagliostro Colab UI for a graphical interface.
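As background for the LoRA network type listed above, the sketch below shows the standard low-rank update: the frozen weight W is combined with a trained product of two small matrices, W + (alpha / rank) * up @ down. It is a pure-Python illustration of the arithmetic, not code from this repository or from kohya-ss/sd-scripts; the names `down`, `up`, and `alpha` and both helper functions are assumptions made for the example.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(weight, down, up, alpha):
    """Return W + (alpha / rank) * (up @ down).

    weight: out_features x in_features (frozen base weight)
    down:   rank x in_features         (trained)
    up:     out_features x rank        (trained)
    """
    rank = len(down)
    scale = alpha / rank
    delta = matmul(up, down)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


def lora_param_count(in_features, out_features, rank):
    """Number of trainable values in the two low-rank factors."""
    return rank * (in_features + out_features)


if __name__ == "__main__":
    weight = [[1, 0], [0, 1]]
    down = [[1, 2]]    # rank 1, in_features 2
    up = [[1], [0]]    # out_features 2, rank 1
    print(lora_effective_weight(weight, down, up, alpha=1))
    # Trainable values for a 768x768 layer: full weight vs. rank-8 factors.
    print(768 * 768, lora_param_count(768, 768, rank=8))
```

The parameter count is the reason this approach needs less memory: for a 768x768 layer, the rank-8 factors hold 12,288 trainable values instead of 589,824.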

Maintenance & Community

The project is actively maintained, with frequent updates that track changes in the underlying kohya-ss/sd-scripts. Community support and discussion most likely happen through GitHub issues; the README does not link a Discord or Slack channel.

Licensing & Compatibility

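The repository's licensing is not explicitly stated in the provided README. However, it is based on kohya-ss/sd-scripts, whose scripts are mostly licensed under Apache 2.0, with some bundled components under other permissive licenses such as MIT. These licenses generally allow commercial use and integration with closed-source projects, but the license file of each repository should be checked before relying on this.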

Limitations & Caveats

The project is heavily reliant on the Google Colab environment, which may have usage limits or require paid tiers for extended or intensive training. Some advanced features or optimizers might require significant VRAM, potentially exceeding free-tier Colab capabilities. The README indicates a "burnout phase" at one point, suggesting potential for slower update cycles.

Explore Similar Projects

naifu by Mikubill

ddpo-pytorch by kvablack

diffusion by mosaicml

BLIP3o by JiuhaiChen

finetrainers by huggingface

sd_dreambooth_extension by d8ahazard

Stable-Diffusion by FurkanGozukara

lora by cloneofsimo

Dreambooth-Stable-Diffusion by JoePenna

sd-scripts by kohya-ss

ai-toolkit by ostris

stable-diffusion-webui-colab by camenduru