ControlLoRA by HighCWu

Lightweight network to control Stable Diffusion spatial information

Created 3 years ago

621 stars

Top 53.1% on SourcePulse

Project Summary

ControlLoRA offers a lightweight solution for controlling spatial information in Stable Diffusion models, targeting users who find existing methods like ControlNet too large or cumbersome. By integrating LoRA's parameter-efficient fine-tuning approach, it enables users to achieve fine-grained spatial control with significantly smaller model sizes (~5-7M parameters, ~20-25M storage), facilitating easier sharing and experimentation.

How It Works

ControlLoRA combines the architectural principles of ControlNet with the parameter-efficient fine-tuning of LoRA. It decomposes prompt features and spatial information into a smaller network, allowing for flexible inference across different Stable Diffusion checkpoints. This approach reduces the overhead associated with larger control mechanisms, making it more accessible for users with limited resources or those who frequently switch between models.

Quick Start & Requirements

Install via pip.
Requires Python.
Pretrained models are available on Hugging Face.
Gradio apps are provided for trying pretrained models.
Refer to the tasks directory for training scripts, with strong recommendations to use the diffusers library for training code.
Official quick-start and demo links are available within the repository's apps directory.

Highlighted Details

Significantly smaller parameter count (~5-7M) and storage footprint (~20-25M) compared to ControlNet.
Enables cross-compatibility for inference (e.g., train on SD v1.5, infer on Anything v3.0).
Supports custom model architectures via configuration files in the configs directory.
Work-in-progress includes mixing LoRA and ControlLoRA, and expanding task types.

Maintenance & Community

QQ Group: 艾梦的小群
QQ Channel: 艾梦的AI造梦堂
Discord: AI Players - AI Dream Bakery

Licensing & Compatibility

The README does not explicitly state a license. Users should verify licensing for commercial use or closed-source linking.

Limitations & Caveats

The README notes that the provided OpenPose model has suboptimal performance due to training on only 100 MPII images, and suggests users train their own ControlLoRA models for better results. The Gradio UI for pose manipulation is also noted as difficult to customize.

ControlLoRA by HighCWu

Explore Similar Projects

CCSR by csslc

DeepCache by horseee

ddpo-pytorch by kvablack

Radiata by ddPn08

Stable-Diffusion-Android by ShiftHackZ

fastsdcpu by rupeshs

CatVTON by Zheng-Chong

kohya-trainer by Linaqruf

ControlNet-v1-1-nightly by lllyasviel

lora by cloneofsimo

ai-toolkit by ostris

ControlNet by lllyasviel