ControlLoRA  by HighCWu

Lightweight network to control Stable Diffusion spatial information

Created 2 years ago
608 stars

Top 54.0% on SourcePulse

GitHubView on GitHub
Project Summary

ControlLoRA offers a lightweight solution for controlling spatial information in Stable Diffusion models, targeting users who find existing methods like ControlNet too large or cumbersome. By integrating LoRA's parameter-efficient fine-tuning approach, it enables users to achieve fine-grained spatial control with significantly smaller model sizes (~5-7M parameters, ~20-25M storage), facilitating easier sharing and experimentation.

How It Works

ControlLoRA combines the architectural principles of ControlNet with the parameter-efficient fine-tuning of LoRA. It decomposes prompt features and spatial information into a smaller network, allowing for flexible inference across different Stable Diffusion checkpoints. This approach reduces the overhead associated with larger control mechanisms, making it more accessible for users with limited resources or those who frequently switch between models.

Quick Start & Requirements

  • Install via pip.
  • Requires Python.
  • Pretrained models are available on Hugging Face.
  • Gradio apps are provided for trying pretrained models.
  • Refer to the tasks directory for training scripts, with strong recommendations to use the diffusers library for training code.
  • Official quick-start and demo links are available within the repository's apps directory.

Highlighted Details

  • Significantly smaller parameter count (~5-7M) and storage footprint (~20-25M) compared to ControlNet.
  • Enables cross-compatibility for inference (e.g., train on SD v1.5, infer on Anything v3.0).
  • Supports custom model architectures via configuration files in the configs directory.
  • Work-in-progress includes mixing LoRA and ControlLoRA, and expanding task types.

Maintenance & Community

  • QQ Group: 艾梦的小群
  • QQ Channel: 艾梦的AI造梦堂
  • Discord: AI Players - AI Dream Bakery

Licensing & Compatibility

  • The README does not explicitly state a license. Users should verify licensing for commercial use or closed-source linking.

Limitations & Caveats

The README notes that the provided OpenPose model has suboptimal performance due to training on only 100 MPII images, and suggests users train their own ControlLoRA models for better results. The Gradio UI for pose manipulation is also noted as difficult to customize.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
4 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
5 more.

ai-toolkit by ostris

0.9%
6k
Training toolkit for finetuning diffusion models
Created 2 years ago
Updated 14 hours ago
Feedback? Help us improve.