ControlLoRA  by HighCWu

Lightweight network to control Stable Diffusion spatial information

created 2 years ago
603 stars

Top 55.0% on sourcepulse

GitHubView on GitHub
Project Summary

ControlLoRA offers a lightweight solution for controlling spatial information in Stable Diffusion models, targeting users who find existing methods like ControlNet too large or cumbersome. By integrating LoRA's parameter-efficient fine-tuning approach, it enables users to achieve fine-grained spatial control with significantly smaller model sizes (~5-7M parameters, ~20-25M storage), facilitating easier sharing and experimentation.

How It Works

ControlLoRA combines the architectural principles of ControlNet with the parameter-efficient fine-tuning of LoRA. It decomposes prompt features and spatial information into a smaller network, allowing for flexible inference across different Stable Diffusion checkpoints. This approach reduces the overhead associated with larger control mechanisms, making it more accessible for users with limited resources or those who frequently switch between models.

Quick Start & Requirements

  • Install via pip.
  • Requires Python.
  • Pretrained models are available on Hugging Face.
  • Gradio apps are provided for trying pretrained models.
  • Refer to the tasks directory for training scripts, with strong recommendations to use the diffusers library for training code.
  • Official quick-start and demo links are available within the repository's apps directory.

Highlighted Details

  • Significantly smaller parameter count (~5-7M) and storage footprint (~20-25M) compared to ControlNet.
  • Enables cross-compatibility for inference (e.g., train on SD v1.5, infer on Anything v3.0).
  • Supports custom model architectures via configuration files in the configs directory.
  • Work-in-progress includes mixing LoRA and ControlLoRA, and expanding task types.

Maintenance & Community

  • QQ Group: 艾梦的小群
  • QQ Channel: 艾梦的AI造梦堂
  • Discord: AI Players - AI Dream Bakery

Licensing & Compatibility

  • The README does not explicitly state a license. Users should verify licensing for commercial use or closed-source linking.

Limitations & Caveats

The README notes that the provided OpenPose model has suboptimal performance due to training on only 100 MPII images, and suggests users train their own ControlLoRA models for better results. The Gradio UI for pose manipulation is also noted as difficult to customize.

Health Check
Last commit

1 year ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
11 stars in the last 90 days

Explore Similar Projects

Starred by Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

HALOs by ContextualAI

0.3%
873
Library for aligning LLMs using human-aware loss functions
created 1 year ago
updated 2 weeks ago
Feedback? Help us improve.