ControlNet by lllyasviel

Neural network structure for adding conditional control to diffusion models

created 2 years ago
32,820 stars

Top 1.1% on sourcepulse

Project Summary

ControlNet provides a method to add conditional control to diffusion models, enabling fine-grained manipulation of image generation based on various inputs like edge maps, depth maps, or human poses. It's designed for researchers and artists looking to precisely guide text-to-image synthesis without compromising pre-trained diffusion models.

How It Works

ControlNet achieves control by duplicating the diffusion model's weights into a "locked" copy (the original, kept frozen) and a "trainable" copy. The trainable copy receives the conditioning input and connects back to the locked model through "zero convolution" layers: convolutions whose weights and biases are initialized to zero, so the trainable branch contributes nothing at the start of training and cannot distort the pretrained model. This architecture allows training on small datasets while preserving the integrity of the original, powerful diffusion model backbone.
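The zero-convolution idea can be sketched minimally in NumPy (a conceptual illustration, not ControlNet's actual PyTorch code): a 1x1 convolution whose weights and biases start at zero outputs zeros, so adding the trainable branch's output to the locked branch leaves the backbone's features unchanged at initialization.

```python
import numpy as np

def zero_conv(x, weight, bias):
    # 1x1 convolution over the channel dimension:
    # x is (C_in, H, W), weight is (C_out, C_in), bias is (C_out,)
    return np.tensordot(weight, x, axes=([1], [0])) + bias[:, None, None]

C, H, W = 4, 8, 8
x = np.random.randn(C, H, W)

# At initialization, weight and bias are all zeros.
w = np.zeros((C, C))
b = np.zeros(C)

locked_out = x                      # stand-in for the frozen backbone's feature map
control_out = zero_conv(x, w, b)    # all zeros at init
combined = locked_out + control_out

# The trainable branch contributes nothing, so the backbone is undistorted.
assert np.allclose(combined, locked_out)
```

During training the zero-conv weights move away from zero, letting the control signal gradually influence the backbone's features.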

Quick Start & Requirements

  • Install via conda env create -f environment.yaml and conda activate control.
  • Requires downloading pretrained models and detectors from Hugging Face.
  • Official implementation supports Stable Diffusion 1.5.
  • Demos available via python gradio_*.py scripts (e.g., gradio_canny2image.py).
  • See Hugging Face page for models and detectors.
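Collected as a setup sequence, using only the commands named above (assumes you run from the repo root and have already fetched the models and detectors):

```shell
# Create and activate the conda environment from the repo's environment.yaml
conda env create -f environment.yaml
conda activate control

# After downloading the pretrained models and detectors from Hugging Face,
# launch one of the Gradio demos, e.g. the Canny-edge demo:
python gradio_canny2image.py
```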

Highlighted Details

  • ControlNet 1.1 released with new models.
  • Supports various conditioning inputs: Canny edges, M-LSD lines, HED boundaries, scribbles, human pose, semantic segmentation, depth maps, and normal maps.
  • "Guess Mode" allows generation without text prompts by inferring content from control maps.
  • ControlNets are composable for multi-condition control.
  • A tool is provided for transferring a trained ControlNet to any community SD1.X model.
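Multi-condition composition can be sketched conceptually (a hypothetical helper, not the official API): each ControlNet contributes a residual feature map, and residuals from several controls are combined additively, here scaled by an assumed per-control strength.

```python
import numpy as np

def combine_controls(backbone_features, residuals, strengths):
    # Conceptual sketch: add each control's residual, scaled by its strength,
    # onto the backbone's feature map.
    out = backbone_features.copy()
    for r, s in zip(residuals, strengths):
        out += s * r
    return out

feats = np.zeros((4, 8, 8))              # stand-in backbone features
pose_residual = np.ones((4, 8, 8))       # residual from a pose ControlNet
depth_residual = 2 * np.ones((4, 8, 8))  # residual from a depth ControlNet

combined = combine_controls(feats, [pose_residual, depth_residual], [0.5, 1.0])
assert np.allclose(combined, 2.5)
```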

Maintenance & Community

  • Active development with recent releases (ControlNet 1.1).
  • Mentions community contributions and related projects like Mikubill's A1111 Webui Plugin and ControlNet-for-Diffusers.
  • arXiv link provided for the paper.

Licensing & Compatibility

  • License not explicitly stated in the README.
  • Compatible with Stable Diffusion 1.X models.

Limitations & Caveats

  • Some Gradio interfaces are noted as difficult to customize or buggy.
  • Anime Line Drawing model is not yet released due to risk evaluation.
  • Transferring ControlNet to community models is experimental.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 3
  • Star History: 740 stars in the last 90 days
