MistoControlNet-Flux-dev  by TheMistoAI

ControlNet for lineart/outline sketches, compatible with Flux1.dev

created 11 months ago
350 stars

Top 80.8% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides ControlNet models specifically for the Flux1.dev diffusion model, targeting users who need to generate images from line art or outline sketches. It offers enhanced alignment and expressiveness for various lineart conditions using a dual-stream Transformer architecture, compatible with Flux1.dev's quantized models.

How It Works

The ControlNet utilizes a scalable Transformer module as its backbone, featuring a dual-stream structure. This design improves alignment and expressiveness for lineart and outline inputs without increasing inference time. It's trained for alignment with both T5 and clip-l TextEncoders, ensuring a balance between image conditioning and text prompts.

Quick Start & Requirements

  • Download the model from Huggingface: MistoLine_Flux.dev_v1.
  • Place the model in ComfyUI\models\TheMisto_model\.
  • Run using ComfyUI; an example workflow is provided in the workflow folder.
  • Conditioning image dimensions must be divisible by 16.
  • Requires TheMisto.ai Flux ControlNet ComfyUI suite.
  • Compatible with Flux1.dev's fp16/fp8 and other quantized models.
  • Recommended settings: 720px+ resolution, controlnet strength 0.6-0.85, guidance 3.0-5.0, 30+ steps.

Highlighted Details

  • ControlNet model parameters: ~1.4B.
  • Trained for alignment with T5 and clip-l TextEncoders.
  • Compatible with Flux1.dev's fp16/fp8 and quantized models (e.g., flux1-dev-Q4_K_S.gguf).
  • Performance is positively correlated with prompt quality; experiment with controlnet_strength.

Maintenance & Community

  • Developed by TheMisto.ai Team.
  • Community links: Discord (https://discord.gg/fTyDB2CU), X (https://x.com/AiThemisto79359).
  • Future product: Misto, a multi-modal AI creative tool.

Licensing & Compatibility

  • License: FLUX.1 [dev] Non-Commercial License.
  • Usage: Research and educational purposes only; commercial use is prohibited.

Limitations & Caveats

  • Not compatible with XLabs loaders and samplers.
  • Training requires consumer-grade GPUs (e.g., A100-80GB with bf16) and is computationally expensive; consumer GPUs are unsuitable for training.
  • ByteDance 8/16-step distilled models have not been tested.
Health Check
Last commit

11 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
6 stars in the last 90 days

Explore Similar Projects

Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify) and Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers).

taesd by madebyollin

0.3%
759
Tiny AutoEncoder for Stable Diffusion latents
created 2 years ago
updated 3 months ago
Feedback? Help us improve.