MistoControlNet-Flux-dev by TheMistoAI

ControlNet for lineart/outline sketches, compatible with Flux1.dev

Created 1 year ago

356 stars

Top 78.8% on SourcePulse

Project Summary

This repository provides ControlNet models specifically for the Flux1.dev diffusion model, targeting users who need to generate images from line art or outline sketches. It offers enhanced alignment and expressiveness for various lineart conditions using a dual-stream Transformer architecture, compatible with Flux1.dev's quantized models.

How It Works

The ControlNet utilizes a scalable Transformer module as its backbone, featuring a dual-stream structure. This design improves alignment and expressiveness for lineart and outline inputs without increasing inference time. It's trained for alignment with both T5 and clip-l TextEncoders, ensuring a balance between image conditioning and text prompts.

Quick Start & Requirements

Download the model from Huggingface: MistoLine_Flux.dev_v1.
Place the model in ComfyUI\models\TheMisto_model\.
Run using ComfyUI; an example workflow is provided in the workflow folder.
Conditioning image dimensions must be divisible by 16.
Requires TheMisto.ai Flux ControlNet ComfyUI suite.
Compatible with Flux1.dev's fp16/fp8 and other quantized models.
Recommended settings: 720px+ resolution, controlnet strength 0.6-0.85, guidance 3.0-5.0, 30+ steps.

Highlighted Details

ControlNet model parameters: ~1.4B.
Trained for alignment with T5 and clip-l TextEncoders.
Compatible with Flux1.dev's fp16/fp8 and quantized models (e.g., flux1-dev-Q4_K_S.gguf).
Performance is positively correlated with prompt quality; experiment with controlnet_strength.

Maintenance & Community

Developed by TheMisto.ai Team.
Community links: Discord (https://discord.gg/fTyDB2CU), X (https://x.com/AiThemisto79359).
Future product: Misto, a multi-modal AI creative tool.

Licensing & Compatibility

License: FLUX.1 [dev] Non-Commercial License.
Usage: Research and educational purposes only; commercial use is prohibited.

Limitations & Caveats

Not compatible with XLabs loaders and samplers.
Training requires consumer-grade GPUs (e.g., A100-80GB with bf16) and is computationally expensive; consumer GPUs are unsuitable for training.
ByteDance 8/16-step distilled models have not been tested.

Health Check

Last Commit

5 months ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

0 stars in the last 30 days

Explore Similar Projects

Starred by

comfyanonymous

comfyanonymous(Author of ComfyUI; Cofounder of Comfy Org).

SD-Latent-Interposer by city96

Neural network for Stable Diffusion latent space interoperability

Created 2 years ago

Updated 1 year ago

ComfyUI-OmniGen by 1038lab

ComfyUI node for text-to-image generation and image editing

Created 1 year ago

Updated 10 months ago

ComfyUI-ELLA by TencentQQGYLab

ComfyUI nodes for ELLA, enhancing diffusion models with LLMs

Created 1 year ago

Updated 1 year ago

omini-kontext by Saquib764

Image editing framework with multi-image references

Created 7 months ago

Updated 5 months ago

T2ITrainer by lrzjason

Text-to-image training scripts

Created 1 year ago

Updated 1 week ago

ComfyUI-DyPE by wildminder

DyPE for FLUX: Artifact-free 4K+ image generation via ComfyUI

Created 4 months ago

Updated 2 months ago

awesome-tensorlayer by tensorlayer

Deep learning library for research and industry

Created 7 years ago

Updated 6 years ago

ComfyUI-PuLID-Flux by balazik

ComfyUI implementation for PuLID-Flux

Created 1 year ago

Updated 1 year ago

Lumina-Image-2.0 by Alpha-VLLM

Image generation research paper using a unified framework

Created 1 year ago

Updated 3 months ago

BLIP3o by JiuhaiChen

Unified multimodal model combining reasoning with generative diffusion

Created 10 months ago

Updated 2 months ago

mindone by mindspore-lab

Generative AI model & algorithm collection

Created 2 years ago

Updated 1 month ago

ACE_plus by ali-vilab

Image creation/editing via instruction-based content filling (research paper)

Created 1 year ago

Updated 10 months ago

Feedback? Help us improve.