BrushNet by TencentARC

Image inpainting model using decomposed dual-branch diffusion

Created 1 year ago

1,700 stars

Top 24.7% on SourcePulse

Project Summary

BrushNet is a plug-and-play diffusion model for image inpainting, designed to integrate seamlessly with existing pre-trained diffusion models like Stable Diffusion v1.5 and SDXL. It addresses the challenge of image inpainting by decomposing the learning process, allowing it to be applied to various inpainting scenarios with improved fidelity and control. The target audience includes researchers and developers working on image generation and editing tasks.

How It Works

BrushNet employs a dual-branch diffusion architecture that separates masked image features from noisy latent representations. This decomposition reduces the model's learning burden and enhances its ability to handle image inpainting tasks. By leveraging dense, per-pixel control throughout the pre-trained diffusion model, BrushNet achieves greater suitability for precise image manipulation.

Quick Start & Requirements

Install: Clone the repository and install dependencies using pip install -e . and pip install -r examples/brushnet/requirements.txt.
Prerequisites: PyTorch 1.12.1, Python 3.9. CUDA is implicitly required for GPU acceleration.
Data: Download BrushData, BrushBench, and EditBench datasets. Checkpoints for SD v1.5 and SDXL are available.
Demo: A Gradio demo is available via python examples/brushnet/app_brushnet.py.
Docs: Project page, ArXiv paper, and video are linked in the README.

Highlighted Details

Plug-and-play integration with Stable Diffusion v1.5 and SDXL.
Supports both segmentation mask-guided and random mask inpainting.
Achieved top prize in the CVPR2024 GenAI Media Generation Challenge.
Offers training and evaluation scripts for custom datasets and benchmarks.

Maintenance & Community

The project is from TencentARC, with contributions from researchers at The Chinese University of Hong Kong. Updates include the release of BrushEdit and stronger BrushNetX models. Community interaction points are not explicitly listed, but the project is associated with ECCV 2024.

Licensing & Compatibility

The repository is released under an unspecified license. The data download agreement includes terms and conditions. Compatibility for commercial use or closed-source linking is not detailed.

Limitations & Caveats

The provided SDXL checkpoint is an early version trained with a small batch size and may not perform optimally. Users are advised to train on custom data for specific industrial applications. The evaluation script requires disabling an NSFW detector for accurate results, and image generation may vary across different hardware setups.

BrushNet by TencentARC

Explore Similar Projects

LanPaint by scraed

inpaint-anything by Uminosachi

glid-3-xl-stable by Jack000

Kandinsky-3 by ai-forever

Paint3D by OpenTexture

DiffPIR by yuanzhi-zhu

glid-3-xl by Jack000

comfyui-inpaint-nodes by Acly

Kandinsky-2 by ai-forever

DALLE2-pytorch by lucidrains

latent-diffusion by CompVis

stablediffusion by Stability-AI