AI agent for image inpainting and editing
Top 57.2% on sourcepulse
BrushEdit is a unified AI agent for image inpainting and editing, targeting researchers and practitioners in computer vision and generative AI. It offers both automated and interactive editing capabilities, leveraging a pipeline that combines multi-modal large language models (MLLMs) with a dual-branch diffusion inpainting model (BrushNetX) for precise and context-aware image manipulation.
How It Works
BrushEdit employs a four-step pipeline: editing category classification, primary editing object identification, mask and target caption generation, and finally, image inpainting. Steps one through three utilize pre-trained MLLMs and detection models (GroundingDINO, SAM) to interpret user instructions, identify targets, and generate masks and descriptive captions. The core image editing is performed by BrushNetX, an enhanced diffusion model designed for high-fidelity inpainting and background preservation, guided by the generated masks and captions.
Quick Start & Requirements
pip install -e .
and pip install -r app/requirements.txt
.sh app/run_app.sh
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
diffusers
and BrushNet
, which have their own licenses. Compatibility for commercial use or closed-source linking is not explicitly stated.Limitations & Caveats
7 months ago
1 day