MagicQuill by ant-research

Interactive image editing system for precise manipulation (CVPR 2025 paper)

Created 1 year ago

3,670 stars

Top 13.2% on SourcePulse

Project Summary

MagicQuill is an intelligent, interactive image editing system designed for precise local edits, offering AI-powered suggestions and a user-friendly interface. It targets researchers and users seeking advanced image manipulation capabilities, enabling detailed control over edits through intuitive brush tools.

How It Works

MagicQuill combines diffusion models with interactive brush strokes for image editing. Users can "add" elements, "subtract" unwanted parts, or "color" specific regions. A "Draw and Guess" feature predicts the user's intent from the brush strokes and fills in the text prompt automatically. This gives fine-grained, local control over image generation and modification rather than only global adjustments.
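The three brush modes can be pictured as stroke records that together mark the region to edit. The following is a minimal sketch of that idea only, not MagicQuill's actual data model: the `Stroke` class, the `edit_mask` helper, and the pixel-set representation are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Set, Tuple

Pixel = Tuple[int, int]  # (row, column)
BRUSH_MODES = ("add", "subtract", "color")  # modes named in the summary above


@dataclass
class Stroke:
    """Hypothetical record of one brush stroke (not MagicQuill's real type)."""
    mode: str                                   # one of BRUSH_MODES
    pixels: Set[Pixel]                          # pixels touched by the brush
    rgb: Optional[Tuple[int, int, int]] = None  # only meaningful for "color"

    def __post_init__(self) -> None:
        if self.mode not in BRUSH_MODES:
            raise ValueError(f"unknown brush mode: {self.mode}")
        if self.mode == "color" and self.rgb is None:
            raise ValueError("color strokes need an rgb value")


def edit_mask(strokes: List[Stroke], height: int, width: int) -> List[List[int]]:
    """Return a height x width grid with 1 wherever any stroke touched.

    Assumption: a local edit is limited to pixels covered by some stroke.
    """
    touched: Set[Pixel] = set()
    for stroke in strokes:
        touched |= stroke.pixels
    return [
        [1 if (r, c) in touched else 0 for c in range(width)]
        for r in range(height)
    ]


strokes = [
    Stroke("add", {(0, 0), (0, 1)}),
    Stroke("color", {(2, 2)}, rgb=(255, 0, 0)),
]
mask = edit_mask(strokes, height=3, width=3)
```

Pixels outside the mask are left for the model to preserve, which is what makes the edit local rather than global.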

Quick Start & Requirements

Install: Clone the repository with git clone --recursive https://github.com/magic-quill/MagicQuill.git, then run the setup script for your platform (windows_setup.bat or linux_setup.sh) or follow the manual installation steps.
Prerequisites: Python 3.10, PyTorch 2.1.2 with CUDA 11.8 support, and approximately 25 GB of disk space for checkpoints. A GPU with at least 8 GB of VRAM is required.
Resources: Downloading the checkpoints can be time-consuming.
Links: Demo Page, ComfyUI Node, Modelscope.
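Because the checkpoint download is large, it can help to verify the prerequisites listed above before starting. The script below is a rough sketch and not part of the MagicQuill repository: the `preflight` and `detect_vram_gb` helpers are hypothetical, and the thresholds are taken directly from the figures in this summary.

```python
import shutil
import sys
from typing import List, Optional, Sequence

REQUIRED_PYTHON = (3, 10)   # from the prerequisites above
MIN_VRAM_GB = 8.0           # minimum GPU memory stated above
MIN_FREE_DISK_GB = 25.0     # approximate checkpoint size stated above
VRAM_TOLERANCE_GB = 0.5     # assumption: drivers often report slightly under nominal


def preflight(python_version: Sequence[int],
              vram_gb: Optional[float],
              free_disk_gb: float) -> List[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if tuple(python_version[:2]) != REQUIRED_PYTHON:
        problems.append(
            f"Python {python_version[0]}.{python_version[1]} found, 3.10 expected"
        )
    if vram_gb is None:
        problems.append("no CUDA-capable GPU detected")
    elif vram_gb < MIN_VRAM_GB - VRAM_TOLERANCE_GB:
        problems.append(f"only {vram_gb:.1f} GB VRAM, at least 8 GB required")
    if free_disk_gb < MIN_FREE_DISK_GB:
        problems.append(f"only {free_disk_gb:.1f} GB free disk, about 25 GB needed")
    return problems


def detect_vram_gb() -> Optional[float]:
    """Total memory of GPU 0 in GB, or None if PyTorch or CUDA is unavailable."""
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_properties(0).total_memory / 1024 ** 3


if __name__ == "__main__":
    free_gb = shutil.disk_usage(".").free / 1024 ** 3
    issues = preflight(sys.version_info, detect_vram_gb(), free_gb)
    for issue in issues:
        print("WARNING:", issue)
    if not issues:
        print("Environment looks suitable.")
```

Keeping `preflight` free of hardware calls means the checks can be exercised with plain numbers, while `detect_vram_gb` isolates the optional PyTorch dependency.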

Highlighted Details

Accepted to CVPR 2025.
Supports multiple editing modes: add, subtract, and color brushes.
Features "Draw and Guess" for prompt auto-completion.
Offers tunable parameters for brush size, edge control, and generation strength.
Available as a Docker container for isolated environments.

Maintenance & Community

Active development with recent updates (Nov-Dec 2024) including UI enhancements and ComfyUI node release.
The README credits contributions from users including lior007, JamesIV4, and Furkan Gözükara.
No explicit community links (Discord/Slack) are provided in the README.

Licensing & Compatibility

Licensed under CC BY-NC 4.0.
Restrictions: Non-commercial use only. Prohibits generation of harmful content.

Limitations & Caveats

The CC BY-NC 4.0 license restricts commercial use. The system requires significant VRAM and checkpoint downloads, and the "Draw and Guess" feature may sometimes misinterpret user intent.

