Image inpainting model for versatile image editing tasks
PowerPaint is a versatile image inpainting model designed for researchers and practitioners in computer vision and generative AI. It offers a unified solution for text-guided object inpainting, object removal, shape-guided object insertion, and outpainting, all within a single model, simplifying complex image editing workflows.
How It Works
PowerPaint leverages task-specific prompts to guide its inpainting process, enabling it to handle diverse editing tasks with a single architecture. It builds upon the BrushNet framework, preserving cross-attention layers for prompt integration, which allows for fine-grained control over the inpainting results, particularly in shape-guided generation.
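The idea of steering one model with task-specific prompts can be illustrated with a minimal sketch. It assumes each editing task maps to a placeholder token that is appended to the user's text prompt. The token strings, the TASK_TOKENS table, and the build_prompt helper are all hypothetical names chosen for illustration, not PowerPaint's actual identifiers.

```python
# Hypothetical task-to-token table. The real token strings and task names
# used by PowerPaint may differ; these are illustrative placeholders only.
TASK_TOKENS = {
    "text-guided": "<obj>",       # text-guided object inpainting
    "object-removal": "<ctxt>",   # fill the mask from surrounding context
    "shape-guided": "<shape>",    # object insertion that follows the mask shape
    "outpainting": "<ctxt>",      # extend the image beyond its borders
}


def build_prompt(task: str, user_prompt: str = "") -> str:
    """Append the task's placeholder token to the user's prompt (hypothetical helper)."""
    if task not in TASK_TOKENS:
        raise ValueError(f"unknown task: {task!r}")
    # strip() handles the case where the user supplies no text prompt.
    return f"{user_prompt} {TASK_TOKENS[task]}".strip()


removal_prompt = build_prompt("object-removal")
insert_prompt = build_prompt("text-guided", "a red car")
```

Under these assumptions, a single inpainting backbone receives a different conditioning string per task, so switching tasks means changing the prompt rather than swapping models.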
Quick Start & Requirements
Create a conda environment (conda create --name ppt python=3.9), activate it (conda activate ppt), and install dependencies (pip install -r requirements/requirements.txt or conda env create -f requirements/ppt.yaml).
Run the app (python app.py --share). For PowerPaint-V2, use the V2 flags (python app.py --share --version ppt-v2 --checkpoint_dir checkpoints/ppt-v2). Model weights can be downloaded from Hugging Face.
Highlighted Details
Maintenance & Community
The project is associated with OpenMMLab and is under active development, with recent updates in May 2024. Contact information for key contributors is provided.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility with commercial use or closed-source linking is not specified.
Limitations & Caveats
Training PowerPaint-V1 requires a large batch size (e.g., 1024), while V2 is more memory-efficient. The README notes that potential logical errors in ControlNet loading have been rectified, which suggests that earlier versions had stability issues.
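To give a sense of what a batch size of 1024 implies for hardware, the sketch below computes how many gradient accumulation steps would be needed to reach a target effective batch size. The source does not say how the batch is assembled, so gradient accumulation is an assumption here, and the per-device batch size and device count are made-up example values.

```python
import math


def accumulation_steps(target_batch: int, per_device_batch: int, num_devices: int) -> int:
    """Steps needed so per_device_batch * num_devices * steps >= target_batch."""
    samples_per_step = per_device_batch * num_devices
    if samples_per_step <= 0:
        raise ValueError("per_device_batch and num_devices must be positive")
    # Round up so the effective batch is never smaller than the target.
    return math.ceil(target_batch / samples_per_step)


# Example values (assumptions, not from the README): 8 devices, 16 samples each.
steps = accumulation_steps(target_batch=1024, per_device_batch=16, num_devices=8)
```

With these example values each optimizer update would span 8 forward and backward passes, which shows why a large-batch recipe is costly on small setups.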