Script for Stable Diffusion WebUI enabling regional prompting
Top 25.3% on sourcepulse
This repository provides a custom script for AUTOMATIC1111's Stable Diffusion WebUI, enabling users to apply different prompts to distinct regions of an image. It targets users who need fine-grained control over image generation, allowing for complex compositions and stylistic variations within a single output.
How It Works
The script operates by dividing the image into specified regions, either horizontally, vertically, or in a 2D grid. Users define prompts for each region, separated by "BREAK" keywords. It supports two primary calculation modes: "Attention" mode, which is faster and suitable for most use cases, and "Latent" mode, which is slower but offers better separation for elements like LoRAs. New features include region specification via inpainting masks or prompt analysis.
Quick Start & Requirements
Highlighted Details
ADDCOL
and ADDROW
separators.Maintenance & Community
The project is actively maintained, with recent updates including support for reForge and bug fixes for Forge compatibility. Community discussions and support are likely available through the project's GitHub repository.
Licensing & Compatibility
The repository does not explicitly state a license in the provided README. Users should verify licensing for commercial use or closed-source integration.
Limitations & Caveats
Latent mode with LoRA/LoCon/LoHa may experience slower generation speeds or compatibility issues depending on VRAM and WebUI specifications. There are known issues with mask uploading/loading in Gradio and potential LoRA corruption in Latent mode, with suggested workarounds provided.
1 month ago
1 week