Python SDK for Stable Diffusion inference
Top 72.3% on sourcepulse
This Python library provides a lightweight SDK for interacting with the Automatic 1111 Stable Diffusion Web UI's core functionalities. It targets developers and researchers who need to programmatically generate, upscale, and edit images using state-of-the-art diffusion models, offering a modular and efficient alternative to direct Web UI interaction or other libraries.
How It Works
The SDK encapsulates the Automatic 1111 Web UI's pipelines, including Text-to-Image, Image-to-Image, Inpainting, and Outpainting. It utilizes a single pipeline object to support multiple operations, aiming to reduce RAM consumption compared to solutions requiring separate pipeline instantiations. The library also integrates direct model downloading from Civit AI and supports various upscaling models like Esrgan and Real Esrgan.
Quick Start & Requirements
pip3 install auto1111sdk
pip3 install git+https://github.com/saketh12/Auto1111SDK.git
Highlighted Details
Maintenance & Community
The project welcomes community contributions, including bug reports and feature requests. Contributions can be made via GitHub issues and pull requests.
Licensing & Compatibility
The library's licensing is not explicitly stated in the README, but it heavily relies on and integrates with the Automatic 1111 Stable Diffusion Web UI, which is typically under a permissive license. Compatibility for commercial use or closed-source linking would depend on the underlying licenses of its dependencies.
Limitations & Caveats
ControlNet currently only supports fp32 precision, with fp16 support planned. The roadmap indicates planned additions for Hires Fix, Refiner, LoRAs, Face restoration, and Dreambooth training scripts, suggesting these features are not yet fully implemented.
1 year ago
Inactive