Disco-Stable-Diffusion-Win-GUI by zhaoyun0071

Windows GUI for Stable Diffusion

Created 4 years ago

450 stars

Top 66.1% on SourcePulse

Project Summary

This project provides a Windows GUI for Disco Diffusion and Stable Diffusion, targeting users who want an easy-to-use, no-setup-required interface for AI image generation. It simplifies complex workflows by integrating features like ControlNet, LoRA support, video-to-image generation, and various AI-powered tools for image editing and enhancement.

How It Works

The GUI is built with PySide2 and wraps the Disco Diffusion and Stable Diffusion models. It offers a unified interface for various AI generation and manipulation tasks, including text-to-image, image-to-image, video generation from images, upscaling, and even 3D video conversion. Key features like ControlNet, LoRA, and faster Whisper for transcription are integrated to enhance creative control and efficiency.

Quick Start & Requirements

Install: Download and extract the provided package (e.g., from Baidu Netdisk, Tianyi Netdisk, or Google Drive).
Requirements: Windows OS, NVIDIA GPU with at least 3GB VRAM (2GB minimum, 30/20/10 series recommended). AMD GPUs are not supported.
Setup: Download and extract the application and move the models folder into the application directory.
Docs: Bilibili tutorials are linked for setup and feature explanations.

Highlighted Details

Supports ControlNet 1.1 and Tencent T2I-Adapter for advanced image generation.
Integrates LoRA and Lycoris model loading, along with VAE support.
Features video-to-image generation with automatic frame-by-frame referencing.
Includes AI upscaling, image colorization, background removal (rembg, SAM), and image-to-3D conversion.
Offers audio/video transcription via faster-whisper and integrates ChatGLM for text generation.

Maintenance & Community

The project is actively updated, with the latest version (V5.1) released on May 20, 2023. The developer is responsive to issues, as indicated by the "Contact me to solve problems" note.

Licensing & Compatibility

The project is based on open-source repositories (alembics/disco-diffusion, CompVis/stable-diffusion) but the specific license for this GUI wrapper is not explicitly stated in the README. Compatibility is limited to Windows and NVIDIA GPUs.

Limitations & Caveats

AMD GPU support is explicitly excluded. Some advanced features like image style imitation require higher VRAM (10GB+). The project's reliance on specific download links might pose long-term availability issues.

Disco-Stable-Diffusion-Win-GUI by zhaoyun0071

Explore Similar Projects

ComfyUI-StableDiffusion3-API by ZHO-ZHO-ZHO

frame by 66HEX

BizyAir by siliconflow

kandinsky-5 by kandinskylab

Google-Colab_Notebooks by Isi-dev

EasyAnimate by aigc-apps

HunyuanDiT by Tencent-Hunyuan

stt by jianchang512

sdnext by vladmandic

LTX-Video by Lightricks

DiffSynth-Studio by modelscope

Open-Sora by hpcaitech