Disco-Stable-Diffusion-Win-GUI by zhaoyun0071

Windows GUI for Stable Diffusion

Created 3 years ago
450 stars

Top 66.9% on SourcePulse

Project Summary

This project provides a Windows GUI for Disco Diffusion and Stable Diffusion, targeting users who want an easy-to-use, no-setup-required interface for AI image generation. It simplifies complex workflows by integrating features like ControlNet, LoRA support, video-to-image generation, and various AI-powered tools for image editing and enhancement.

How It Works

The GUI is built with PySide2 and wraps the Disco Diffusion and Stable Diffusion models. It offers a single interface for generation and manipulation tasks: text-to-image, image-to-image, video generation from images, upscaling, and 3D video conversion. ControlNet and LoRA are integrated for finer creative control, and faster-whisper is integrated for transcription.

Quick Start & Requirements

  • Install: Download and extract the provided package (e.g., from Baidu Netdisk, Tianyi Netdisk, or Google Drive).
  • Requirements: Windows OS and an NVIDIA GPU. 2GB of VRAM is the minimum and 3GB or more is recommended; 30-, 20-, and 10-series cards are recommended. AMD GPUs are not supported.
  • Setup: After extracting the package, move the models folder into the application directory.
  • Docs: Bilibili tutorials are linked for setup and feature explanations.
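
The setup step above (moving the models folder into the application directory) can also be scripted. The following is a minimal sketch, not part of the project: the function name `install_models` and both paths are hypothetical, and it assumes the models folder has already been extracted somewhere on disk. It refuses to overwrite an existing folder rather than guessing how to merge.

```python
import shutil
from pathlib import Path


def install_models(models_src: Path, app_dir: Path) -> Path:
    """Move an extracted models folder into the application directory.

    Hypothetical helper: the paths are placeholders and are not taken
    from the project's documentation.
    """
    if not models_src.is_dir():
        raise FileNotFoundError(f"models folder not found: {models_src}")
    if not app_dir.is_dir():
        raise FileNotFoundError(f"application directory not found: {app_dir}")

    target = app_dir / models_src.name
    if target.exists():
        # Do not overwrite or merge automatically; let the user decide.
        raise FileExistsError(f"{target} already exists; merge it manually")

    shutil.move(str(models_src), str(target))
    return target
```

For example, `install_models(Path(r"D:\Downloads\models"), Path(r"D:\DiscoGUI"))` would leave the folder at `D:\DiscoGUI\models`; both paths here are made up for illustration.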

Highlighted Details

  • Supports ControlNet 1.1 and Tencent T2I-Adapter for advanced image generation.
  • Integrates LoRA and Lycoris model loading, along with VAE support.
  • Features video-to-image generation with automatic frame-by-frame referencing.
  • Includes AI upscaling, image colorization, background removal (rembg, SAM), and image-to-3D conversion.
  • Offers audio/video transcription via faster-whisper and integrates ChatGLM for text generation.

Maintenance & Community

The latest listed version (V5.1) was released on May 20, 2023. The "Contact me to solve problems" note suggests the developer was open to handling issues directly.

Licensing & Compatibility

The project is based on open-source repositories (alembics/disco-diffusion, CompVis/stable-diffusion) but the specific license for this GUI wrapper is not explicitly stated in the README. Compatibility is limited to Windows and NVIDIA GPUs.

Limitations & Caveats

AMD GPUs are not supported. Some advanced features, such as image style imitation, require 10GB or more of VRAM. Because the package is distributed through external download links, it may become unavailable if those links expire.
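
The VRAM figures quoted on this page (2GB minimum, 3GB recommended, 10GB+ for features such as image style imitation) can be checked against a machine before downloading. The sketch below is an illustration only and is not part of the project. It assumes the NVIDIA driver's `nvidia-smi` tool is on the PATH, treats each "GB" figure as 1024 MiB, and uses tier labels that are invented here.

```python
import subprocess

# Thresholds taken from the requirements on this page, read as 1 GB = 1024 MiB.
MIN_VRAM_MIB = 2 * 1024
RECOMMENDED_VRAM_MIB = 3 * 1024
HIGH_VRAM_MIB = 10 * 1024  # e.g. image style imitation


def parse_gpus(csv_text):
    """Parse 'name, memory' lines into (name, vram_mib) tuples."""
    gpus = []
    for line in csv_text.strip().splitlines():
        if not line.strip():
            continue
        # Split on the last comma so commas inside a GPU name are preserved.
        name, _, mem = line.rpartition(",")
        gpus.append((name.strip(), int(mem.strip())))
    return gpus


def classify(vram_mib):
    """Map a VRAM size to one of the illustrative tiers."""
    if vram_mib < MIN_VRAM_MIB:
        return "below minimum"
    if vram_mib < RECOMMENDED_VRAM_MIB:
        return "minimum"
    if vram_mib < HIGH_VRAM_MIB:
        return "recommended"
    return "high-VRAM features"


def query_gpus():
    """Ask nvidia-smi for each GPU's name and total memory in MiB."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=name,memory.total",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return parse_gpus(result.stdout)
```

On a machine with an NVIDIA driver installed, `[(name, classify(mib)) for name, mib in query_gpus()]` gives one tier per GPU. If `nvidia-smi` is missing, `query_gpus()` raises `FileNotFoundError`, which on this project's terms usually means no supported GPU is present.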

Health Check

  • Last Commit: 2 years ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 0 stars in the last 30 days
