Disco-Stable-Diffusion-Win-GUI  by zhaoyun0071

Windows GUI for Stable Diffusion

created 3 years ago
450 stars

Top 67.9% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides a Windows GUI for Disco Diffusion and Stable Diffusion, targeting users who want an easy-to-use, no-setup-required interface for AI image generation. It simplifies complex workflows by integrating features like ControlNet, LoRA support, video-to-image generation, and various AI-powered tools for image editing and enhancement.

How It Works

The GUI is built with PySide2 and wraps the Disco Diffusion and Stable Diffusion models. It offers a unified interface for various AI generation and manipulation tasks, including text-to-image, image-to-image, video generation from images, upscaling, and even 3D video conversion. Key features like ControlNet, LoRA, and faster Whisper for transcription are integrated to enhance creative control and efficiency.

Quick Start & Requirements

  • Install: Download and extract the provided package (e.g., from Baidu Netdisk, Tianyi Netdisk, or Google Drive).
  • Requirements: Windows OS, NVIDIA GPU with at least 3GB VRAM (2GB minimum, 30/20/10 series recommended). AMD GPUs are not supported.
  • Setup: Download and extract the application and move the models folder into the application directory.
  • Docs: Bilibili tutorials are linked for setup and feature explanations.

Highlighted Details

  • Supports ControlNet 1.1 and Tencent T2I-Adapter for advanced image generation.
  • Integrates LoRA and Lycoris model loading, along with VAE support.
  • Features video-to-image generation with automatic frame-by-frame referencing.
  • Includes AI upscaling, image colorization, background removal (rembg, SAM), and image-to-3D conversion.
  • Offers audio/video transcription via faster-whisper and integrates ChatGLM for text generation.

Maintenance & Community

The project is actively updated, with the latest version (V5.1) released on May 20, 2023. The developer is responsive to issues, as indicated by the "Contact me to solve problems" note.

Licensing & Compatibility

The project is based on open-source repositories (alembics/disco-diffusion, CompVis/stable-diffusion) but the specific license for this GUI wrapper is not explicitly stated in the README. Compatibility is limited to Windows and NVIDIA GPUs.

Limitations & Caveats

AMD GPU support is explicitly excluded. Some advanced features like image style imitation require higher VRAM (10GB+). The project's reliance on specific download links might pose long-term availability issues.

Health Check
Last commit

2 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.