dalle-playground by saharmor

Text-to-image playground using Stable Diffusion

Created 4 years ago
2,758 stars

Top 17.3% on SourcePulse

Project Summary

This project provides a web-based playground for generating images from text prompts. It currently uses Stable Diffusion V2 and is aimed at enthusiasts and developers who want to explore text-to-image generation without a complex setup.

How It Works

The project consists of a Python backend serving a React frontend. The backend handles image generation requests using Stable Diffusion V2, while the frontend provides a user interface for inputting prompts and viewing results. The architecture allows for flexible deployment, with options for cloud-hosted backends (like Google Colab) or local execution.
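
The backend's role in this split can be sketched as a function that turns a JSON request into a JSON response. This is only an illustration under stated assumptions, not the project's actual API: the function name `handle_generate`, the field names `prompt`, `num_images`, and `images`, and the `generate_image` callable are all hypothetical stand-ins.

```python
import base64
import json


def handle_generate(request_body: str, generate_image) -> str:
    """Turn a JSON request into a JSON response of base64-encoded images.

    `generate_image` is a hypothetical callable (prompt -> raw image bytes).
    In a real backend it would wrap the image-generation model.
    """
    payload = json.loads(request_body)
    prompt = payload["prompt"]                  # hypothetical field name
    count = int(payload.get("num_images", 1))   # hypothetical field name

    # Generate the requested number of images for the same prompt.
    images = [generate_image(prompt) for _ in range(count)]

    # Base64 keeps binary image data safe inside a JSON string.
    encoded = [base64.b64encode(img).decode("ascii") for img in images]
    return json.dumps({"images": encoded})      # hypothetical field name
```

Passing the generator in as an argument keeps the request handling testable without a GPU: a stub that returns fixed bytes is enough to exercise the request and response shapes.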

Quick Start & Requirements

  • Colab Backend: Run the provided Google Colab notebook, then access the frontend at https://saharmor.github.io/dalle-playground/?backendUrl=<your_colab_url>.
  • Local Backend: Requires Python 3.x and PyTorch with CUDA support. Install dependencies with pip install -r requirements.txt, then start the backend with python3 app.py --port 8080. To run the frontend, use npm install followed by npm start.
  • Docker: Requires Docker and NVIDIA Container Toolkit. Run with docker-compose up.
  • WSL2: Requires specific setup for GPU support, including installing CUDA/CuDNN in Linux and compiling JAX from source with CUDA enabled.
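
For the Colab option, the hosted frontend receives the backend address through the backendUrl query parameter. A minimal sketch of assembling that link follows. The helper name `playground_url` is hypothetical, and leaving `:` and `/` unescaped is an assumption made to match the URL form shown above.

```python
from urllib.parse import quote

# Hosted frontend address, as given in the Colab instructions.
FRONTEND_URL = "https://saharmor.github.io/dalle-playground/"


def playground_url(backend_url: str) -> str:
    """Return the hosted frontend link pointing at the given backend URL."""
    # Keep ':' and '/' readable; escape anything else that is unsafe in a query.
    safe_backend = quote(backend_url.strip(), safe=":/")
    return f"{FRONTEND_URL}?backendUrl={safe_backend}"


if __name__ == "__main__":
    # example.com is a placeholder; substitute the URL printed by your notebook.
    print(playground_url("https://example.com"))
```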

Highlighted Details

  • Supports Stable Diffusion V2 for image generation.
  • Offers a GitHub-hosted frontend for quick testing with a remote backend.
  • Provides detailed instructions for local development and Docker deployment.
  • Includes specific guidance for WSL2 GPU setup challenges.

Maintenance & Community

The project was originally based on DALL-E Mini by @borisdayma. Further community or maintenance details are not specified in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the README.

Limitations & Caveats

On Google Colab's free tier, the backend may time out when asked to generate more than a few images. GPU acceleration under WSL2 requires substantial manual work, including compiling and configuring JAX.

Health Check

  • Last Commit: 1 year ago
  • Responsiveness: 1 week
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 3 stars in the last 30 days
