consistencydecoder  by openai

Improved decoding for stable diffusion VAEs

created 1 year ago
2,191 stars

Top 21.0% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides an improved decoder for Stable Diffusion VAEs, aiming to enhance image generation quality by offering an alternative to the standard GAN-based decoder. It is targeted at researchers and developers working with diffusion models who seek higher fidelity reconstructions and more consistent outputs.

How It Works

The Consistency Decoder leverages consistency models, a recent advancement in generative modeling, to replace the traditional GAN decoder in the Stable Diffusion VAE. This approach aims to produce more faithful and less artifact-prone reconstructions of latent representations compared to the standard GAN decoder.

Quick Start & Requirements

  • Primary install: pip install git+https://github.com/openai/consistencydecoder.git
  • Prerequisites: CUDA-enabled GPU (device="cuda:0"), PyTorch, diffusers library.
  • Model size: 2.49 GB.
  • Official quick-start and examples are available in the README.

Highlighted Details

  • Offers an alternative to the standard GAN decoder for Stable Diffusion VAEs.
  • Aims for improved image reconstruction quality and consistency.
  • Demonstrates a 256x256 image reconstruction example.

Maintenance & Community

This project is from OpenAI. No specific community channels or roadmap are detailed in the README.

Licensing & Compatibility

The repository does not explicitly state a license. This may restrict commercial use or integration into closed-source projects.

Limitations & Caveats

The README does not specify a license, which could be a significant blocker for commercial adoption. It also focuses on a specific VAE (Stable Diffusion v1.5) and a fixed image size (256x256), with no information on broader compatibility or performance with other models or resolutions.

Health Check
Last commit

1 year ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
16 stars in the last 90 days

Explore Similar Projects

Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify) and Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers).

taesd by madebyollin

0.5%
758
Tiny AutoEncoder for Stable Diffusion latents
created 2 years ago
updated 3 months ago
Starred by Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), Travis Fischer Travis Fischer(Founder of Agentic), and
3 more.

consistency_models by openai

0.0%
6k
PyTorch code for consistency models research paper
created 2 years ago
updated 1 year ago
Feedback? Help us improve.