consistencydecoder by openai

Improved decoding for Stable Diffusion VAEs

Created 2 years ago

2,209 stars

Top 20.0% on SourcePulse

7 Experts Love This Project

Shyamal Anadkat

Research Scientist at OpenAI

Georgios Konstantopoulos

CTO, General Partner at Paradigm

Simon Willison

Coauthor of Django

Vincent Weisser

Cofounder of Prime Intellect

and 3 more!

Project Summary

This project provides an improved decoder for Stable Diffusion VAEs: an alternative to the standard GAN-based decoder for turning latents back into images. It is aimed at researchers and developers working with diffusion models who want higher-fidelity, more consistent reconstructions.

How It Works

The Consistency Decoder replaces the GAN decoder of the Stable Diffusion VAE with a consistency model, a type of generative model trained to map a noisy input directly to a clean sample in one or a few steps. The goal is to reconstruct images from latent representations more faithfully and with fewer artifacts than the standard GAN decoder.
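To make the idea of a consistency model concrete, here is a toy one-dimensional sketch. It is not the repository's decoder, which is a neural network conditioned on the VAE latent. It assumes the data follow a Gaussian N(MU, S^2), for which the consistency function can be written in closed form, and it uses the multistep sampling procedure from the consistency models literature. All names (`MU`, `S`, `EPS`, `T_MAX`, `consistency_fn`, `sample`) are illustrative assumptions.

```python
import math
import random

MU, S = 2.0, 0.5           # toy data distribution: N(MU, S^2) (assumption)
EPS, T_MAX = 0.002, 80.0   # smallest and largest noise levels (assumption)


def consistency_fn(x, t):
    """Map a point x at noise level t to the start (t = EPS) of its ODE trajectory.

    For Gaussian data with noise added as x_t = x_0 + t * z, the probability-flow
    ODE keeps (x - MU) proportional to sqrt(S^2 + t^2), so the map is exact here.
    """
    scale = math.sqrt(S**2 + EPS**2) / math.sqrt(S**2 + t**2)
    return MU + (x - MU) * scale


def sample(steps=(T_MAX, 10.0, 1.0), rng=random):
    """Multistep consistency sampling: one jump from pure noise, then
    re-noise to a smaller level and jump again for each remaining step."""
    x = consistency_fn(rng.gauss(0.0, 1.0) * steps[0], steps[0])
    for t in steps[1:]:
        x_t = x + math.sqrt(t**2 - EPS**2) * rng.gauss(0.0, 1.0)
        x = consistency_fn(x_t, t)
    return x
```

Calling `sample(steps=(T_MAX,))` gives single-step generation; extra entries in `steps` trade compute for quality, which matters for a learned (imperfect) consistency function more than for this exact toy one.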

Quick Start & Requirements

Primary install: pip install git+https://github.com/openai/consistencydecoder.git
Prerequisites: CUDA-enabled GPU (device="cuda:0"), PyTorch, diffusers library.
Model size: 2.49 GB.
Official quick-start and examples are available in the README.
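A reconstruction round trip might look like the sketch below, which follows the general shape of the README's example but should be checked against the README before use. Assumptions: the package exposes `ConsistencyDecoder`, `load_image`, and `save_image`; the VAE is loaded through diffusers' `AutoencoderKL`; `reconstruct`, `expected_latent_shape`, the output file names, and the `sd_model_id` parameter (the hub ID of a Stable Diffusion v1.5 checkpoint) are hypothetical names introduced here.

```python
def expected_latent_shape(height, width):
    """Latent shape (channels, h, w): the SD VAE has 4 latent channels
    and downsamples each spatial dimension by a factor of 8."""
    return (4, height // 8, width // 8)


def reconstruct(image_path, sd_model_id, device="cuda:0"):
    """Encode an image with the SD VAE, then decode it with both decoders."""
    # Heavy imports stay local so this file can be imported without a GPU.
    import torch
    from diffusers import AutoencoderKL
    from consistencydecoder import ConsistencyDecoder, load_image, save_image

    vae = AutoencoderKL.from_pretrained(
        sd_model_id, subfolder="vae", torch_dtype=torch.float16
    ).to(device)
    decoder = ConsistencyDecoder(device=device)  # model size: 2.49 GB

    # 256x256 center crop, matching the example size mentioned in the README.
    image = load_image(image_path, size=(256, 256), center_crop=True)

    with torch.no_grad():
        latent = vae.encode(image.half().to(device)).latent_dist.mean
        assert tuple(latent.shape[1:]) == expected_latent_shape(256, 256)
        gan_image = vae.decode(latent).sample   # standard GAN decoder
        consistency_image = decoder(latent)     # consistency decoder

    save_image(gan_image, "gan.png")
    save_image(consistency_image, "con.png")
```

Running `reconstruct("input.png", sd_model_id)` on a CUDA machine should write the two reconstructions side by side for visual comparison; `input.png` is a placeholder path.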

Highlighted Details

Offers an alternative to the standard GAN decoder for Stable Diffusion VAEs.
Aims for improved image reconstruction quality and consistency.
Demonstrates a 256x256 image reconstruction example.

Maintenance & Community

This project is from OpenAI. No specific community channels or roadmap are detailed in the README.

Licensing & Compatibility

The README does not state a license. Until the licensing terms are confirmed, this may restrict commercial use or integration into closed-source projects.

Limitations & Caveats

The README does not specify a license, which could be a significant blocker for commercial adoption. Its example also covers only one VAE (Stable Diffusion v1.5) and one image size (256x256), and it gives no information on compatibility or performance with other models or resolutions.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Star History

3 stars in the last 30 days