consistencydecoder by openai

Improved decoding for Stable Diffusion VAEs

Created 1 year ago
2,195 stars

Top 20.6% on SourcePulse

Project Summary

This project provides an alternative decoder for the Stable Diffusion VAE. It replaces the standard GAN-based decoder with the aim of producing higher-quality images from the same latents. It is targeted at researchers and developers working with diffusion models who want higher-fidelity reconstructions and more consistent outputs.

How It Works

The Consistency Decoder replaces the traditional GAN decoder in the Stable Diffusion VAE with a consistency model, a type of generative model that maps a noisy input to a clean sample in one or a few steps. The goal is a reconstruction of the latent representation that is more faithful and shows fewer artifacts than the output of the standard GAN decoder.
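To make the idea of few-step consistency sampling concrete, here is a toy sketch in pure Python. It is not the repository's implementation: `toy_consistency_fn`, `multistep_sample`, the noise levels, and the blend weights are all hypothetical, and the toy "model" cheats by being handed the clean signal, where a trained decoder would predict it from the conditioning latent. It only illustrates the control flow: predict a clean sample from noise, optionally re-noise to a lower level, and predict again.

```python
import random


def toy_consistency_fn(x_t, sigma, clean, sigma_data=0.5):
    """Hypothetical stand-in for a trained consistency model.

    Blends the noisy input with the known clean signal. The skip weight
    equals 1 at sigma == 0, so the function returns its input unchanged
    there (the boundary condition a consistency model must satisfy).
    """
    c_skip = sigma_data ** 2 / (sigma ** 2 + sigma_data ** 2)
    return [c_skip * x + (1.0 - c_skip) * c for x, c in zip(x_t, clean)]


def multistep_sample(clean, sigmas, rng):
    """Simplified multistep sampling over a decreasing list of noise levels."""
    # Start from pure noise at the highest noise level.
    x = [sigmas[0] * rng.gauss(0.0, 1.0) for _ in clean]
    x0 = toy_consistency_fn(x, sigmas[0], clean)
    for sigma in sigmas[1:]:
        # Re-noise the current estimate to a lower level, then predict again.
        x = [v + sigma * rng.gauss(0.0, 1.0) for v in x0]
        x0 = toy_consistency_fn(x, sigma, clean)
    return x0


if __name__ == "__main__":
    rng = random.Random(0)
    clean = [0.0, 1.0, -1.0, 0.5]
    print(multistep_sample(clean, [80.0, 0.05], rng))
```

Because the toy function already knows the answer, the extra step does not improve its output. In a real model, additional steps trade compute for sample quality.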

Quick Start & Requirements

  • Primary install: pip install git+https://github.com/openai/consistencydecoder.git
  • Prerequisites: CUDA-enabled GPU (device="cuda:0"), PyTorch, diffusers library.
  • Model size: 2.49 GB.
  • Official quick-start and examples are available in the README.
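The requirements above suggest a workflow along the following lines. This is a hedged sketch, not the official quick-start: the names `ConsistencyDecoder`, `load_image`, and `save_image`, their arguments, and the default checkpoint ID are assumptions to verify against the README, and `latent_shape` and `compare_decoders` are hypothetical helpers introduced here for illustration. The heavy imports sit inside the function so the file can be loaded on a machine without a GPU.

```python
def latent_shape(height, width, channels=4, downsample=8):
    """Expected (C, H, W) of a Stable Diffusion v1.x latent for an image size."""
    if height % downsample or width % downsample:
        raise ValueError("image sides must be multiples of the downsample factor")
    return (channels, height // downsample, width // downsample)


def compare_decoders(image_path, out_prefix="out",
                     sd_model_id="runwayml/stable-diffusion-v1-5",  # assumed ID
                     device="cuda:0", size=(256, 256)):
    """Encode one image, then decode the latent with both decoders."""
    # Deferred imports: these need PyTorch, diffusers, and the package itself.
    import torch
    from diffusers import StableDiffusionPipeline
    from consistencydecoder import ConsistencyDecoder, load_image, save_image

    pipe = StableDiffusionPipeline.from_pretrained(
        sd_model_id, torch_dtype=torch.float16
    )
    pipe.vae.to(device)
    decoder = ConsistencyDecoder(device=device)  # model size: 2.49 GB

    # Assumed helper signature; check the README for the exact arguments.
    image = load_image(image_path, size=size, center_crop=True)
    with torch.no_grad():
        latent = pipe.vae.encode(image.half().to(device)).latent_dist.mean
        # Sanity check: drop the batch dimension and compare with (C, H, W).
        assert tuple(latent.shape[1:]) == latent_shape(*size)
        sample_gan = pipe.vae.decode(latent).sample  # standard GAN decoder
        sample_consistency = decoder(latent)         # consistency decoder

    save_image(sample_gan, f"{out_prefix}_gan.png")
    save_image(sample_consistency, f"{out_prefix}_consistency.png")
```

Calling `compare_decoders("input.png")` on a CUDA machine would write two files for side-by-side comparison. `latent_shape` runs anywhere and is useful for checking an image size before loading any weights.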

Highlighted Details

  • Offers an alternative to the standard GAN decoder for Stable Diffusion VAEs.
  • Aims for improved image reconstruction quality and consistency.
  • Demonstrates a 256x256 image reconstruction example.

Maintenance & Community

This project is from OpenAI. No specific community channels or roadmap are detailed in the README.

Licensing & Compatibility

The README does not state a license. Until the terms are confirmed from the repository itself, commercial use or integration into closed-source projects may be restricted.

Limitations & Caveats

The README does not specify a license, which could be a significant blocker for commercial adoption. Its example covers only one VAE (Stable Diffusion v1.5) and one image size (256x256), and it gives no information on compatibility or performance with other models or resolutions.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 4 stars in the last 30 days
