Tiny AutoEncoder for Stable Diffusion latents
Top 46.8% on sourcepulse
TAESD is a highly optimized, tiny autoencoder designed to decode Stable Diffusion latents into full-size images with minimal computational cost. It targets users of Stable Diffusion, particularly those needing real-time previewing or efficient standalone VAE functionality, offering a substantial speedup over standard VAEs at a modest quality trade-off.
How It Works
TAESD is a distilled version of Stable Diffusion's VAE, featuring a significantly smaller encoder and decoder. It employs convolutional layers with ReLU activations and upsampling layers. This architecture allows it to process latents efficiently, trading fine detail fidelity for speed and a reduced parameter count, making it suitable for resource-constrained environments or real-time applications.
Quick Start & Requirements
diffusers
library (safetensors format: taesd
, taesdxl
, taesd3
, taef1
).Highlighted Details
Maintenance & Community
The project is maintained by madebyollin. Integration into popular UIs like A1111 and ComfyUI suggests active community adoption.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
TAESD sacrifices fine detail quality for speed and size. While it offers a bounded receptive field, tiled decoding is still not recommended due to potential seam issues related to receptive field coverage.
3 months ago
1 day