TeaCache  by ali-vilab

Training-free caching approach for video diffusion model inference

created 8 months ago
1,016 stars

Top 37.5% on sourcepulse

GitHubView on GitHub
Project Summary

TeaCache is a training-free caching method designed to accelerate inference for diffusion models, particularly video diffusion models, by leveraging timestep embedding differences. It targets researchers and developers working with generative AI who need to optimize inference speed without retraining.

How It Works

TeaCache estimates and caches intermediate outputs based on the dynamic changes in timestep embeddings during the diffusion process. This approach avoids redundant computations by intelligently reusing previously computed states, leading to significant speedups. Its advantage lies in its training-free nature and broad applicability across various diffusion model architectures.

Quick Start & Requirements

  • Installation: Typically integrated into existing diffusion model frameworks (e.g., Diffusers, ComfyUI). Specific integration instructions vary by model and framework.
  • Prerequisites: Python, PyTorch, and dependencies of the target diffusion model. GPU acceleration is highly recommended for practical inference speeds.
  • Resources: Requires memory for caching, proportional to the model size and batch size.
  • Links:

Highlighted Details

  • CVPR 2025 Highlight (top 3.7% of papers).
  • Supports a wide range of models including Wan2.1, Cosmos, CogVideoX1.5, FLUX, TangoFlux, and more for Text-to-Video, Image-to-Video, Text-to-Image, and Text-to-Audio tasks.
  • Numerous community integrations for popular frameworks like ComfyUI and Diffusers.
  • Achieves significant inference speedups across various supported models.

Maintenance & Community

The project is actively maintained with recent updates and contributions from institutions like Alibaba Group and universities. It has a growing community with many integrations and support for new models.

Licensing & Compatibility

The majority of the project is released under the Apache 2.0 license. Users must also adhere to the licenses of the underlying diffusion models it integrates with. Apache 2.0 is generally permissive for commercial use.

Limitations & Caveats

The effectiveness and specific implementation details of TeaCache may vary depending on the target diffusion model architecture. While training-free, it requires careful integration into existing inference pipelines.

Health Check
Last commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
10
Star History
280 stars in the last 90 days

Explore Similar Projects

Starred by Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), Travis Fischer Travis Fischer(Founder of Agentic), and
3 more.

consistency_models by openai

0.0%
6k
PyTorch code for consistency models research paper
created 2 years ago
updated 1 year ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
21 more.

diffusers by huggingface

0.3%
30k
PyTorch/Flax library for diffusion model research and applications
created 3 years ago
updated 1 day ago
Feedback? Help us improve.