A Survey on Video Diffusion Models
This repository serves as a comprehensive survey of video diffusion models, cataloging research across generation, editing, completion, enhancement, prediction, and understanding tasks. It targets researchers and practitioners in AI, computer vision, and multimedia, providing a structured overview of the rapidly evolving field of AI-powered video synthesis and manipulation.
How It Works
The survey categorizes video diffusion models by core architecture (e.g., U-Net or Transformer backbones) and by conditioning mechanism (e.g., text, pose, motion, sound, image). It systematically lists and describes the relevant research papers, highlighting each one's contribution to generation quality, controllability, or efficiency. This organization maps the landscape from foundational techniques to specialized applications; the sketch below illustrates the denoising loop these models share.
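For orientation only: most models cataloged here share the same reverse-diffusion core, iteratively denoising a 5-D video latent under some conditioning signal. The following is a minimal, hypothetical PyTorch sketch of that loop; `ToyDenoiser`, `sample`, and all hyperparameters are illustrative placeholders, not drawn from any surveyed paper.

```python
import torch
import torch.nn as nn

class ToyDenoiser(nn.Module):
    """Stand-in for the U-Net or Transformer backbones the survey catalogs."""
    def __init__(self, channels=4, cond_dim=16):
        super().__init__()
        self.conv = nn.Conv3d(channels, channels, kernel_size=3, padding=1)
        self.cond_proj = nn.Linear(cond_dim, channels)

    def forward(self, x, t, cond):
        # A real model would also embed the timestep t; omitted for brevity.
        scale = self.cond_proj(cond)[:, :, None, None, None]
        return self.conv(x) + scale  # predicted noise, same shape as x

@torch.no_grad()
def sample(model, steps=50, shape=(1, 4, 8, 32, 32), cond_dim=16):
    """DDPM-style ancestral sampling over a (batch, channels, frames, H, W) latent."""
    betas = torch.linspace(1e-4, 0.02, steps)
    alphas = 1.0 - betas
    alpha_bars = torch.cumprod(alphas, dim=0)
    cond = torch.randn(shape[0], cond_dim)  # placeholder for a text embedding
    x = torch.randn(shape)                  # start from pure Gaussian noise
    for t in reversed(range(steps)):
        eps = model(x, t, cond)
        # Reverse-step mean: (x_t - beta_t / sqrt(1 - abar_t) * eps) / sqrt(alpha_t)
        x = (x - betas[t] / torch.sqrt(1.0 - alpha_bars[t]) * eps) / torch.sqrt(alphas[t])
        if t > 0:
            x = x + torch.sqrt(betas[t]) * torch.randn_like(x)
    return x

video_latent = sample(ToyDenoiser())
print(video_latent.shape)  # torch.Size([1, 4, 8, 32, 32])
```

The surveyed papers vary the backbone inside `ToyDenoiser`, the conditioning pathway, and the sampler itself, but this basic iterate-and-denoise structure is common to nearly all of them.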
Quick Start & Requirements
This repository is a curated list of research papers and contains no executable code. To run any of the underlying models, visit the linked GitHub repositories and follow their individual setup instructions and dependency requirements.
Maintenance & Community
The survey is updated periodically, with the latest version available on arXiv, and has been accepted for publication in ACM Computing Surveys (CSUR). Contact information is provided for suggestions and feedback.
Licensing & Compatibility
The survey repository itself does not impose licensing restrictions; each linked project carries its own license.
Limitations & Caveats
As a survey, this repository provides neither direct access to nor implementations of the described models. For practical use, refer to the original papers and their associated codebases.