Awesome-Video-Diffusion-Models  by ChenHsing

Survey on video diffusion models

created 2 years ago
2,165 stars

Top 21.3% on sourcepulse

Project Summary

This repository serves as a comprehensive survey of video diffusion models, cataloging research across generation, editing, completion, enhancement, prediction, and understanding tasks. It targets researchers and practitioners in AI, computer vision, and multimedia, providing a structured overview of the rapidly evolving field of AI-powered video synthesis and manipulation.

How It Works

The survey categorizes video diffusion models based on their core methodologies (e.g., U-Net, Transformer-based) and conditioning mechanisms (e.g., text, pose, motion, sound, image). It systematically lists and describes numerous research papers, highlighting their contributions to advancing video generation quality, controllability, and efficiency. The organization facilitates understanding of the landscape, from foundational techniques to specialized applications.

Quick Start & Requirements

This repository is a curated list of research papers and does not contain executable code. Accessing the underlying models requires individual investigation of linked GitHub repositories and adherence to their specific setup instructions and dependencies.

Highlighted Details

  • Extensive categorization of video diffusion models by task and method.
  • Includes comprehensive lists of datasets, metrics, and benchmarks for evaluation.
  • Covers a wide range of conditioning modalities for video generation and editing.
  • Features recent advancements and foundational works in the field.

Maintenance & Community

The survey is updated periodically, and the latest version is available on arXiv. It has been accepted by ACM Computing Surveys (CSUR). The repository provides contact information for suggestions and feedback.

Licensing & Compatibility

The repository itself is a survey and does not impose licensing restrictions. Individual linked projects will have their own licenses.

Limitations & Caveats

This is a survey and does not provide direct access to or implementation of the described models. Users must refer to the original research papers and their associated codebases for practical application.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

92 stars in the last 90 days
