VBench by Vchitect

Benchmark suite for video generation models

Created 2 years ago

1,524 stars

Top 26.7% on SourcePulse

Project Summary

VBench provides a comprehensive benchmark suite for evaluating video generative models, targeting researchers and developers in the field of AI video generation. It offers a structured framework to assess various quality dimensions, enabling fine-grained and objective comparisons between different models.

How It Works

VBench decomposes "video generation quality" into 16 well-defined dimensions, each with a specific prompt suite and an automated evaluation method. It supports both Text-to-Video (T2V) and Image-to-Video (I2V) tasks, and can evaluate custom videos. The framework also incorporates human preference annotations to ensure alignment with human perception, and recent updates (VBench-2.0) extend evaluation to intrinsic faithfulness aspects like commonsense reasoning and physics.

Quick Start & Requirements

Installation: pip install vbench (requires PyTorch with CUDA <= 12.1). detectron2 is needed for some evaluations (pip install detectron2@git+https://github.com/facebookresearch/detectron2.git), which requires CUDA 11.X or 12.1.
Data: Download VBench_full_info.json for prompt suites.
Usage: vbench evaluate --videos_path <path> --dimension <dimension> or via Python API.
Links: Leaderboard, Model Info, Prompt Suites

Highlighted Details

Comprehensive evaluation across 16 dimensions, including technical quality and trustworthiness.
Supports both T2V and I2V models, with extensions for longer videos (VBench-Long) and intrinsic faithfulness (VBench-2.0).
VBench Arena allows users to view and vote on generated videos from over 40 supported models.
Released sampled videos and detailed model settings for transparency and reproducibility.

Maintenance & Community

Actively maintained with frequent updates, including VBench-2.0 and human anomaly detection pipelines.
Community engagement via GitHub issues and a Google Form for evaluation requests.
Related project: Awesome-Evaluation-of-Visual-Generation.

Licensing & Compatibility

The repository itself is likely under a permissive license (e.g., MIT, Apache 2.0, based on common practice for such projects), but specific license details are not explicitly stated in the README.
Compatibility for commercial use is generally expected for permissive licenses, but users should verify the specific license.

Limitations & Caveats

Detectron2 installation can be problematic and is restricted to specific CUDA versions (11.X or 12.1).
Some evaluation dimensions require specific dependencies or preprocessing steps (e.g., static video filtering for temporal flickering).

Health Check

Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)

2

Issues (30d)

5

Star History

52 stars in the last 30 days

Explore Similar Projects

Awesome-Text-to-Video-Generation by soraw-ai

AI video generation research and benchmarks

Created 2 years ago

Updated 2 days ago

Starred by

Jiaming Song

Jiaming Song(Chief Scientist at Luma AI).

MiraData by mira-space

Video dataset for long video generation research

Created 2 years ago

Updated 1 year ago

t2v-turbo by Ji4chenLi

Text-to-video generation research paper implementation

Created 1 year ago

Updated 1 year ago

Starred by

Shizhe Diao

Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA).

VideoTuna by VideoVerses

Codebase for text-to-video applications

Created 1 year ago

Updated 5 months ago

t2v_metrics by linzhiqiu

Evaluation metric for text-to-image/video/3D models

Created 2 years ago

Updated 5 months ago

awesome-video-generation by AlonzoLeeeooo

Awesome list for video generation studies

Created 2 years ago

Updated 2 months ago

Allegro by rhymes-ai

Text-to-video model for generating short, high-quality videos

Created 1 year ago

Updated 1 year ago

Starred by

Luis Capelo

Luis Capelo(Cofounder of Lightning AI).

Step-Video-T2V by stepfun-ai

Text-to-video model for generating high-fidelity, dynamic videos

Created 1 year ago

Updated 1 year ago

Starred by

Jiaming Song

Jiaming Song(Chief Scientist at Luma AI).

Awesome-Video-Diffusion by showlab

Curated list of video diffusion models for generation, editing, and more

Created 2 years ago

Updated 3 days ago

Starred by

Alex Yu

Alex Yu(Research Scientist at OpenAI; Cofounder of Luma AI),

Jiaming Song

Jiaming Song(Chief Scientist at Luma AI), and

1 more.

SkyReels-V2 by SkyworkAI

Film generation model for infinite-length videos using diffusion forcing

Created 11 months ago

Updated 1 month ago

Starred by

Andrej Karpathy

Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n).

Wan2.2 by Wan-Video

Advanced video generation models with MoE architecture

Created 7 months ago

Updated 1 week ago

Starred by

Luis Capelo

Luis Capelo(Cofounder of Lightning AI),

Paras Jain

Paras Jain(Cofounder of Genmo), and

7 more.

Open-Sora by hpcaitech

Video generation initiative for efficient, high-quality video production

Created 2 years ago

Updated 10 months ago

Feedback? Help us improve.