Film generation model for infinite-length videos using diffusion forcing
SkyReels-V2 is an open-source video generation model designed for creating infinite-length films, addressing limitations in prompt adherence, visual quality, motion dynamics, and duration. It targets researchers and developers in AI video synthesis, offering a novel AutoRegressive Diffusion-Forcing architecture for state-of-the-art performance.
How It Works
SkyReels-V2 employs an AutoRegressive Diffusion-Forcing architecture, which supports indefinite video generation by assigning each token its own independent noise level. This acts as a form of partial masking: the model learns to "unmask" variably noised tokens, using the cleaner tokens as conditioning information. The method builds on full-sequence diffusion models and allows a video to be extended seamlessly, with each new segment conditioned on the previously generated ones.
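The per-token noise idea can be illustrated with a minimal NumPy sketch. This is not the SkyReels-V2 implementation: the function names, the (T, D) token layout, and the simple variance-preserving noising formula are all assumptions chosen for illustration. It shows two cases: a training-style sample where every token draws its own noise level, and an extension-style sample where previously generated tokens stay clean and only the new tokens are noised.

```python
import numpy as np

def add_per_token_noise(tokens, noise_levels, rng):
    """Noise each token independently (hypothetical helper).

    tokens:       array of shape (T, D), clean latent tokens.
    noise_levels: array of shape (T,), values in [0, 1];
                  0 leaves a token clean, 1 replaces it with pure noise.
    """
    sigma = noise_levels[:, None]
    eps = rng.standard_normal(tokens.shape)
    # Variance-preserving mix, assumed here for illustration only.
    return np.sqrt(1.0 - sigma**2) * tokens + sigma * eps

def extension_noise_levels(num_context, num_new, level=1.0):
    """Noise levels for extending a video (hypothetical helper):
    context tokens stay clean, new tokens start at `level`."""
    return np.concatenate([np.zeros(num_context), np.full(num_new, level)])

rng = np.random.default_rng(0)
tokens = rng.standard_normal((6, 4))  # 6 tokens, 4 features each

# Training-style sample: an independent noise level per token.
train_levels = rng.uniform(0.0, 1.0, size=6)
noisy_train = add_per_token_noise(tokens, train_levels, rng)

# Extension-style sample: 4 clean context tokens, 2 fully noised new tokens.
ext_levels = extension_noise_levels(4, 2)
noisy_ext = add_per_token_noise(tokens, ext_levels, rng)
```

In the extension case the first four tokens pass through unchanged, so a denoiser would see them as clean conditioning while it reconstructs the last two. Repeating this step with a sliding context is one way to picture how generation could continue indefinitely.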
Quick Start & Requirements
pip install -r requirements.txt
torchrun --nproc_per_node=N
Highlighted Details
Maintenance & Community
The project is actively developed with recent releases in April 2025. Community support is available via Discord.
Licensing & Compatibility
The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
Some model variants (e.g., 5B models, Camera Director models) are listed as "Coming Soon." The prompt enhancer, while useful, may lead to over-saturation with long prompts and requires significant VRAM (64GB+).