SD-CN-Animation by volotat

Video stylization tool using StableDiffusion and ControlNet

created 2 years ago
821 stars

Top 44.1% on sourcepulse

Project Summary

This project provides an extension for the Automatic1111 Stable Diffusion web UI to automate video stylization and generation. It targets users looking to create stylized videos from existing footage (vid2vid) or generate entirely new videos from text prompts, offering control over resolution and length. The key benefit is enhanced stability and quality in video generation through optical flow estimation.

How It Works

In vid2vid mode, the extension uses RAFT optical flow estimation to keep the animation stable and to generate occlusion masks for frame-to-frame consistency. For text-to-video generation it relies on a custom "FloweR" method (still a work in progress) that predicts optical flow directly. ControlNet integration is important, especially in vid2vid mode, to avoid choppy results; the text-to-video mode can additionally use a video as ControlNet guidance for stronger stylization.
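To illustrate the occlusion-mask idea: pixels whose motion cannot be confirmed by a forward-backward consistency check are treated as occluded and excluded from frame-to-frame blending. This is a minimal NumPy sketch of that check, not the project's actual RAFT-based implementation; the function name and threshold are illustrative.

```python
import numpy as np

def occlusion_mask(fwd_flow, bwd_flow, thresh=1.0):
    """Forward-backward consistency check: a pixel is flagged as occluded
    when warping it forward and then backward does not return it near
    its origin. Flows are (H, W, 2) arrays of (dx, dy) displacements."""
    h, w = fwd_flow.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    # Destination of each pixel under the forward flow (nearest neighbour).
    xd = np.clip(np.round(xs + fwd_flow[..., 0]).astype(int), 0, w - 1)
    yd = np.clip(np.round(ys + fwd_flow[..., 1]).astype(int), 0, h - 1)
    # Backward flow sampled at each forward destination.
    bwd_at_dst = bwd_flow[yd, xd]
    # Round-trip displacement: ~0 wherever the motion is consistent.
    err = np.linalg.norm(fwd_flow + bwd_at_dst, axis=-1)
    return err > thresh  # True marks occluded / unreliable pixels

# Toy example: uniform motion of +2 px in x, consistent in both directions,
# so no pixel is flagged as occluded.
fwd = np.zeros((4, 4, 2)); fwd[..., 0] = 2.0
bwd = np.zeros((4, 4, 2)); bwd[..., 0] = -2.0
mask = occlusion_mask(fwd, bwd)
```

In the real extension the mask plays the same role: masked regions are regenerated rather than warped from the previous frame, which is what keeps the animation stable.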

Quick Start & Requirements

  • Install via Automatic1111 web UI: Extensions tab -> Install from URL, enter https://github.com/volotat/SD-CN-Animation.git.
  • Requires Automatic1111 web UI.
  • Not compatible with Macs.
  • Ensure 'Apply color correction to img2img results to match original colors.' is disabled in Stable Diffusion settings.
  • Update the web UI if you encounter the error 'Need to enable queue to use generators.'.
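The Extensions tab is the route the project documents; as a sketch, Automatic1111 extensions can also typically be installed by cloning into the web UI's `extensions` folder (the `stable-diffusion-webui` path below is an assumption — adjust it to your install):

```shell
# Assumes the web UI checkout lives at ./stable-diffusion-webui
cd stable-diffusion-webui/extensions
git clone https://github.com/volotat/SD-CN-Animation.git
# Restart the web UI so the extension is picked up
```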

Highlighted Details

  • Supports custom Stable Diffusion models.
  • Vid2vid mode allows fine-grained control via 'Extra params'.
  • Text-to-video mode automatically sets seed to -1 after the first frame to prevent blurring.
  • ControlNet can be used in text-to-video mode with video guidance for stronger stylization.

Maintenance & Community

  • Recent updates (v0.9) address multiple issues, improve vid2vid controls, and enhance ControlNet integration.
  • Primarily developed by volotat.

Licensing & Compatibility

  • The repository does not explicitly state a license in the provided README.

Limitations & Caveats

  • Not compatible with Macs.
  • The 'Apply color correction to img2img results to match original colors.' setting must be disabled, and older web UI versions may raise 'Need to enable queue to use generators.' errors.
  • The "FloweR" method for text-to-video is still a work in progress.
Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

  • 8 stars in the last 90 days

Explore Similar Projects

Tune-A-Video by showlab (4k stars)
Text-to-video generation via diffusion model fine-tuning
Created 2 years ago, updated 1 year ago
Starred by Chenlin Meng (Cofounder of Pika), Patrick von Platen (Core Contributor to Hugging Face Transformers and Diffusers), and 1 more.