Text2Video-Zero by Picsart-AI-Research

Zero-shot video generator using text-to-image diffusion models

Created 2 years ago
4,216 stars

Top 11.6% on SourcePulse

Project Summary

This repository provides the official implementation for Text2Video-Zero, a method that leverages text-to-image diffusion models for zero-shot video generation and editing. It is designed for researchers and developers working with generative AI, offering capabilities to create videos from text prompts, guided by poses or edges, and to edit existing videos based on instructions.

How It Works

Text2Video-Zero adapts pre-trained text-to-image diffusion models to video generation by introducing cross-frame attention mechanisms and motion field enrichment. This approach allows the model to maintain temporal consistency and adhere to textual prompts, while also supporting conditional generation based on pose or edge maps derived from input videos.
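The cross-frame attention idea can be illustrated with a minimal sketch. It assumes that each frame's queries attend to the keys and values of a single anchor frame (here the first frame) rather than to the frame's own keys and values, which is one common way to keep appearance consistent across frames. The function names below are hypothetical and are not the repository's API; pure Python lists stand in for the tensors a real implementation would use.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of token vectors."""
    scale = 1.0 / math.sqrt(len(keys[0]))
    outputs = []
    for q in queries:
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights = softmax(scores)
        # Each output token is a weighted average of the value vectors.
        outputs.append([
            sum(w * v[d] for w, v in zip(weights, values))
            for d in range(len(values[0]))
        ])
    return outputs


def self_attention(frames_q, frames_k, frames_v):
    """Baseline: every frame attends only to its own keys and values."""
    return [attention(q, k, v) for q, k, v in zip(frames_q, frames_k, frames_v)]


def cross_frame_attention(frames_q, frames_k, frames_v, anchor=0):
    """Assumed variant: every frame attends to the anchor frame's keys/values."""
    return [attention(q, frames_k[anchor], frames_v[anchor]) for q in frames_q]


# Toy input: 2 frames, 2 tokens per frame, 2 channels per token.
frames_q = [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [1.0, 1.0]]]
frames_k = [[[1.0, 0.0], [0.0, 1.0]], [[2.0, 0.0], [0.0, 2.0]]]
frames_v = [[[1.0, 2.0], [3.0, 4.0]], [[10.0, 20.0], [30.0, 40.0]]]

cross = cross_frame_attention(frames_q, frames_k, frames_v)
baseline = self_attention(frames_q, frames_k, frames_v)
```

In this toy setup the anchor frame's output is unchanged relative to plain self-attention, while the second frame's tokens are built only from the first frame's values. That is the property the sketch is meant to show: later frames reuse the anchor frame's content instead of drifting independently.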

Quick Start & Requirements

Highlighted Details

  • Zero-shot text-to-video generation.
  • Conditional generation with pose, edge, and depth maps.
  • Instruction-guided video editing (Video Instruct-Pix2Pix).
  • Support for arbitrary video lengths and custom Dreambooth models.
  • Low-memory inference options available.

Maintenance & Community

The project is maintained by Picsart AI Research, though the last commit was 2 years ago and responsiveness is listed as inactive. Community contributions are welcomed, and the README links several external implementations and extensions.

Licensing & Compatibility

The code is published under the CreativeML Open RAIL-M license, which permits research and commercial use subject to its clauses on responsible AI use.

Limitations & Caveats

The project is a research implementation. Low-VRAM optimizations exist, but performance may vary. Some features, such as background smoothing, require additional components not included in this repository.

Health Check
Last Commit: 2 years ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0
Star History: 17 stars in the last 30 days
