Text2Video-Zero  by Picsart-AI-Research

Zero-shot video generator using text-to-image diffusion models

created 2 years ago
4,200 stars

Top 11.9% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides the official implementation for Text2Video-Zero, a method that leverages text-to-image diffusion models for zero-shot video generation and editing. It is designed for researchers and developers working with generative AI, offering capabilities to create videos from text prompts, guided by poses or edges, and to edit existing videos based on instructions.

How It Works

Text2Video-Zero adapts pre-trained text-to-image diffusion models to video generation by introducing cross-frame attention mechanisms and motion field enrichment. This approach allows the model to maintain temporal consistency and adhere to textual prompts, while also supporting conditional generation based on pose or edge maps derived from input videos.

Quick Start & Requirements

Highlighted Details

  • Zero-shot text-to-video generation.
  • Conditional generation with pose, edge, and depth maps.
  • Video instruction-guided editing (Instruct-Pix2Pix).
  • Support for arbitrary video lengths and custom Dreambooth models.
  • Low-memory inference options available.

Maintenance & Community

The project is actively maintained by Picsart AI Research. Community contributions are welcomed, with several external implementations and extensions linked in the README.

Licensing & Compatibility

The code is published under the CreativeML Open RAIL-M license, which is permissive for research and commercial use, with specific clauses regarding responsible AI use.

Limitations & Caveats

The project is an active research implementation. While optimizations exist for lower VRAM, performance may vary. Some features, like background smoothing, require additional components not included in this repository.

Health Check
Last commit

2 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
36 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.