AI system for minute-level vlog generation from user descriptions
Vlogger is an AI system that generates minute-level video blogs (vlogs) from user descriptions. It targets users who need longer, narrative-driven video content, and it breaks this complex generation task into structured stages modeled on a human production workflow. The goal is coherent, engaging long-form video from a simple text prompt.
How It Works
Vlogger uses a modular architecture in which a Large Language Model (LLM) acts as a "Director" that orchestrates generation. The Director decomposes vlog creation into four stages: Script (script generation), Actor (actor selection), ShowMaker (video snippet generation), and Voicer (audio generation). The core innovation is ShowMaker, a novel video diffusion model that plays the role of videographer. ShowMaker improves spatial-temporal coherence by conditioning on textual prompts from the Script stage and visual prompts from the Actor stage, and it is trained with a mixed paradigm that covers both text-to-video (T2V) generation and prediction.
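The four-stage flow described above can be sketched in Python as follows. This is only an illustrative outline, not Vlogger's actual code: every name here (Scene, write_script, select_actors, show_maker, voicer, make_vlog) is hypothetical, and each stage returns a plain string standing in for the LLM or model that would do the real work.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Scene:
    """One script entry plus the artifacts later stages attach to it."""
    description: str                  # textual prompt from the Script stage
    actor_ref: Optional[str] = None   # visual prompt from the Actor stage
    clip: Optional[str] = None        # placeholder for a generated video snippet
    audio: Optional[str] = None       # placeholder for generated narration


def write_script(story: str) -> List[Scene]:
    # Stand-in for the LLM Director (assumption): one scene per sentence.
    sentences = [s.strip() for s in story.split(".") if s.strip()]
    return [Scene(description=s) for s in sentences]


def select_actors(scenes: List[Scene], actors: Dict[str, str]) -> None:
    # Attach a reference image to each scene that mentions a known actor name.
    for scene in scenes:
        for name, ref in actors.items():
            if name.lower() in scene.description.lower():
                scene.actor_ref = ref
                break


def show_maker(scene: Scene) -> str:
    # Stand-in for the video diffusion model: it receives both the text
    # prompt and the (possibly missing) actor reference.
    return f"clip[{scene.description}|{scene.actor_ref}]"


def voicer(scene: Scene) -> str:
    # Stand-in for audio generation from the scene description.
    return f"audio[{scene.description}]"


def make_vlog(story: str, actors: Dict[str, str]) -> List[Scene]:
    """Run the four stages in order: Script, Actor, ShowMaker, Voicer."""
    scenes = write_script(story)
    select_actors(scenes, actors)
    for scene in scenes:
        scene.clip = show_maker(scene)
        scene.audio = voicer(scene)
    return scenes


if __name__ == "__main__":
    demo = make_vlog(
        "Teddy walks to the beach. The sun sets over the water.",
        {"Teddy": "teddy.png"},
    )
    for scene in demo:
        print(scene)
```

Keeping each stage as a separate function mirrors the modular design: any stand-in could be swapped for a real model without changing how the stages are chained.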
Quick Start & Requirements
Install: conda create -n vlogger python==3.10.11 and conda activate vlogger, then pip install -r requirements.txt
Pretrained models: expected in the ../pretrained directory.
Script generation: python sample_scripts/vlog_write_script.py
Vlog generation from a script: python sample_scripts/vlog_read_script_sample.py
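A small preflight check can catch a missing ../pretrained directory or missing sample scripts before any model is loaded. The sketch below is a hypothetical helper, not part of the project. It assumes that ../pretrained is resolved relative to the repository root and that the write-script step runs before the read-script step. The name plan_commands is illustrative.

```python
import sys
from pathlib import Path
from typing import List

# Sample scripts named in the quick start, in the assumed run order.
SCRIPTS = [
    "sample_scripts/vlog_write_script.py",
    "sample_scripts/vlog_read_script_sample.py",
]


def plan_commands(repo_root: Path, python: str = sys.executable) -> List[List[str]]:
    """Return the commands to run, or raise if the expected layout is missing."""
    # Assumption: "../pretrained" is relative to the repository root.
    pretrained = repo_root / ".." / "pretrained"
    if not pretrained.is_dir():
        raise FileNotFoundError(f"pretrained directory not found: {pretrained}")

    missing = [s for s in SCRIPTS if not (repo_root / s).is_file()]
    if missing:
        raise FileNotFoundError(f"missing sample scripts: {missing}")

    # One argument list per script, suitable for subprocess.run.
    return [[python, script] for script in SCRIPTS]
```

Each returned command could then be executed with subprocess.run(cmd, cwd=repo_root, check=True) from inside the activated vlogger environment.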
Highlighted Details
Maintenance & Community
The project is associated with researchers from institutions such as PJLab. Contact information for key contributors is provided. The code builds on SEINE, LaVie, diffusers, and Stable Diffusion.
Licensing & Compatibility
The code is licensed under Apache-2.0. Model weights are fully open for academic research and permit free commercial usage. For commercial licensing inquiries, contact zhuangshaobin@pjlab.org.cn.
Limitations & Caveats
The system is not trained for realistic representation of people or events, and its use for generating demeaning, harmful, or violent content is prohibited. Users are solely liable for their actions.