Discover and explore top open-source AI tools and projects—updated daily.
crisng95AI video generation system for end-to-end content creation
Top 69.9% on SourcePulse
This project provides a standalone system for generating AI-driven video content via the Google Flow API, designed to automate the creation of professional-quality videos from concept to final output. It targets engineers, researchers, and power users seeking to streamline video production, offering end-to-end capabilities including narration, branding, and YouTube-ready formatting, thereby eliminating manual effort.
How It Works
The system employs a Python agent (FastAPI) that communicates with a Chrome extension (MV3 Service Worker) via WebSockets. This extension acts as a crucial bridge, handling authentication, reCAPTCHA solving, and proxying API calls to Google Flow. A core innovation is its "reference image system," which ensures visual consistency across scenes by generating and reusing distinct reference images for characters, locations, and props. Scene prompts then focus solely on actions, leveraging these references for appearance. The pipeline orchestrates story conception, entity definition, reference image generation, scene composition, video clip creation, text-to-speech narration, concatenation, thumbnail generation, and YouTube metadata creation.
Quick Start & Requirements
./setup.sh for a one-command setup that checks and installs Python 3.10+, pip, ffmpeg, ffprobe, and Chrome, then creates a virtual environment and installs dependencies.pip install -r requirements.txt.wsl --install) or Git Bash.https://labs.google/fx/tools/flowHighlighted Details
Maintenance & Community
facebook.com/groups/flowkit.flowboard.community.Licensing & Compatibility
Limitations & Caveats
The system is tightly coupled to the Google Flow API and requires a continuously running Chrome browser with the provided extension. Setup involves specific software versions (Python 3.10+, ffmpeg) and a Unix-like shell environment. Functionality like 4K upscaling is designated as "TIER_TWO only," implying potential cost or access limitations. The reliance on reCAPTCHA solving via the extension can introduce points of failure.
2 weeks ago
Inactive