TikTok video generator using Whisper, Edge TTS, and FFMPEG
Top 91.7% on sourcepulse
This project provides an end-to-end solution for generating TikTok videos using AI. It targets content creators who want to automate video production: generating natural-sounding voiceovers, transcribing them into subtitles, and assembling the final video with customizable subtitles. The primary benefit is a significant reduction in the manual effort required to create engaging short-form videos.
How It Works
The system orchestrates a pipeline involving several AI models and tools. It starts by fetching a background video (randomly or from a specified URL) and uses Microsoft Edge Cloud TTS for natural voiceovers. OpenAI's Whisper model transcribes the generated audio into SRT subtitles, which are then embedded into the background video using FFMPEG. The process is configurable via a JSON file and offers command-line options for customization.
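The last two stages of this pipeline (writing SRT subtitles and embedding them with FFMPEG) can be illustrated with a minimal sketch. This is not the project's code: the function names, the `(start, end, text)` segment shape, and the codec choices (`libx264`, `aac`) are assumptions made for illustration, and the sketch assumes a simple subtitle path that needs no escaping in the FFMPEG filter string.

```python
def format_srt_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Render (start, end, text) tuples as the body of an SRT file.

    The tuple shape is a hypothetical stand-in for whatever the
    transcription step returns.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_srt_timestamp(start)} --> {format_srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # SRT cues are separated by a blank line.
    return "\n".join(blocks)


def build_ffmpeg_command(video, audio, srt, output):
    """Build an FFMPEG argument list that burns in subtitles and
    replaces the background video's audio with the voiceover."""
    return [
        "ffmpeg", "-y",
        "-i", str(video),              # input 0: background video
        "-i", str(audio),              # input 1: generated voiceover
        "-vf", f"subtitles={srt}",     # burn the SRT cues into the frames
        "-map", "0:v:0",               # video stream from input 0
        "-map", "1:a:0",               # audio stream from input 1
        "-c:v", "libx264",             # assumed codecs, not the project's
        "-c:a", "aac",
        "-shortest",                   # stop at the shorter of the two inputs
        str(output),
    ]
```

The returned list can be passed to `subprocess.run(cmd, check=True)` on a machine where FFMPEG is installed with subtitle (libass) support. Building the command as a list rather than a shell string avoids quoting problems with file names.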
Quick Start & Requirements
pip install -U -r requirements.txt
cookies.txt file generated via a provided guide.
streamlit run app.py --server.port=8501 --server.address=0.0.0.0
python main.py [OPTIONS]
Highlighted Details
Maintenance & Community
edge-tts and stable-ts.
Licensing & Compatibility
Limitations & Caveats
6 months ago
Inactive