WebUI for video translation/dubbing
Top 20.7% on sourcepulse
YouDub-webui is a Gradio-based web UI for the YouDub project, designed to translate and dub YouTube videos into Chinese using AI. It targets users who want to localize foreign video content for educational, entertainment, or professional purposes, offering an accessible interface for non-technical users.
How It Works
The tool orchestrates a pipeline of AI models for video localization. It begins with video downloading, followed by AI-powered speech recognition (using WhisperX for accurate transcription and speaker diarization), large language model-based translation (leveraging OpenAI GPT and other LLMs), and AI voice cloning (utilizing Coqui AI TTS and Volcano Engine TTS for natural-sounding dubbing). Finally, it handles video processing for audio-video synchronization, subtitle embedding, and optional auto-upload to Bilibili.
Quick Start & Requirements
setup_windows
for automatic dependency installation (including CUDA 12.1 PyTorch) or pip install -r requirements.txt
followed by pip install TTS
for manual setup..env
file).run_windows.bat
or python app.py
.Highlighted Details
api-for-open-llm
.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project primarily targets Windows users with the provided scripts. Manual installation of specific PyTorch CUDA versions may be required if the automatic setup does not match the user's environment. Credentials for various services (OpenAI, Hugging Face, Volcano Engine, Bilibili) are necessary for full functionality.
1 year ago
1+ week