Discover and explore top open-source AI tools and projects—updated daily.
shang-zhuAI-powered video translation and dubbing
Top 44.4% on SourcePulse
Open-source Video Translation Skill
Violin is an open-source video translation skill that automates transcription, translation, and dubbing into 33 languages. It targets engineers and power users seeking efficient video localization, delivering synchronized, native-sounding voice-overs and optional subtitles, significantly reducing manual effort.
How It Works
The core pipeline leverages ffmpeg for audio extraction, Whisper Large v3 for transcription and timestamping, and a configurable LLM (defaulting to DeepSeek V4 Pro) for segment translation. Text-to-speech synthesis is handled by providers like Cartesia Sonic 3 or ElevenLabs. ffmpeg then re-syncs the video with the synthesized audio, optionally generating SRT subtitles. Its pluggable architecture allows interchangeable providers for each stage.
Quick Start & Requirements
uv tool install violin (recommended) or pip install violin.ffmpeg on PATH. API keys for chosen providers (e.g., Together AI, OpenAI, ElevenLabs) are necessary.Highlighted Details
Maintenance & Community
This is a personal open-source project. Contributions via PRs are welcome. For questions or bug reports, contact heyviolinai@gmail.com. No dedicated community channels (Discord/Slack) are listed.
Licensing & Compatibility
Limitations & Caveats
Style profiles are experimental. The project is a personal endeavor; users must ensure they have rights to translate content, as it's intended for Creative Commons, public domain, or self-owned recordings.
5 days ago
Inactive