pyvideotrans  by jianchang512

Video translation CLI tool

created 1 year ago
13,350 stars

Top 3.8% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides a comprehensive video translation and dubbing tool, enabling users to translate video content into different languages, automatically generating and embedding subtitles and voiceovers. It caters to content creators, educators, and anyone needing to localize video content, offering a streamlined workflow for multilingual video production.

How It Works

The tool leverages a modular architecture, supporting various open-source and API-based solutions for core functionalities. Speech recognition can utilize faster-whisper, openai-whisper, or cloud APIs like Google Speech and Alibaba. Text translation is integrated with services such as Microsoft Translator, Google Translate, ChatGPT, and DeepL. Speech synthesis options include Microsoft Edge TTS, Google TTS, Azure AI TTS, and custom TTS servers like GPT-SoVITS and ChatTTS. It also incorporates uvr5 for background music separation and preservation.

Quick Start & Requirements

  • Installation: Pre-packaged executables are available for Windows 10/11. For macOS and Linux, source deployment via git clone and pip install -r requirements.txt is required.
  • Prerequisites: Python 3.10+ is recommended. macOS and Linux require Homebrew for libsndfile, ffmpeg, git, and python@3.10. Windows users need Python 3.10 and Git. CUDA 11.8+ is required for GPU acceleration (specifically torch==2.2.0 with cu118).
  • Setup: Detailed instructions are provided for Windows, macOS, and Linux, including environment setup and dependency installation.
  • Docs: https://pyvideotrans.com

Highlighted Details

  • Supports over 20 languages with automatic language detection.
  • Offers batch processing for multiple files and subtitle operations.
  • Includes features for subtitle editing, format conversion, video/audio merging, and YouTube video downloading.
  • Provides API access for core translation and speech functionalities.

Maintenance & Community

The project is actively maintained by jianchang512. Community support and updates can be found via their WeChat public account. Related projects like ChatTTS-ui and OTT are also mentioned.

Licensing & Compatibility

The repository does not explicitly state a license in the README. However, the inclusion of various third-party libraries implies adherence to their respective licenses. Commercial use compatibility would require further investigation into the licensing of all dependencies.

Limitations & Caveats

The pre-packaged Windows version may trigger antivirus warnings due to the use of pyinstaller. Users with CUDA versions lower than 12.x might need to install ctranslate2==3.24.0 if encountering compatibility issues with the default ctranslate2 version. Some "module not found" errors can be resolved by removing version specifiers in requirements.txt.

Health Check
Last commit

3 days ago

Responsiveness

1 day

Pull Requests (30d)
1
Issues (30d)
21
Star History
784 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.