easyvideotrans by sutro-planet

Web backend for AI video translation and dubbing

Created 1 year ago

485 stars

Top 63.4% on SourcePulse

Project Summary

This project is a modular, self-hostable web backend for AI-powered video translation and dubbing. It targets users who need an end-to-end pipeline for video localization, and it aims to reduce the manual effort involved in translating and dubbing videos.

How It Works

The system uses a microservices architecture that separates frontend request handling, general video management, and GPU-intensive audio processing into distinct services. Services are deployed as Docker containers and orchestrated with Kubernetes, which allows the GPU workload to be scheduled and scaled separately from the rest of the backend. Video processing jobs pass through a RabbitMQ task queue and are picked up by workers, and transcription relies on faster-whisper.
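The queue-and-worker flow can be pictured with a minimal sketch. This is not the project's code: it substitutes Python's standard-library `queue.Queue` for RabbitMQ so that it runs without a broker, and the names `make_job`, `gpu_worker`, `run_demo`, and the job fields are hypothetical.

```python
import json
import queue
import threading

# Assumption: an in-process queue stands in for a RabbitMQ queue.
task_queue = queue.Queue()
results = {}


def make_job(video_id, target_lang):
    """Serialize a job as a JSON message body (field names are hypothetical)."""
    return json.dumps({"video_id": video_id, "target_lang": target_lang})


def gpu_worker():
    """Consume jobs until a None sentinel arrives."""
    while True:
        body = task_queue.get()
        if body is None:
            task_queue.task_done()
            break
        job = json.loads(body)
        # Placeholder for the GPU-intensive step (transcription, dubbing, ...).
        results[job["video_id"]] = f"dubbed:{job['target_lang']}"
        task_queue.task_done()


def run_demo():
    """Start one worker, enqueue two jobs, and return the collected results."""
    worker = threading.Thread(target=gpu_worker)
    worker.start()
    task_queue.put(make_job("abc123", "en"))
    task_queue.put(make_job("def456", "ja"))
    task_queue.put(None)  # sentinel: tells the worker to stop
    worker.join()
    return dict(results)


if __name__ == "__main__":
    print(run_demo())
```

The point of the pattern is that the producer only needs to know the message format, so the GPU worker can live in its own container and be restarted or scaled without touching the request-handling side.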

Quick Start & Requirements

Installation: Deploy via Kubernetes (kubectl apply -k ./k8s/prod) or Docker Compose (docker compose up).
Prerequisites: Python 3.9.19, PyTorch (GPU version required), FFmpeg, RabbitMQ. A GPU with NVIDIA drivers is required for the workload service.
Setup: Local setup involves installing Python dependencies (pip install -r requirements.txt), configuring RabbitMQ, and potentially downloading faster-whisper models. Kubernetes/Docker Compose deployment is recommended for ease of use.
Links: Online Demo, Grafana Monitoring, Frontend Repo, Offline Client
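For a local (non-container) setup, the prerequisites listed above can be checked up front. The following is a minimal sketch using only the standard library plus an optional `torch` import; the function name `check_prerequisites` and the result keys are hypothetical, and the Python check compares only the major and minor version.

```python
import importlib.util
import shutil
import sys


def check_prerequisites():
    """Return a dict mapping each prerequisite name to True or False."""
    checks = {
        # The summary lists Python 3.9.19; only 3.9 is checked here.
        "python_3_9": sys.version_info[:2] == (3, 9),
        # FFmpeg must be discoverable on PATH.
        "ffmpeg_on_path": shutil.which("ffmpeg") is not None,
        # PyTorch is detected without importing it.
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": False,
    }
    if checks["torch_installed"]:
        try:
            import torch

            # True only if a CUDA-capable GPU and driver are usable.
            checks["cuda_available"] = bool(torch.cuda.is_available())
        except Exception:
            # A broken PyTorch install counts as "no CUDA".
            checks["cuda_available"] = False
    return checks


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

RabbitMQ is not checked here because reaching it depends on the broker address in your own configuration.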

Highlighted Details

Microservices architecture with dedicated GPU workload container.
Supports self-hosting via Kubernetes and Docker Compose.
Utilizes faster-whisper for efficient speech-to-text.
Modular design allows for extensibility and secondary development.

Maintenance & Community

Active development with a focus on reliability and user experience.
Community support via a QQ group (ID: 536918174).
Developer presence on Bilibili and X (formerly Twitter).

Licensing & Compatibility

The README does not explicitly state a license. The project is presented as open-source and free to use. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The GPU version of PyTorch is mandatory, with no readily available CPU fallback.
Some TTS (Text-to-Speech) and translation models are still under evaluation or have stability issues.
Local deployment instructions may be outdated; refer to Dockerfiles for current environment setup.

Health Check

Last Commit: 2 months ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 1

Star History

4 stars in the last 30 days

Explore Similar Projects

xiaoniu by agan-j

AI tool for multilingual video localization and content creation

Created 1 year ago

Updated 1 day ago

Synthalingua by cyberofficial

Real-time translation tool using AI for audio transcription and translation

Created 2 years ago

Updated 1 day ago

generate-subtitles by mayeaux

Web app for audio/video transcription and translation

Created 3 years ago

Updated 2 years ago

Auto-Synced-Translated-Dubs by ThioJoe

CLI tool for auto-synced, translated video dubs

Created 3 years ago

Updated 16 hours ago

SmartSub by buxuku

Cross-platform tool for batch generating & translating video/audio subtitles

Created 1 year ago

Updated 2 days ago

Chenyme-AAVT by chenyme

All-in-one tool for media translation automation

Created 2 years ago

Updated 9 months ago

SoniTranslate by R3gm

Gradio web UI for video translation with synchronized audio

Created 2 years ago

Updated 1 month ago

Linly-Dubbing by Kedreamix

AI dubbing/translation tool for multi-language video content creation

Created 1 year ago

Updated 10 months ago

YouDub-webui by liuzhao1225

WebUI for video translation/dubbing

Created 2 years ago

Updated 1 month ago

KrillinAI by krillinai

Video tool for translation and dubbing using LLMs

Created 1 year ago

Updated 1 month ago

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems").

VideoLingo by Huanshere

AI tool for automated video translation, localization, and dubbing

Created 1 year ago

Updated 7 months ago

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Jiaming Song (Chief Scientist at Luma AI).

MoneyPrinterTurbo by harry0703

AI tool for one-click short video generation from text prompts

Created 1 year ago

Updated 4 weeks ago
