Real-time AI translator for cross-lingual online communication
Top 43.0% on sourcepulse
LanguageLeapAI provides real-time AI-powered translation for cross-lingual voice communication, targeting users who wish to converse with others speaking different languages. It offers both voice translation and audio subtitling capabilities, enabling seamless interaction in online games or other communication platforms.
How It Works
The system comprises two Python scripts leveraging free, open-source AI models. voice_translator.py
captures microphone input via push-to-talk, transcribes it using WhisperAI, translates the text with DeepL (or Google Translate as a fallback), and synthesizes speech in the target language using Voicevox (or Thorsten for German). subtitler.py
captures application audio output, transcribes and translates speech to English using WhisperAI, and displays it as on-screen subtitles using Tkinter. The architecture relies on Docker Compose for deploying WhisperAI and Voicevox, with optional Google Colab integration for GPU-accelerated processing.
Quick Start & Requirements
docker-compose up -d
) for WhisperAI and Voicevox, followed by running Python scripts (python voice_translator.py
or python subtitler.py
) from the src/
directory.Highlighted Details
Maintenance & Community
The project is maintained by SociallyIneptWeeb. Community links or roadmaps are not explicitly mentioned in the README.
Licensing & Compatibility
Limitations & Caveats
WhisperAI's accuracy is not guaranteed to be 100%, and it may struggle with multiple concurrent requests. Antivirus software might block ngrok/localtunnel connections used for cloud deployments. The current TTS implementation is primarily focused on Japanese (Voicevox) and German (Thorsten), requiring manual replacement for other languages.
2 years ago
1 day