GUI framework for audiobook, subtitle, and dubbing generation
Top 64.4% on sourcepulse
Pandrator is a free, GUI-driven application for transforming text-based documents (PDF, EPUB) and video files into audiobooks, subtitles, and dubbed videos. It targets users who want to leverage local AI models for text-to-speech, voice cloning, and translation without complex setup, offering a user-friendly interface and all-in-one packages.
How It Works
Pandrator acts as a framework orchestrating various open-source AI tools. For audiobooks, it processes text from PDFs, EPUBs, or plain text, splitting it into manageable segments for TTS engines like XTTS or Silero. It supports voice cloning via XTTS, enhanced by RVC, and allows LLM-based text preprocessing for naturalness. For dubbing, it transcribes video audio using WhisperX, translates subtitles via various APIs or local LLMs, and synthesizes new audio, finally synchronizing it with the video.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is actively developed by a self-identified "noob" developer seeking contributions and feedback. A Discord server is available for community interaction and support.
Licensing & Compatibility
The project's primary dependencies include open-source libraries. Specific licensing details for each component (e.g., XTTS, WhisperX) should be reviewed, as they may have their own terms of use. The project itself appears to be available under a permissive license, but this is not explicitly stated in the README.
Limitations & Caveats
Pandrator is in alpha stage, with the developer noting the code is not optimized and may lack features or reliability. Manual installation on Linux is required. Antivirus software may flag the Windows installer/launcher. Some advanced features require separate setup of external APIs and models.
3 months ago
1 day