Text-to-speech tool based on Coqui TTS engine
AllTalk TTS is a Python-based extension for the Text Generation Web UI, offering advanced text-to-speech capabilities. It targets users who want to integrate high-quality, customizable speech synthesis into their workflows, particularly within conversational AI or content creation, by leveraging the Coqui TTS engine.
How It Works
AllTalk TTS is built upon the Coqui TTS engine, specifically supporting XTTSv2 models. It provides a user-friendly interface with features like model finetuning, support for custom local models, and efficient batch processing. Advanced options include DeepSpeed for performance acceleration and a low VRAM mode, making it accessible even on hardware with limited GPU memory.
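To make the batch-processing idea concrete, here is a minimal sketch that calls the underlying Coqui TTS Python API directly with an XTTSv2 model. It does not use AllTalk's own interface or code. The helper names plan_batch and synthesize_batch, the output file naming, and the reference-voice path are assumptions made for illustration. The Coqui import is deferred so the planning step can be tried without the TTS package installed.

```python
from pathlib import Path

# Coqui's published model identifier for XTTSv2.
XTTS_V2_MODEL = "tts_models/multilingual/multi-dataset/xtts_v2"


def plan_batch(lines, out_dir):
    """Pair each non-empty line of text with a numbered output WAV path.

    Hypothetical helper: the naming scheme is an assumption, not AllTalk's.
    """
    out_dir = Path(out_dir)
    texts = [line.strip() for line in lines if line.strip()]
    return [
        (text, out_dir / f"line_{index:03d}.wav")
        for index, text in enumerate(texts, start=1)
    ]


def synthesize_batch(lines, speaker_wav, out_dir, language="en", use_gpu=False):
    """Synthesize every planned line to its own WAV file with XTTSv2.

    speaker_wav is a short reference recording of the voice to clone.
    """
    # Imported lazily: requires the Coqui TTS package, and the model is
    # downloaded on first use.
    from TTS.api import TTS

    tts = TTS(XTTS_V2_MODEL)
    if use_gpu:
        tts = tts.to("cuda")

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    written = []
    for text, wav_path in plan_batch(lines, out_dir):
        tts.tts_to_file(
            text=text,
            speaker_wav=speaker_wav,
            language=language,
            file_path=str(wav_path),
        )
        written.append(wav_path)
    return written
```

A call such as synthesize_batch(open("script.txt"), "voice_sample.wav", "outputs") would write one WAV file per non-empty line; both file names here are placeholders. Loading the model once and reusing it across lines is what keeps a batch run cheaper than invoking synthesis separately for each line.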
Quick Start & Requirements
Clone the repository (git clone https://github.com/erew123/alltalk_tts) or use the provided setup scripts (atsetup.bat / ./atsetup.sh) for standalone installations.
Highlighted Details
Maintenance & Community
The project is maintained by a solo developer, with community support encouraged through discussions and issue reporting. Links to Discord/Slack are not explicitly provided in the README.
Licensing & Compatibility
The repository does not explicitly state a license. Compatibility with commercial or closed-source linking is not specified.
Limitations & Caveats
The project is primarily focused on Version 1, with Version 2 still evolving. Docker and Google Colab support are noted as experimental or in development. The developer is not the creator of the TTS models and advises users to consult original developers for model-specific issues.