Open-source TTS model
Top 5.2% on sourcepulse
Chatterbox TTS is an open-source text-to-speech model designed for content creators, developers, and AI agent builders. It offers state-of-the-art zero-shot voice cloning and unique emotion exaggeration control, aiming to provide high-quality, expressive speech synthesis that rivals closed-source solutions.
How It Works
Chatterbox utilizes a Llama backbone and alignment-informed inference for ultra-stable audio generation. Its key innovation is the emotion exaggeration control, allowing users to fine-tune the intensity and expressiveness of synthesized speech. This approach, combined with training on a large dataset, aims for superior quality and control in voice generation.
Quick Start & Requirements
pip install chatterbox-tts
example_tts.py
, example_vc.py
) are provided.Highlighted Details
audio_prompt_path
argument.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 day ago
Inactive