Realistic text-to-speech model under 25MB
New!
Top 6.8% on SourcePulse
KittenTTS is an open-source, ultra-lightweight text-to-speech model designed for high-quality voice synthesis on any device, even without a GPU. Targeting developers and users needing efficient, realistic TTS, it offers fast inference and premium voice options within a compact model size.
How It Works
KittenTTS utilizes a model with approximately 15 million parameters, optimized for CPU execution. This approach prioritizes efficiency and broad compatibility, enabling real-time speech synthesis on standard hardware without the need for specialized GPUs.
Quick Start & Requirements
pip install https://github.com/KittenML/KittenTTS/releases/download/0.1/kittentts-0.1.0-py3-none-any.whl
Highlighted Details
Maintenance & Community
The project is currently in developer preview. A Discord server is available for community engagement.
Licensing & Compatibility
The license is not explicitly stated in the provided README. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project is in developer preview, with the fully trained model weights and mobile/web SDKs yet to be released. The current release is a preview model.
1 day ago
Inactive