Rust crate for fast, high-quality TTS
Top 57.2% on sourcepulse
Kokoros provides an extremely fast, high-quality Text-to-Speech (TTS) inference engine implemented in Rust, based on the popular Kokoro model. It targets developers and users seeking real-time, embeddable TTS capabilities, offering significant performance gains over Python-based implementations.
How It Works
Kokoros leverages Rust for its performance and memory safety, enabling efficient inference of the 87M parameter Kokoro model. It integrates a phonemizer, removing external dependencies for end-to-end synthesis. The project supports multiple languages (English, Chinese, Japanese, German) with ongoing expansion plans.
Quick Start & Requirements
pip install -r scripts/requirements.txt
python scripts/fetch_voices.py
cargo build --release
./target/release/koko [options]
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is under active development, with some language support noted as "partly" implemented. The OpenAI server compatibility is still undergoing polish.
3 weeks ago
1 week