Discover and explore top open-source AI tools and projects—updated daily.
Text-to-speech with ONNX Runtime
Top 20.6% on SourcePulse
This project provides a text-to-speech (TTS) system leveraging the Kokoro-TTS model and ONNX Runtime for efficient inference. It targets developers and users seeking a fast, lightweight, and multilingual TTS solution, offering near real-time performance on Apple Silicon (M1) and a compact model size.
How It Works
The system utilizes ONNX Runtime for accelerated model execution, enabling fast inference speeds. It supports multiple languages and voices, with version 1.0 models available. The architecture is designed for efficiency, with quantized models being particularly lightweight.
Quick Start & Requirements
uv
for isolated Python environments: pip install uv
uv init -p 3.12
uv add kokoro-onnx soundfile
kokoro-v1.0.onnx
and voices-v1.0.bin
.examples/save.py
content into hello.py
and ensure model files are in the same directory.uv run hello.py
Highlighted Details
Maintenance & Community
Information regarding community channels, roadmap, or specific maintainers is not detailed in the provided README.
Licensing & Compatibility
The kokoro-onnx
package is licensed under MIT. The Kokoro model itself is licensed under Apache 2.0. These licenses appear compatible with most commercial and closed-source applications.
Limitations & Caveats
The README recommends using the misaki
g2p package for version 1.0 models, suggesting potential compatibility considerations or performance benefits with this specific g2p implementation. Further details on community support or project roadmap are not readily available.
3 months ago
Inactive