Discover and explore top open-source AI tools and projects—updated daily.
thewh1teagleText-to-speech with ONNX Runtime
Top 20.1% on SourcePulse
This project provides a text-to-speech (TTS) system leveraging the Kokoro-TTS model and ONNX Runtime for efficient inference. It targets developers and users seeking a fast, lightweight, and multilingual TTS solution, offering near real-time performance on Apple Silicon (M1) and a compact model size.
How It Works
The system utilizes ONNX Runtime for accelerated model execution, enabling fast inference speeds. It supports multiple languages and voices, with version 1.0 models available. The architecture is designed for efficiency, with quantized models being particularly lightweight.
Quick Start & Requirements
uv for isolated Python environments: pip install uvuv init -p 3.12uv add kokoro-onnx soundfilekokoro-v1.0.onnx and voices-v1.0.bin.examples/save.py content into hello.py and ensure model files are in the same directory.uv run hello.pyHighlighted Details
Maintenance & Community
Information regarding community channels, roadmap, or specific maintainers is not detailed in the provided README.
Licensing & Compatibility
The kokoro-onnx package is licensed under MIT. The Kokoro model itself is licensed under Apache 2.0. These licenses appear compatible with most commercial and closed-source applications.
Limitations & Caveats
The README recommends using the misaki g2p package for version 1.0 models, suggesting potential compatibility considerations or performance benefits with this specific g2p implementation. Further details on community support or project roadmap are not readily available.
4 months ago
Inactive
huggingface