Browser-based text-to-speech
This open-source project runs text-to-speech (TTS) entirely in the browser using the Kokoro-82M model. It targets users who want private, offline TTS with no server-side processing: high-quality audio is generated on the device, directly within a modern web browser.
How It Works
The application uses the Kokoro-82M-v1.0-ONNX model, which is downloaded and cached locally. Audio generation runs in Web Workers so that the UI is not blocked. The application automatically uses WebGPU acceleration on compatible hardware and falls back to WebAssembly (WASM) elsewhere for broader browser support. Output is streamed as high-quality 24 kHz audio.
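The WebGPU-first, WASM-fallback choice described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: `pickBackend`, `Backend`, `GpuLike`, and `NavigatorLike` are hypothetical names introduced here, and the only browser API assumed is the standard `navigator.gpu.requestAdapter()`.

```typescript
// Hypothetical sketch: choose an inference backend, preferring WebGPU.
type Backend = "webgpu" | "wasm";

// Minimal structural types so the function can be exercised outside a browser.
interface GpuLike {
  requestAdapter(): Promise<unknown>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

async function pickBackend(nav: NavigatorLike): Promise<Backend> {
  // Browsers without WebGPU do not expose `navigator.gpu` at all.
  if (!nav.gpu) return "wasm";
  try {
    // requestAdapter() resolves to null when no suitable GPU is available.
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "webgpu" : "wasm";
  } catch {
    // Treat any failure while probing the GPU as "not supported".
    return "wasm";
  }
}

// In a browser this would be called as:
//   const backend = await pickBackend(navigator as NavigatorLike);
```

Passing the navigator object in as a parameter, rather than reading the global, keeps the check easy to run inside a Web Worker (where the global is `self.navigator`) and easy to test with a stub.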
Maintenance & Community
The project is open source under the Apache License 2.0. It acknowledges Hugging Face Transformers.js and the Kokoro-82M model.
Licensing & Compatibility
Licensed under the Apache License 2.0, which permits commercial use and integration with closed-source applications.
Limitations & Caveats
Firefox is reported as not working at present. WebGPU support is recommended for best performance, so generation may be slower on older or less capable hardware that falls back to WASM.