Offline STT engine for real-time speech recognition and VAD
Top 29.2% on sourcepulse
Sherpa-ncnn provides efficient, offline, real-time speech recognition and voice activity detection (VAD) for a wide range of devices and architectures. It targets developers building applications requiring on-device ASR and VAD, offering broad platform support and multiple language bindings.
How It Works
This project leverages the ncnn inference framework for optimized execution on diverse hardware, including CPUs and mobile platforms. It supports streaming speech-to-text and VAD, enabling real-time processing without internet connectivity. The architecture is designed for static linking, producing executables with minimal system dependencies beyond standard libraries.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project's licensing is not clearly stated in the README, which may impact commercial adoption. Specific performance benchmarks or detailed resource requirements for various platforms are not provided.
2 months ago
1 day