Open-source library for automatic speech recognition
RapidASR is a cross-platform, commercial-grade open-source library for Automatic Speech Recognition (ASR) inference. It is aimed at developers who need easy-to-use, offline speech recognition for Chinese and English. It simplifies ASR model integration by building on ONNXRuntime and the FunASR framework, and it offers a unified API across several ASR models.
How It Works
Audio input passes through a three-stage pipeline. An optional Voice Activity Detection (VAD) stage first segments the audio into speech regions. The core ASR model then runs inference on each region using ONNXRuntime. Finally, punctuation restoration (RapidPunc) refines the recognized text. Because every stage runs on ONNXRuntime, inference is efficient and portable across platforms, and different ASR models, such as the Paraformer model from Alibaba Damo Academy, can be swapped in easily.
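The flow described above can be sketched in a few lines of Python. This is only an illustration of the stage ordering (optional VAD, then ASR, then punctuation), not RapidASR's actual API: the names `energy_vad`, `SpeechPipeline`, `fake_asr`, and `fake_punc` are hypothetical, the VAD is a toy energy threshold rather than a trained model, and the ASR and punctuation stages are stand-in functions where a real setup would call ONNXRuntime-backed models.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple

Samples = Sequence[float]
Segment = Tuple[int, int]  # half-open [start, end) sample indices


def energy_vad(samples: Samples, frame_len: int = 160,
               threshold: float = 0.01) -> List[Segment]:
    """Toy VAD (assumption): a frame counts as speech when its mean
    energy exceeds `threshold`; consecutive speech frames are merged."""
    segments: List[Segment] = []
    start: Optional[int] = None
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        if energy > threshold:
            if start is None:
                start = i  # a speech region begins here
        elif start is not None:
            segments.append((start, i))  # the speech region ends here
            start = None
    if start is not None:
        segments.append((start, len(samples)))
    return segments


@dataclass
class SpeechPipeline:
    """Hypothetical wrapper chaining the three stages from the text."""
    asr: Callable[[Samples], str]
    punctuate: Callable[[str], str]
    vad: Optional[Callable[[Samples], List[Segment]]] = None

    def __call__(self, samples: Samples) -> str:
        # Without a VAD stage, treat the whole clip as one segment.
        segments = self.vad(samples) if self.vad else [(0, len(samples))]
        pieces = [self.asr(samples[s:e]) for s, e in segments]
        text = " ".join(p for p in pieces if p)
        return self.punctuate(text)


# Stand-in stages so the sketch runs without any model files.
def fake_asr(chunk: Samples) -> str:
    return f"chunk{len(chunk)}"


def fake_punc(text: str) -> str:
    return text + "." if text else text


if __name__ == "__main__":
    audio = [0.0] * 320 + [0.5] * 320 + [0.0] * 160 + [0.5] * 160
    pipeline = SpeechPipeline(asr=fake_asr, punctuate=fake_punc, vad=energy_vad)
    print(pipeline(audio))
```

Keeping each stage as a plain callable mirrors the modularity the description emphasizes: the VAD can be dropped by leaving `vad` unset, and a different recognizer can be substituted without touching the rest of the chain.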
Quick Start & Requirements
pip install rapid_paraformer
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project's core code has been merged into FunASR, so future maintenance may shift to that project. Licensing terms for commercial use are not detailed, so teams considering enterprise adoption should verify them before deployment.
1 year ago
Inactive