Speech-to-text server framework with next-gen Kaldi
Sherpa is an open-source speech-to-text inference framework designed for efficient deployment of end-to-end (E2E) models, specifically transducer and CTC-based architectures. It targets developers and researchers needing to integrate pre-trained speech recognition models into applications, offering both C++ and Python APIs for flexibility.
How It Works
Sherpa leverages PyTorch for its core inference engine, focusing on optimized deployment of E2E models. This approach allows for streamlined integration of advanced speech recognition capabilities directly into applications, bypassing the complexities of model training pipelines.
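To make the inference step more concrete, here is a minimal sketch of greedy decoding for a CTC-based model, one of the two architecture families mentioned above. It is written in plain Python and is not Sherpa's API or implementation: the function name `ctc_greedy_decode`, the choice of `0` as the blank label, and the toy scores are all assumptions made for illustration.

```python
from typing import List, Sequence


def ctc_greedy_decode(
    frame_scores: Sequence[Sequence[float]], blank_id: int = 0
) -> List[int]:
    """Hypothetical helper: collapse per-frame best labels using the CTC rule.

    frame_scores holds one row of per-label scores for each audio frame.
    blank_id is the index of the CTC blank label (assumed to be 0 here).
    """
    tokens: List[int] = []
    prev = blank_id
    for scores in frame_scores:
        # Pick the highest-scoring label for this frame.
        best = max(range(len(scores)), key=scores.__getitem__)
        # Emit a label only if it is not blank and differs from the previous frame.
        if best != blank_id and best != prev:
            tokens.append(best)
        prev = best
    return tokens


# Toy scores for 7 frames over a 3-label vocabulary (label 0 is the blank).
example_scores = [
    [0.1, 0.8, 0.1],    # label 1
    [0.2, 0.7, 0.1],    # label 1 again (repeat, collapsed)
    [0.9, 0.05, 0.05],  # blank
    [0.1, 0.6, 0.3],    # label 1 (new emission, separated by a blank)
    [0.1, 0.2, 0.7],    # label 2
    [0.2, 0.1, 0.7],    # label 2 again (repeat, collapsed)
    [0.8, 0.1, 0.1],    # blank
]

print(ctc_greedy_decode(example_scores))  # prints [1, 1, 2]
```

In a real deployment the per-frame scores would come from the acoustic model and the resulting label IDs would be mapped back to text through the model's token table. Transducer models use a different decoding loop that this sketch does not cover.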
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project explicitly states it is not for model training or fine-tuning; users interested in those aspects should refer to the icefall project.