Discover and explore top open-source AI tools and projects—updated daily.
PaddlePaddle-based speech recognition framework
Top 41.5% on SourcePulse
PPASR is an end-to-end Chinese automatic speech recognition (ASR) framework built on PaddlePaddle. It offers a simplified and practical approach to ASR, supporting popular models like DeepSpeech2, Conformer, and Squeezeformer, with both streaming and non-streaming capabilities. The project targets developers and researchers looking for a flexible and efficient ASR solution that can be deployed on servers and edge devices like Nvidia Jetson.
How It Works
PPASR (V3) is a significant overhaul from V2, focusing on ease of use and enhanced performance. It utilizes kaldi_native_fbank
for faster, multi-platform compatible audio preprocessing and sentencepiece
for tokenization, enabling easier handling of multiple languages and mixed-language training. The framework supports various decoding methods (e.g., ctc_greedy_search
, ctc_prefix_beam_search
, attention_rescoring
) and data augmentation techniques for improved robustness.
Quick Start & Requirements
Highlighted Details
attention_rescoring
on WenetSpeech achieves a CER of 0.13786 on test_net
.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
3 months ago
Inactive