Discover and explore top open-source AI tools and projects—updated daily.
yeyupiaolingPaddlePaddle-based speech recognition framework
Top 41.3% on SourcePulse
PPASR is an end-to-end Chinese automatic speech recognition (ASR) framework built on PaddlePaddle. It offers a simplified and practical approach to ASR, supporting popular models like DeepSpeech2, Conformer, and Squeezeformer, with both streaming and non-streaming capabilities. The project targets developers and researchers looking for a flexible and efficient ASR solution that can be deployed on servers and edge devices like Nvidia Jetson.
How It Works
PPASR (V3) is a significant overhaul from V2, focusing on ease of use and enhanced performance. It utilizes kaldi_native_fbank for faster, multi-platform compatible audio preprocessing and sentencepiece for tokenization, enabling easier handling of multiple languages and mixed-language training. The framework supports various decoding methods (e.g., ctc_greedy_search, ctc_prefix_beam_search, attention_rescoring) and data augmentation techniques for improved robustness.
Quick Start & Requirements
Highlighted Details
attention_rescoring on WenetSpeech achieves a CER of 0.13786 on test_net.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
4 months ago
Inactive
OpenBMB
PaddlePaddle