Speech recognition toolkit for bridging research and industrial applications
Top 4.0% on SourcePulse
FunASR is an end-to-end speech recognition toolkit designed for both academic research and industrial applications. It provides a unified platform for automatic speech recognition (ASR), voice activity detection (VAD), punctuation restoration, and other speech processing tasks, so researchers and developers can build and deploy ASR systems efficiently.
How It Works
FunASR uses a modular architecture that lets users combine pre-trained models for different speech tasks. It supports both non-streaming and streaming inference, using models such as Paraformer (a parallel, non-autoregressive Transformer) for high accuracy and efficiency. The toolkit also supports fine-tuning on custom datasets and offers deployment options for real-time and file-based transcription services.
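A minimal sketch of how the modular combination described above might look in Python. It assumes the `AutoModel` entry point and the model aliases `paraformer-zh`, `fsmn-vad`, and `ct-punc` from the FunASR README; the helper names `pipeline_kwargs` and `transcribe` and the file name in the usage note are hypothetical and exist only for illustration. Check the installed version's documentation before relying on any of these names.

```python
def pipeline_kwargs(asr="paraformer-zh", vad="fsmn-vad", punc="ct-punc"):
    """Collect the model name for each stage of a non-streaming pipeline.

    The default aliases are assumptions based on the FunASR README.
    Pass None for vad or punc to leave that stage out.
    """
    kwargs = {"model": asr}
    if vad is not None:
        kwargs["vad_model"] = vad    # voice activity detection stage
    if punc is not None:
        kwargs["punc_model"] = punc  # punctuation restoration stage
    return kwargs


def transcribe(audio_path, **stages):
    """Run file-based transcription on one audio file (hypothetical helper)."""
    # Imported lazily so the helper above works without funasr installed.
    from funasr import AutoModel

    model = AutoModel(**pipeline_kwargs(**stages))
    return model.generate(input=audio_path)
```

Calling `transcribe("example.wav")` would download the assumed models on first use and return the recognition results; `transcribe("example.wav", punc=None)` would skip punctuation restoration. Keeping the stage selection in a plain dictionary makes it easy to swap one model without touching the rest of the pipeline.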
Quick Start & Requirements
pip3 install -U funasr
Optional, for downloading pre-trained models: modelscope, huggingface_hub
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats