Discover and explore top open-source AI tools and projects—updated daily.
kaldi-asrSpeech recognition toolkit for Linux, macOS, Cygwin, and Windows
Top 3.2% on SourcePulse
Kaldi is an open-source toolkit for speech recognition research and development, offering a comprehensive set of tools and recipes for building and deploying ASR systems. It is primarily targeted at researchers and developers in the speech technology domain, providing a flexible and powerful platform for experimentation and customization.
How It Works
Kaldi is built around a C++ core, emphasizing efficiency and performance. It employs a modular design, allowing users to easily integrate different acoustic models, language models, and decoding algorithms. The toolkit supports various speech recognition paradigms, including hybrid HMM-GMM and end-to-end deep neural network approaches, offering flexibility in model selection and training.
Quick Start & Requirements
./INSTALL for UNIX-like systems (Linux, macOS, Cygwin). Windows users should refer to windows/INSTALL.make, cmake, and specific libraries like lapack-devel and openfst-devel (on Fedora). CUDA is supported for GPU acceleration.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 month ago
Inactive
QwenLM
zilliztech