Discover and explore top open-source AI tools and projects—updated daily.
Offline speech recognition for 20+ languages
Top 3.8% on SourcePulse
Vosk-api provides an offline, open-source speech recognition toolkit for a wide range of platforms including Android, iOS, Raspberry Pi, and servers. It supports over 20 languages and dialects, offering continuous, large-vocabulary transcription with zero-latency streaming and reconfigurable vocabulary. The toolkit is designed for applications such as chatbots, smart home devices, virtual assistants, and for generating subtitles or transcriptions.
How It Works
Vosk utilizes small, efficient models (around 50MB) that enable continuous, large-vocabulary speech recognition. Its key advantage lies in its zero-latency streaming API, allowing for real-time transcription. The toolkit also supports vocabulary reconfiguration and speaker identification, making it adaptable to various use cases. Bindings are available for multiple programming languages, including Python, Java, Node.js, C#, C++, Rust, and Go.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 week ago
Inactive