Discover and explore top open-source AI tools and projects—updated daily.
Speech recognition for the browser
Top 63.2% on SourcePulse
Vosk-Browser is a JavaScript library that enables speech recognition directly within web browsers by leveraging a WebAssembly build of the Vosk speech recognition toolkit. It is designed for web developers who want to integrate real-time speech-to-text capabilities into their applications without relying on server-side processing. The library offers an easy-to-use API for handling microphone input and audio files, supporting multiple languages.
How It Works
This library utilizes a WebAssembly compilation of Vosk, specifically configured to run within a Web Worker. This approach offloads the computationally intensive speech recognition tasks from the main browser thread, preventing UI freezes and ensuring a smooth user experience. The library handles the complexities of Web Worker communication and audio processing, providing a straightforward interface for developers to interact with the Vosk engine.
Quick Start & Requirements
npm i vosk-browser
. Alternatively, use a CDN like jsDelivr.model.tar.gz
). The provided example demonstrates capturing microphone input and processing it for speech recognition.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 year ago
Inactive