vosk-api  by alphacep

Offline speech recognition for 20+ languages

Created 6 years ago
13,218 stars

Top 3.8% on SourcePulse

GitHubView on GitHub
Project Summary

Vosk-api provides an offline, open-source speech recognition toolkit for a wide range of platforms including Android, iOS, Raspberry Pi, and servers. It supports over 20 languages and dialects, offering continuous, large-vocabulary transcription with zero-latency streaming and reconfigurable vocabulary. The toolkit is designed for applications such as chatbots, smart home devices, virtual assistants, and for generating subtitles or transcriptions.

How It Works

Vosk utilizes small, efficient models (around 50MB) that enable continuous, large-vocabulary speech recognition. Its key advantage lies in its zero-latency streaming API, allowing for real-time transcription. The toolkit also supports vocabulary reconfiguration and speaker identification, making it adaptable to various use cases. Bindings are available for multiple programming languages, including Python, Java, Node.js, C#, C++, Rust, and Go.

Quick Start & Requirements

  • Installation and detailed documentation are available on the Vosk Website.

Highlighted Details

  • Supports 20+ languages and dialects with ongoing expansion.
  • Small model size (approx. 50MB) suitable for resource-constrained devices.
  • Zero-latency streaming API for real-time transcription.
  • Scalable from embedded devices (Raspberry Pi, Android) to server clusters.

Maintenance & Community

  • Information regarding contributors, sponsorships, or community channels is not detailed in the provided text.

Licensing & Compatibility

  • The provided text does not specify the license type or compatibility details.

Limitations & Caveats

  • The provided text does not mention any specific limitations, caveats, or known issues with the Vosk-api toolkit.
Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
3
Issues (30d)
12
Star History
249 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.