vosk-server by alphacep

Offline speech recognition server

Created 6 years ago
1,165 stars

Top 33.2% on SourcePulse

Project Summary

This project provides an offline speech recognition server built on Vosk and Kaldi. It targets developers building smart home devices, PBX systems, chatbots, and web-based services. Because recognition runs locally and the server exposes several network protocols, applications on different platforms can send audio and receive transcripts without depending on a cloud service.

How It Works

The server wraps the Vosk API, which is built on the Kaldi speech recognition toolkit, and performs recognition locally rather than calling an external service. It supports four communication protocols (MQTT, gRPC, WebRTC, and WebSocket), so clients can stream audio and receive results in real time over whichever transport suits their system.
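
As an illustration of the WebSocket path, the following is a minimal client sketch rather than the project's official client. It assumes a server listening at ws://localhost:2700, a 16-bit mono PCM WAV file, and a message format in which the client sends a JSON config message, then binary audio chunks, then a JSON end-of-stream message. These message shapes, the port, and the 0.2 second chunk size are assumptions to verify against the Vosk Website; the helper names (build_config_message, iter_chunks, transcribe) are hypothetical. The network part relies on the third-party websockets package.

```python
import asyncio
import json
import wave


def build_config_message(sample_rate: int) -> str:
    """Build the (assumed) JSON config message announcing the audio sample rate."""
    return json.dumps({"config": {"sample_rate": sample_rate}})


# Assumed end-of-stream marker, sent after the last audio chunk.
EOF_MESSAGE = json.dumps({"eof": 1})


def iter_chunks(data: bytes, chunk_bytes: int):
    """Yield consecutive slices of at most chunk_bytes bytes."""
    for start in range(0, len(data), chunk_bytes):
        yield data[start:start + chunk_bytes]


async def transcribe(uri: str, wav_path: str) -> list:
    """Stream a WAV file to the server and collect every JSON reply."""
    import websockets  # third-party: pip install websockets

    with wave.open(wav_path, "rb") as wf:
        sample_rate = wf.getframerate()
        bytes_per_frame = wf.getsampwidth() * wf.getnchannels()
        pcm = wf.readframes(wf.getnframes())

    # Roughly 0.2 seconds of audio per message (an arbitrary choice).
    chunk_bytes = int(sample_rate * 0.2) * bytes_per_frame

    replies = []
    async with websockets.connect(uri) as ws:
        await ws.send(build_config_message(sample_rate))
        for chunk in iter_chunks(pcm, chunk_bytes):
            await ws.send(chunk)  # bytes are sent as a binary frame
            replies.append(json.loads(await ws.recv()))
        await ws.send(EOF_MESSAGE)
        replies.append(json.loads(await ws.recv()))
    return replies


# Usage, with a server already running (URI and file name are placeholders):
#   replies = asyncio.run(transcribe("ws://localhost:2700", "speech.wav"))
```

The sketch reads one reply per audio chunk, which assumes the server answers every chunk with either an interim or a final result. If the server behaves differently, the receive loop would need to run independently of the send loop.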

Quick Start & Requirements

  • Installation and usage instructions are available on the Vosk Website.

Highlighted Details

  • Supports four major communication protocols: MQTT, gRPC, WebRTC, and WebSocket.
  • Enables offline, highly accurate speech recognition.
  • Suitable for smart home devices, PBX systems (FreeSWITCH, Asterisk), chatbots, and web applications.

Maintenance & Community

  • Further community and documentation links can be found on the Vosk Website.

Licensing & Compatibility

  • This summary does not state the project's license. Vosk and Kaldi are both distributed under the Apache 2.0 license; confirm the terms in the repository before commercial use.

Limitations & Caveats

  • The README does not specify system requirements, setup complexity, or known limitations of the server. Consult the documentation on the Vosk Website for these details.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 1

Star History

  • 16 stars in the last 30 days
