vosk-server by alphacep

Offline speech recognition server

Created 7 years ago

1,258 stars

Top 30.7% on SourcePulse

Project Summary

This project provides an offline speech recognition server utilizing Vosk and Kaldi, catering to developers building applications like smart home devices, PBX systems, chatbots, and web-based services. It offers high accuracy and supports multiple communication protocols, enabling seamless integration into various platforms.

How It Works

The server leverages the Vosk API and Kaldi, a powerful speech recognition toolkit, to deliver accurate, offline speech recognition. It supports four major communication protocols: MQTT, gRPC, WebRTC, and WebSocket, allowing flexible integration with diverse systems and real-time data streaming.

Quick Start & Requirements

Installation and usage instructions are available on the Vosk Website.

Highlighted Details

Supports four major communication protocols: MQTT, gRPC, WebRTC, and WebSocket.
Enables offline, highly accurate speech recognition.
Suitable for smart home devices, PBX systems (FreeSWITCH, Asterisk), chatbots, and web applications.

Maintenance & Community

Further community and documentation links can be found on the Vosk Website.

Licensing & Compatibility

The specific license is not detailed in the provided text, but it is based on Vosk and Kaldi, which typically have permissive licenses. Further clarification on licensing would be needed for commercial use.

Limitations & Caveats

The README does not specify system requirements, setup complexity, or potential limitations of the server. Detailed documentation should be consulted for a comprehensive understanding.

Health Check

Last Commit

11 months ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

2 stars in the last 30 days

Explore Similar Projects

whisper.php by CodeWithKyrian

PHP binding for local speech-to-text, leveraging whisper.cpp

Created 1 year ago

Updated 7 months ago

voice-satellite-card-integration by jxlarrea

Browser-based voice AI for Home Assistant

Created 4 months ago

Updated 21 hours ago

vosk-browser by ccoreilly

Speech recognition for the browser

Created 5 years ago

Updated 7 months ago

openai-realtime-api-nextjs by cameronking4

Next.js starter for OpenAI Realtime API voice apps

Created 1 year ago

Updated 1 year ago

rustpbx by restsend

AI-powered PBX enabling intelligent voice communication

Created 1 year ago

Updated 15 hours ago

free4chat by i365dev

Real-time audio chat service emphasizing local-first and privacy

Created 4 years ago

Updated 1 month ago

rhasspy by rhasspy

Offline private voice assistant

Created 6 years ago

Updated 1 year ago

sherpa-ncnn by k2-fsa

Offline STT engine for real-time speech recognition and VAD

Created 3 years ago

Updated 8 months ago

Starred by

Taranjeet Singh

Taranjeet Singh(Cofounder of Mem0).

speechgpt by hahahumble

Web app for conversing with ChatGPT via speech

Created 3 years ago

Updated 2 years ago

sipsorcery by sipsorcery-org

Real-time communications SDK for C# and .NET

Created 10 years ago

Updated 1 day ago

Starred by

Tim J. Baek

Tim J. Baek(Founder of Open WebUI).

WhisperLiveKit by QuentinFuxa

Python package for real-time, local speech-to-text

Created 1 year ago

Updated 1 day ago

Starred by

Binyuan Hui

Binyuan Hui(Research Scientist at Alibaba Qwen) and

Benjamin Bolte

Benjamin Bolte(Cofounder of K-Scale Labs).

wenet by wenet-e2e

ASR toolkit for production-ready end-to-end speech recognition

Created 5 years ago

Updated 3 weeks ago

Feedback? Help us improve.