OBS plugin for local speech recognition and captioning
Top 43.1% on sourcepulse
LocalVocal is an OBS Studio plugin that provides real-time, local speech-to-text transcription and translation using AI models. It targets streamers, content creators, and accessibility users who need on-device captioning and translation without relying on cloud services, ensuring privacy and eliminating ongoing costs.
How It Works
The plugin leverages Whisper.cpp for efficient, CPU-based (with optional GPU acceleration via CUDA, ROCm, Vulkan, or Metal) processing of audio into text. Translation is handled by CTranslate2. This approach allows for high-performance, local operation, supporting a wide range of languages and offering flexibility in model selection, including custom GGML models.
Quick Start & Requirements
.github/scripts/build-macos
, .github/scripts/build-linux
, Build-Windows.ps1
).libssl-dev
.tiny.en
Whisper model; larger models can be downloaded. Performance depends heavily on CPU/GPU capabilities.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
5 months ago
Inactive