Native UI for live audio transcription/translation
Top 94.0% on sourcepulse
This project provides a native UI for the Whispering Tiger application, a tool for real-time audio transcription and translation. It targets users who need to integrate live speech-to-text and translation into various applications like streaming overlays or VRChat, offering a user-friendly interface for configuration and control.
How It Works
The UI acts as a control layer for the Whispering Tiger backend, managing audio input capture (including loopback audio for system sounds), AI model selection (for speech-to-text and translation), and output configuration via WebSockets or OSC. It supports GPU acceleration via CUDA for NVIDIA GPUs, allowing users to balance accuracy and performance by selecting AI model sizes and precision levels, with automatic model downloads.
Quick Start & Requirements
Whispering Tiger.exe
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
whispering-ui
. The linked whispering
repository is MIT licensed.Limitations & Caveats
15 hours ago
Inactive