Voice-to-text tool with AI cleanup for faster workflows
Top 86.5% on sourcepulse
WhisperChain is a Python application that streamlines workflows by converting speech to text, then using AI to clean up filler words and refine the output. It targets users seeking to accelerate tasks through voice input, offering features like real-time transcription, automatic clipboard integration, and global hotkey support.
How It Works
The system leverages Whisper.cpp for efficient, real-time speech recognition. Transcribed audio is then processed by LangChain for AI-powered cleanup, removing filler words and improving clarity. A FastAPI server handles audio streaming and processing via WebSockets, while a Streamlit UI provides an interactive interface for prompts and history. Global hotkeys trigger recording, and cleaned text is automatically copied to the clipboard.
Quick Start & Requirements
brew install ffmpeg portaudio
pip install whisperchain
.env
file, or ~/.whisperchain/.env
.whisperchain
streamlit run src/whisperchain/ui/streamlit_app.py
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project requires an OpenAI API key, incurring potential costs. System dependencies are specific to macOS, and Windows/Linux support is not detailed. The project is primarily command-line driven with an optional Streamlit UI.
5 months ago
Inactive