Live transcription tool for microphone and speaker output
Top 8.7% on sourcepulse
Ecoute is a live transcription tool designed for real-time transcription of both microphone input and system audio output, aiding users in conversations. It targets individuals needing immediate text feedback from their audio environment.
How It Works
Ecoute leverages the Whisper ASR model for transcription. By default, it uses the 'tiny' model for low resource consumption and fast responses, primarily supporting English. An optional --api
flag enables the use of OpenAI's Whisper API, offering significantly improved speed, accuracy, and multi-language support, albeit at a higher cost due to API usage.
Quick Start & Requirements
pip install -r requirements.txt
choco install ffmpeg
), OpenAI API key (for --api
flag).python main.py
or python main.py --api
for Whisper API.Highlighted Details
Maintenance & Community
Contributions are welcome via issues and pull requests.
Licensing & Compatibility
Licensed under the MIT License, permitting commercial use and closed-source linking.
Limitations & Caveats
Currently limited to default system microphone and speaker devices. The non-API 'tiny' Whisper model has reduced accuracy for accents and non-English languages. Multi-language support is pending for the local model.
4 months ago
1 day