CLI tool for real-time audio transcription/translation from livestreams
Top 98.8% on sourcepulse
This utility transcribes or translates audio from live streams in real-time, targeting users who need to process audio from online broadcasts. It leverages streamlink
for stream acquisition and OpenAI's Whisper for speech processing, offering a convenient command-line solution for content creators and analysts.
How It Works
The tool integrates streamlink
to fetch audio streams from various platforms, passing the stream URL to ffmpeg
for processing. OpenAI's Whisper model then transcribes or translates the audio. An optional faster-whisper
implementation is available for significant performance gains, offering up to 4x speed improvement and 2x memory reduction when models are converted to the CTranslate2 format.
Quick Start & Requirements
ffmpeg
and add it to your PATH.requirements.txt
if needed).git clone https://github.com/fortypercnt/stream-translator.git
pip install -r requirements.txt
python translator.py <URL> --flags
streamlink
plugins.Highlighted Details
faster-whisper
for enhanced performance.ffmpeg
, bypassing streamlink
.Maintenance & Community
No specific contributors, sponsorships, or community links (Discord/Slack, roadmap) are mentioned in the README.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
Whisper performance is heavily reliant on GPU acceleration; CPU execution is unlikely to be real-time. The history_buffer_size
parameter can lead to repetition if not configured carefully. The README does not mention testing on different operating systems or provide benchmarks beyond the faster-whisper
claims.
2 years ago
1 week