fortypercnt/stream-translator
CLI tool for real-time audio transcription/translation from livestreams
Top 97.7% on SourcePulse
This utility transcribes or translates audio from live streams in real-time, targeting users who need to process audio from online broadcasts. It leverages streamlink for stream acquisition and OpenAI's Whisper for speech processing, offering a convenient command-line solution for content creators and analysts.
How It Works
The tool integrates streamlink to fetch audio streams from various platforms, passing the stream URL to ffmpeg for processing. OpenAI's Whisper model then transcribes or translates the audio. An optional faster-whisper implementation is available for significant performance gains, offering up to 4x speed improvement and 2x memory reduction when models are converted to the CTranslate2 format.
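The hand-off between ffmpeg and Whisper can be sketched as follows. This is a minimal illustration, not the tool's actual code: the function names (build_ffmpeg_command, read_chunks, pcm16_to_floats) are hypothetical, and the sketch assumes the stream URL has already been resolved (for example by streamlink). It relies on the fact that Whisper models take 16 kHz mono audio, so ffmpeg is asked to emit raw 16-bit PCM in that format.

```python
import struct

SAMPLE_RATE = 16000   # Whisper models expect 16 kHz mono audio
BYTES_PER_SAMPLE = 2  # signed 16-bit little-endian PCM


def build_ffmpeg_command(stream_url):
    """Return an ffmpeg argument list that decodes stream_url to raw PCM on stdout."""
    return [
        "ffmpeg", "-loglevel", "quiet",
        "-i", stream_url,
        "-f", "s16le", "-acodec", "pcm_s16le",  # raw 16-bit PCM output
        "-ac", "1", "-ar", str(SAMPLE_RATE),    # mono, 16 kHz
        "-",                                    # write to stdout
    ]


def read_chunks(pcm_stream, seconds):
    """Yield fixed-length chunks of raw PCM bytes from a file-like object.

    A trailing partial chunk (end of stream) is dropped.
    """
    chunk_bytes = int(SAMPLE_RATE * seconds) * BYTES_PER_SAMPLE
    while True:
        data = pcm_stream.read(chunk_bytes)
        if len(data) < chunk_bytes:
            break
        yield data


def pcm16_to_floats(data):
    """Convert 16-bit little-endian PCM bytes to floats in [-1.0, 1.0)."""
    count = len(data) // BYTES_PER_SAMPLE
    samples = struct.unpack("<%dh" % count, data[:count * BYTES_PER_SAMPLE])
    return [s / 32768.0 for s in samples]
```

In use, the command list would be passed to subprocess.Popen with stdout=subprocess.PIPE, and each chunk from read_chunks(proc.stdout, seconds) would be converted with pcm16_to_floats before being handed to the speech model. The chunk length in seconds is a free choice in this sketch; how the real tool sizes its intervals is not stated here.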
Quick Start & Requirements
Install ffmpeg and add it to your PATH.
… requirements.txt if needed).
git clone https://github.com/fortypercnt/stream-translator.git
pip install -r requirements.txt
python translator.py <URL> --flags
… streamlink plugins.
Highlighted Details
… faster-whisper for enhanced performance.
… ffmpeg, bypassing streamlink.
Maintenance & Community
No specific contributors, sponsorships, or community links (Discord/Slack, roadmap) are mentioned in the README.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
Whisper performance is heavily reliant on GPU acceleration; CPU execution is unlikely to be real-time. The history_buffer_size parameter can lead to repetition if not configured carefully. The README does not mention testing on different operating systems or provide benchmarks beyond the faster-whisper claims.
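To illustrate why a history buffer can cause repetition, here is a generic sketch of a rolling text history used as conditioning context. It is an assumption-laden illustration, not the tool's implementation: the HistoryBuffer class and its methods are hypothetical, and the summary above does not describe how history_buffer_size is actually applied. The idea is that earlier output is fed back as context, so a bad or repeated segment can reinforce itself unless it is filtered out.

```python
from collections import deque


class HistoryBuffer:
    """Hypothetical rolling history of recent transcript segments."""

    def __init__(self, size):
        # size <= 0 disables history entirely
        self.size = size
        self._segments = deque(maxlen=max(size, 0))

    def add(self, text):
        """Store a segment; return False if it was skipped."""
        text = text.strip()
        if self.size <= 0 or not text:
            return False
        # Skip an exact repeat of the previous segment so a loop
        # is not fed back into the next conditioning prompt.
        if self._segments and self._segments[-1] == text:
            return False
        self._segments.append(text)
        return True

    def prompt(self):
        """Join the retained segments into one context string."""
        return " ".join(self._segments)
```

With a size of 2, only the two most recent distinct segments are kept, and a size of 0 yields an empty context. The duplicate check is one simple mitigation chosen for this sketch; lowering the buffer size is the other obvious lever.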
2 years ago
Inactive