stream-translator by fortypercnt

CLI tool for real-time audio transcription/translation from livestreams

Created 3 years ago

260 stars

Top 97.7% on SourcePulse

Project Summary

This utility transcribes or translates audio from live streams in real-time, targeting users who need to process audio from online broadcasts. It leverages streamlink for stream acquisition and OpenAI's Whisper for speech processing, offering a convenient command-line solution for content creators and analysts.

How It Works

The tool integrates streamlink to fetch audio streams from various platforms, passing the stream URL to ffmpeg for processing. OpenAI's Whisper model then transcribes or translates the audio. An optional faster-whisper implementation is available for significant performance gains, offering up to 4x speed improvement and 2x memory reduction when models are converted to the CTranslate2 format.

Quick Start & Requirements

Install ffmpeg and add it to your PATH.
Install CUDA (version 11.3 or compatible, adjust requirements.txt if needed).
Clone the repository: git clone https://github.com/fortypercnt/stream-translator.git
Install dependencies: pip install -r requirements.txt
Ensure PyTorch is installed with CUDA support.
Run: python translator.py <URL> --flags
Supported sites depend on streamlink plugins.

Highlighted Details

Real-time transcription and translation of live streams.
Supports various Whisper model sizes and language detection.
Configurable parameters for transcription interval, history buffer, and beam search.
Option to use faster-whisper for enhanced performance.
Supports direct URL input to ffmpeg, bypassing streamlink.

Maintenance & Community

No specific contributors, sponsorships, or community links (Discord/Slack, roadmap) are mentioned in the README.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Whisper performance is heavily reliant on GPU acceleration; CPU execution is unlikely to be real-time. The history_buffer_size parameter can lead to repetition if not configured carefully. The README does not mention testing on different operating systems or provide benchmarks beyond the faster-whisper claims.

stream-translator by fortypercnt

Explore Similar Projects

yt-transcriber by pmarreck

orpheus-cpp by freddyaboulton

babelfish.ai by supabase-community

realtime-transcription-fastrtc by sofdog-gh

generate-subtitles by mayeaux

writeout.ai by beyondcode

liftoff by Tameyer41

vibe by thewh1teagle

speaches by speaches-ai

whisper_streaming by ufal

whisper_real_time by davabase

WhisperLive by collabora