Discover and explore top open-source AI tools and projects—updated daily.
YouTube transcription TUI app
Top 84.8% on SourcePulse
This tool provides a command-line interface for transcribing audio and video content, primarily from YouTube URLs. It targets users who need to process spoken content for text-based analysis, offering features like transcription, summarization, and translation, leveraging local AI models and optional cloud APIs.
How It Works
The application utilizes the Whisper model for local speech-to-text transcription, allowing users to select model sizes for a trade-off between accuracy and performance. For summarization and translation, it integrates with OpenAI or Claude APIs, requiring user-provided API keys. The core workflow involves fetching audio from a URL, processing it with Whisper, and then optionally piping the output to separate summarize
and translate
scripts.
Quick Start & Requirements
bash
, python3.12
, ffmpeg
, and optionally glow
.Highlighted Details
ffmpeg
.base
, small
, medium
, large
, large-v2
).yt-transcriber TEST
.Maintenance & Community
The project appears to be a personal utility with limited public community engagement signals in the README.
Licensing & Compatibility
The README does not explicitly state a license.
Limitations & Caveats
Speaker identification is marked as a work-in-progress (WIP). Support for other video platforms and conversion to a web service are future features. Setup has only been tested on macOS with Nix.
5 months ago
Inactive