Whisperer by tigros

CLI tool for batch speech-to-text using Whisper

created 2 years ago
299 stars

Top 90.0% on sourcepulse

Project Summary

Whisperer is a batch speech-to-text tool designed for generating subtitles from video and audio files. It leverages OpenAI's Whisper model, specifically the GPU-accelerated whisper.cpp implementation, to process multiple files concurrently, scaling with available GPU memory.

How It Works

Whisperer uses whisper.cpp for efficient, GPU-accelerated inference of the Whisper model. For batch processing, it launches multiple concurrent model instances, sized to the user's available GPU memory, to maximize throughput. This approach offers significant speed improvements over CPU-based processing.
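The scaling idea above can be sketched in a few lines of Python. This is an illustrative sketch, not Whisperer's actual code: `plan_workers`, `transcribe_all`, and the assumed 1.5 GB per-instance footprint are hypothetical; the real footprint depends on the model you download.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def plan_workers(gpu_mem_gb: float, model_mem_gb: float = 1.5) -> int:
    """Estimate how many model instances fit in GPU memory.

    model_mem_gb is an assumed per-instance footprint (hypothetical);
    at least one worker is always returned.
    """
    if model_mem_gb <= 0:
        raise ValueError("model_mem_gb must be positive")
    return max(1, math.floor(gpu_mem_gb / model_mem_gb))


def transcribe_all(files, gpu_mem_gb, run_one):
    """Run run_one(file) across files, bounding concurrency by GPU memory."""
    workers = plan_workers(gpu_mem_gb)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_one, files))
```

For example, an 8 GB GPU with a 1.5 GB-per-instance assumption yields five concurrent workers; each worker would invoke one whisper.cpp process on one input file.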

Quick Start & Requirements

  • Install via pip install whisperer.
  • Requires ffmpeg to be in the system's PATH.
  • Download Whisper models from Hugging Face (e.g., the ggerganov/whisper.cpp repository); do not use v3 models, as they are not yet supported.
  • GPU with sufficient memory is required for optimal performance.
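The PATH requirement and model download above can be checked from a shell. This is a sketch, not part of Whisperer itself: the `command -v` check and the commented-out download are illustrative, and the URL follows the standard Hugging Face "resolve" pattern for the ggerganov/whisper.cpp model repository.

```shell
# Verify ffmpeg is discoverable on PATH (required by Whisperer)
if command -v ffmpeg >/dev/null 2>&1; then
  echo "ffmpeg found"
else
  echo "ffmpeg missing: install it and add it to PATH"
fi

# Download a non-v3 ggml model from Hugging Face (uncomment to run;
# pick any model except the unsupported v3 variants):
# curl -L -o ggml-base.en.bin \
#   https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin
```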

Highlighted Details

  • Utilizes whisper.cpp for significant GPU-accelerated speed improvements.
  • Supports batch processing, scaling with GPU memory.
  • Generates subtitles for video and audio files.

Maintenance & Community

No specific community channels or roadmap are mentioned in the README.

Licensing & Compatibility

The README does not specify a license. Compatibility for commercial use or closed-source linking is not detailed.

Limitations & Caveats

The project explicitly states that v3 Whisper models are not yet supported. ffmpeg is a required external dependency not included with the package.

Health Check

  • Last commit: 4 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 10 stars in the last 90 days

