insanely-fast-whisper  by Vaibhavs10

Fast Whisper transcription CLI

created 1 year ago
8,565 stars

Top 6.1% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides an opinionated Command Line Interface (CLI) for highly accelerated on-device audio transcription using OpenAI's Whisper models. It targets users needing to process large audio files quickly, offering transcription speeds up to 150 minutes in under 2 minutes on high-end GPUs.

How It Works

The CLI leverages Hugging Face Transformers, Optimum, and Flash Attention 2 for significant performance gains. It enables FP16 precision, batching, and optimized attention mechanisms to drastically reduce transcription time compared to standard implementations. The project also supports speaker diarization through integration with pyannote.audio.

Quick Start & Requirements

  • Install via pipx install insanely-fast-whisper.
  • Requires NVIDIA GPU with CUDA or macOS with Apple Silicon (mps).
  • For Flash Attention 2, manual installation might be needed: pipx runpip insanely-fast-whisper install flash-attn --no-build-isolation.
  • Official docs: https://github.com/Vaibhavs10/insanely-fast-whisper

Highlighted Details

  • Achieves ~1 minute 18 seconds for 150 minutes of audio with distil-whisper/large-v2 and Flash Attention 2.
  • Supports Whisper Large v3 with Flash Attention 2 for ~1 minute 38 seconds transcription time.
  • Includes options for batch size, device selection (cuda or mps), task (transcribe/translate), language detection, and timestamp granularity.
  • Offers speaker diarization with configurable speaker counts.

Maintenance & Community

Licensing & Compatibility

  • The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

  • The CLI is opinionated and primarily targets NVIDIA GPUs and macOS. Windows support for CUDA may require specific PyTorch installations to resolve CUDA enablement issues. MPS backend on Mac is more memory-intensive.
Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
254 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.