whisper-standalone-win  by Purfview

Standalone executables for local speech transcription

created 2 years ago
2,293 stars

Top 20.2% on sourcepulse

Project Summary

This repository provides standalone Windows executables for OpenAI's Whisper and Faster-Whisper, targeting users who prefer not to manage Python environments. It offers a simplified way to perform speech-to-text transcription, with Faster-Whisper variants providing significant speed and resource improvements over the original Whisper model.

How It Works

The project packages pre-compiled versions of Whisper and Faster-Whisper, leveraging C++ compilation for standalone execution. Faster-Whisper, a key component, is optimized for speed and reduced memory usage, making it suitable for resource-constrained environments or faster processing. The executables can automatically detect and utilize CUDA-enabled GPUs for accelerated inference.
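The summary does not say how the executables detect CUDA. As a rough illustration only, a wrapper script could check for an NVIDIA GPU before launching a transcription by querying `nvidia-smi`, the utility installed with NVIDIA drivers. The helper names `nvidia_gpu_names` and `pick_device` are hypothetical and are not part of the project.

```python
import shutil
import subprocess


def nvidia_gpu_names():
    """Return GPU names reported by nvidia-smi, or an empty list if unavailable."""
    smi = shutil.which("nvidia-smi")
    if smi is None:
        # No NVIDIA driver utility on PATH, so assume no usable CUDA GPU.
        return []
    try:
        result = subprocess.run(
            [smi, "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True,
            text=True,
            timeout=10,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        return []
    if result.returncode != 0:
        return []
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


def pick_device():
    """Hypothetical helper: report 'cuda' when a GPU is listed, else 'cpu'."""
    return "cuda" if nvidia_gpu_names() else "cpu"


if __name__ == "__main__":
    print("GPUs found:", nvidia_gpu_names())
    print("Expected device:", pick_device())
```

A check like this only tells you whether a driver and GPU are visible to the system; the executables make their own decision at run time.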

Quick Start & Requirements

  • Download executables from the Releases page.
  • Faster-Whisper executables are compatible with Windows 7, Linux v5.4, macOS v10.15 and above (x86-64).
  • Faster-Whisper-XXL/XXL Pro are compatible with Windows 10, Linux v5.15 and above (x86-64).
  • Whisper executables require Windows 10 and above (x86-64).
  • GPU acceleration requires a CUDA-enabled GPU, which the executables detect automatically.
  • A guide for command-line usage is available: youtube.com/watch?v=A3nwRCV-bTU
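Once an executable is downloaded, a transcription run is a single command. The sketch below builds and optionally runs such a command from Python. It assumes the executable is named `faster-whisper-xxl.exe` and accepts the `--language`, `--model`, and `--output_dir` options; none of these are stated in this summary, so check them against the executable's `--help` output. The file `interview.mkv` and the `transcripts` directory are placeholders.

```python
import subprocess
from pathlib import Path


def build_command(exe, media, language="English", model="medium", output_dir="transcripts"):
    """Return the argument list for one transcription run.

    The option names are assumptions; verify them with the executable's --help.
    """
    return [
        str(exe),
        str(media),
        "--language", language,
        "--model", model,
        "--output_dir", output_dir,
    ]


def transcribe(exe, media, **options):
    """Run the executable if present; return its exit code, or None if it is missing."""
    if not Path(exe).is_file():
        return None
    completed = subprocess.run(build_command(exe, media, **options), check=False)
    return completed.returncode


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works without the binary.
    print(" ".join(build_command("faster-whisper-xxl.exe", "interview.mkv")))
```

Passing the arguments as a list rather than a single string avoids quoting problems with Windows paths that contain spaces.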

Highlighted Details

  • Faster-Whisper offers significant speed and resource improvements over original Whisper.
  • Faster-Whisper-XXL includes advanced features like MDX23 Kim_vocal_v2 audio preprocessing and multiple VAD methods.
  • Supports speaker diarization in XXL versions.
  • Displays a progress bar in the title bar of the command-line window.

Maintenance & Community

  • Development appears to be driven primarily by the user Purfview.
  • Discussions regarding new features and issues are hosted in the Discussions section.

Licensing & Compatibility

  • The project bundles OpenAI's Whisper and Faster-Whisper, which have their own licenses. The specific license for the standalone executables themselves is not explicitly stated in the README.
  • Compatibility for commercial use or linking with closed-source applications would depend on the underlying Whisper and Faster-Whisper licenses.

Limitations & Caveats

The original "Faster-Whisper" build is marked as deprecated in favor of "Faster-Whisper-XXL". A "Pro" version exists but is not public and is intended for donors. Because the README does not state a license for the compiled executables themselves, the terms for commercial use are unclear.

Health Check

  • Last commit: 3 months ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 14
  • Star history: 315 stars in the last 90 days
