whisper-standalone-win by Purfview

Standalone executables for local speech transcription

Created 2 years ago

2,783 stars

Top 17.0% on SourcePulse

Project Summary

This repository provides standalone Windows executables for OpenAI's Whisper and Faster-Whisper, targeting users who prefer not to manage Python environments. It offers a simplified way to perform speech-to-text transcription, with Faster-Whisper variants providing significant speed and resource improvements over the original Whisper model.

How It Works

The project packages pre-compiled versions of Whisper and Faster-Whisper, leveraging C++ compilation for standalone execution. Faster-Whisper, a key component, is optimized for speed and reduced memory usage, making it suitable for resource-constrained environments or faster processing. The executables can automatically detect and utilize CUDA-enabled GPUs for accelerated inference.

Quick Start & Requirements

Download executables from the Releases page.
Faster-Whisper executables are compatible with Windows 7, Linux v5.4, macOS v10.15 and above (x86-64).
Faster-Whisper-XXL/XXL Pro are compatible with Windows 10, Linux v5.15 and above (x86-64).
Whisper executables require Windows 10 and above (x86-64).
GPU acceleration requires CUDA detection.
A guide for command-line usage is available: youtube.com/watch?v=A3nwRCV-bTU

Highlighted Details

Faster-Whisper offers significant speed and resource improvements over original Whisper.
Faster-Whisper-XXL includes advanced features like MDX23 Kim_vocal_v2 audio preprocessing and multiple VAD methods.
Supports speaker diarization in XXL versions.
Progress bar displayed in the command-line title bar.

Maintenance & Community

Development appears to be primarily driven by the Purfview user.
Discussions regarding new features and issues are hosted in the Discussions section.

Licensing & Compatibility

The project bundles OpenAI's Whisper and Faster-Whisper, which have their own licenses. The specific license for the standalone executables themselves is not explicitly stated in the README.
Compatibility for commercial use or linking with closed-source applications would depend on the underlying Whisper and Faster-Whisper licenses.

Limitations & Caveats

The "Faster-Whisper" project is marked as deprecated in favor of "Faster-Whisper-XXL". A "Pro" version exists but is non-public and intended for donors. The README does not explicitly state the license for the compiled executables themselves, which could impact commercial use.

whisper-standalone-win by Purfview

Explore Similar Projects

Stage-Whisper by Stage-Whisper

Auralis by astramind-ai

LiveWhisper by Nikorasu

pywhispercpp by absadiki

ollama-voice-mac by apeatling

awesome-whisper by sindresorhus

transcribe-anything by zackees

whisper-ctranslate2 by Softcatala

ecoute by SevaSk

WhisperLiveKit by QuentinFuxa

buzz by chidiwilliams

speech_recognition by Uberi