faster-whisper-GUI  by CheshireCC

GUI for faster-whisper/whisperX transcription

created 2 years ago
2,561 stars

Top 18.7% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides a graphical user interface (GUI) for the faster-whisper and whisperX speech-to-text libraries, targeting users who need to transcribe audio or video files into various subtitle formats. It simplifies the process of using these powerful models by offering a visual interface for parameter tuning and model management.

How It Works

The GUI leverages PySide6 for its user interface and integrates directly with faster-whisper and whisperX for transcription. It supports downloading models from Hugging Face, including the large-v3 model, and allows for model conversion. The software also incorporates the Demucs model for audio separation and offers features like batch processing, VAD parameter control, and word-level timestamps for enhanced transcription accuracy and usability.

Quick Start & Requirements

  • Install via pip install faster-whisper-GUI.
  • Requires Python 3.x.
  • Supports downloading models from Hugging Face.
  • Official documentation and demo links are not explicitly provided in the README.

Highlighted Details

  • Supports transcription to SRT, TXT, SMI, VTT, and LRC formats.
  • Integrates Demucs for audio source separation.
  • Offers word-level timestamps and Karaoka lyric support.
  • Includes batch processing capabilities.

Maintenance & Community

  • The project is hosted on GitHub by CheshireCC.
  • No specific community channels (Discord, Slack) or roadmap are mentioned.

Licensing & Compatibility

  • The README does not explicitly state a license.
  • Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project appears to be actively developed, with features like Demucs and WhisperX integration being relatively recent additions. Specific details on performance benchmarks or extensive testing are not provided.

Health Check
Last commit

7 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
2
Star History
243 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.