GUI tool for local audio transcription
AsrTools is a user-friendly, GPU-free desktop application for batch audio-to-text transcription and subtitle generation. It targets users who need to quickly convert audio files into SRT or TXT formats without complex setup, offering a streamlined workflow for individuals and developers.
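To make the SRT output concrete, here is a minimal sketch of how timed transcript segments can be serialized into the standard SRT layout (a sequence number, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time range, the text, then a blank line). The function names `format_timestamp` and `to_srt` and the `(start, end, text)` segment shape are assumptions made for illustration; they are not taken from the AsrTools source.

```python
def format_timestamp(seconds):
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Serialize (start_seconds, end_seconds, text) tuples as SRT text.

    The tuple layout is a hypothetical intermediate format, not the
    structure AsrTools uses internally.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        time_range = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{time_range}\n{text}\n")
    # Joining with a newline leaves one blank line between cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    print(to_srt([(0.0, 1.25, "Hello"), (1.25, 3.0, "world")]))
```

A plain TXT export would simply join the segment texts and drop the numbering and timestamps.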
How It Works
The tool supports multiple ASR (Automatic Speech Recognition) engines, and users select the one they prefer. It uses multi-threading to process batches of audio files efficiently, and it accepts video inputs directly by extracting the audio internally. The application is built with PyQt5 and qfluentwidgets, which provide a visually appealing and intuitive graphical user interface.
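The multi-threaded batch workflow described above can be sketched with Python's standard `concurrent.futures` thread pool. This is an illustration under stated assumptions, not the project's actual implementation: `transcribe_file` is a hypothetical stub standing in for a call to the selected ASR engine, and `VIDEO_EXTENSIONS` is an illustrative list rather than the set of formats AsrTools supports.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from pathlib import Path

# Illustrative only; not the list of formats AsrTools accepts.
VIDEO_EXTENSIONS = {".mp4", ".mkv", ".mov"}


def needs_audio_extraction(path):
    """Return True when the input looks like a video file."""
    return Path(path).suffix.lower() in VIDEO_EXTENSIONS


def transcribe_file(path):
    """Hypothetical stand-in for a request to the chosen ASR engine."""
    source = "extracted audio" if needs_audio_extraction(path) else "audio"
    return f"[transcript of {Path(path).name} from {source}]"


def transcribe_batch(paths, max_workers=4):
    """Transcribe several files concurrently and map each path to its result."""
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(transcribe_file, p): p for p in paths}
        for future in as_completed(futures):
            path = futures[future]
            try:
                results[path] = future.result()
            except Exception as exc:
                # One failed file should not abort the rest of the batch.
                results[path] = f"error: {exc}"
    return results


if __name__ == "__main__":
    for name, text in transcribe_batch(["talk.mp3", "clip.mp4"]).items():
        print(name, text)
```

Threads suit this kind of workload when each job spends most of its time waiting on I/O, such as a network request to a recognition service, rather than on local computation.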
Quick Start & Requirements
For a source installation, run pip install -r requirements.txt, then launch the GUI with python asr_gui.py. Listed dependencies include requests. No GPU required.
Highlighted Details
Maintenance & Community
The project is actively developed, with recent updates adding video file support. Users can submit issues for support and feedback.
Licensing & Compatibility
The repository does not explicitly state a license. This may pose compatibility issues for commercial use or integration into closed-source projects.
Limitations & Caveats
The project's license is not specified, which could restrict commercial use. The README also points to a more advanced project, VideoCaptioner, for LLM-based features, suggesting AsrTools may be a simpler, foundational tool.