Discover and explore top open-source AI tools and projects—updated daily.
mehtabmahirDesktop app for fast, GPU-accelerated audio/video transcription
Top 69.1% on SourcePulse
EasyWhisperUI provides a user-friendly, cross-platform desktop interface for local audio and video transcription using the Whisper model, optimized with GPU acceleration. It targets users seeking efficient, private transcription solutions on their own hardware, offering a unified experience across Windows and macOS with features like batch processing and multi-language support.
How It Works
This project is built upon an Electron architecture, integrating React for the UI and inter-process communication (IPC) for secure communication between the renderer and main processes. It leverages whisper.cpp for core transcription capabilities, enabling GPU acceleration via Vulkan on Windows and Metal on macOS. Media file conversion is handled by FFmpeg. This approach provides a hardened, isolated UI environment and predictable management of Whisper binaries and models.
Quick Start & Requirements
.dmg from the project's Releases page. Installation is per-user.Highlighted Details
Maintenance & Community
The project relies on community donations for maintenance. Credits are given to whisper.cpp by Georgi Gerganov and FFmpeg. No specific community channels (like Discord/Slack) or roadmaps are detailed in the provided information.
Licensing & Compatibility
The application is proprietary ("All rights reserved") and explicitly prohibits commercial use, copying, modification, or distribution without the author's permission; these actions are permitted for personal use only. It incorporates whisper.cpp (MIT License) and FFmpeg (LGPL 2.1 License). Commercial use or linking within closed-source projects is restricted.
Limitations & Caveats
Linux support is explicitly stated as not yet implemented. The live transcription feature is marked as beta. The proprietary license severely restricts commercial application and redistribution.
1 day ago
Inactive