Whispo is an AI-powered dictation tool designed for seamless integration with any application supporting text input. It targets users seeking an efficient, local-first transcription solution that leverages advanced AI models for both speech-to-text and post-processing.
How It Works
Whispo utilizes a simple hotkey mechanism (hold Ctrl to record, release to transcribe) to capture audio. The core transcription is handled by OpenAI's Whisper model, with the flexibility to use custom API endpoints like Groq. Transcripts are automatically inserted into the active application. A key feature is its support for post-processing transcripts using Large Language Models (LLMs) such as OpenAI, Groq, and Gemini, enabling advanced text refinement.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
Currently in preview, with builds only available for macOS (Apple Silicon) and Windows x64. The AGPL-3.0 license requires careful consideration for integration into proprietary software.
8 months ago
1 day