Discover and explore top open-source AI tools and projects—updated daily.
hehehaiAI voice input and translation for macOS
New!
Top 87.9% on SourcePulse
Summary
Voxt is a beta macOS menu bar application designed for seamless voice input and translation. It targets macOS users seeking an efficient way to convert speech to text or translate it directly within any application, offering a "press to talk, release to paste" paradigm. The primary benefit is the integration of powerful on-device and cloud-based STT and LLM engines, providing both privacy and flexibility for users.
How It Works
Voxt leverages macOS's Accessibility and Event Tapping APIs to capture global hotkey triggers and audio input via AVAudioEngine. It supports two Speech-to-Text (STT) engines: MLX Audio for private, on-device processing with downloadable models, and Apple's Direct Dictation for zero-setup convenience. For text enhancement and translation, it integrates with Apple Intelligence Foundation Models or allows users to run custom local LLMs. The pipeline involves ASR, optional LLM enhancement (which can be contextually routed via App Branch rules), and a final translation step, all while maintaining a live floating overlay for user feedback.
Quick Start & Requirements
.zip release, unzip, and drag Voxt.app to the Applications folder.Voxt.xcodeproj in Xcode or use xcodebuild from the terminal.Highlighted Details
Maintenance & Community
No specific community channels (like Discord/Slack) or details on maintainers/sponsors are provided in the README.
Licensing & Compatibility
Limitations & Caveats
The application is explicitly marked as "[Beta]". It requires significant macOS permissions (Accessibility, Microphone, Speech Recognition), which may be a privacy concern for some users. Performance and functionality of local models depend heavily on the user's hardware capabilities.
22 hours ago
Inactive
janhq