Discover and explore top open-source AI tools and projects—updated daily.
joewongjcmacOS voice input with local and LLM-powered optimization
New!
Top 40.8% on SourcePulse
Summary
Type4Me addresses the limitations of existing macOS voice input solutions by offering a flexible, privacy-focused tool. It targets power users and developers seeking efficient, customizable speech-to-text and command execution, benefiting from local processing, LLM integration, and complete data control.
How It Works
The core architecture supports both on-device (SherpaOnnx - Paraformer/Zipformer) and cloud-based (Volcengine, Deepgram) Automatic Speech Recognition (ASR). A key differentiator is its LLM integration, enabling advanced text optimization, translation, and command execution via customizable prompt templates utilizing context variables like {text}, {selected}, and {clipboard}. Its plugin-based ASR provider design facilitates extensibility, while all user data, including credentials and history, is stored locally.
Quick Start & Requirements
brew install cmake) are necessary for source builds.Highlighted Details
Maintenance & Community
The project encourages community contributions, particularly for adding support for additional ASR cloud providers. While specific community links (Discord, Slack) are absent, the README outlines contribution steps via Issues, Discussions, and Pull Requests, and mentions AI agent integration for development tasks.
Licensing & Compatibility
Licensed under the permissive MIT License, Type4Me is compatible with commercial use and closed-source linking. It requires macOS 14.0 or newer.
Limitations & Caveats
Local ASR setup involves manual model downloading and configuration. While the architecture supports numerous cloud ASR providers, only Volcengine and Deepgram currently have implemented client integrations. The application requires user intervention to bypass macOS security warnings on first launch.
2 days ago
Inactive
matthartman
Beingpax