openless  by Open-Less

AI-powered voice input for polished text and LLM prompts

Created 3 weeks ago

New!

1,589 stars

Top 25.8% on SourcePulse

GitHubView on GitHub
Project Summary

Summary OpenLess offers an open-source, local-first voice input solution for macOS and Windows, transforming spoken words into AI-polished text at the cursor. It provides a privacy-focused alternative to commercial SaaS tools by allowing users to integrate their own ASR and LLM credentials. The core benefit is faster, cleaner text generation, especially for crafting sophisticated AI prompts, while ensuring all data remains on the user's machine.

How It Works The app captures audio via a global hotkey, transcribes it using streaming ASR (Volcengine/Whisper-compatible), and polishes the text with a chat-completions LLM (Ark/DeepSeek-compatible). Its key feature is the "Structured" AI prompt mode, which converts loose speech into well-formed prompts for LLMs. The pipeline is Speech → Transcript → Cleanup → Insert at Cursor, with clipboard fallback.

Quick Start & Requirements Download .dmg (macOS) or .exe (Windows) from Releases. Requires macOS 12+ or Windows 10+. Grant Microphone and Accessibility (macOS) permissions. Users must provide their own Volcengine ASR and Ark/DeepSeek LLM API credentials via Settings. See USAGE.md for a walkthrough.

Highlighted Details

  • Cross-Platform: macOS (12+) and Windows (10+).
  • Open-Source & Local-First: Privacy-centric alternative to SaaS, data stays local.
  • BYO Credentials: Integrates user-provided ASR/LLM services, avoiding vendor lock-in.
  • AI Prompt Optimization: Dedicated "Structured" mode for precise LLM prompt generation.
  • Multiple Output Modes: Raw, light polish, structured, formal.
  • Customizable Dictionary: Improves ASR accuracy and LLM context.

Maintenance & Community Sponsored by 悠雾云数据 and jiangmuran. Community via QQ Group: 1078960553. Roadmap includes translation, style memory, and history enhancements.

Licensing & Compatibility MIT license permits broad use, including commercial. Local-first architecture enhances compatibility and privacy.

Limitations & Caveats Requires external ASR/LLM API credentials. macOS Accessibility permissions need an app restart. Tool focuses solely on text generation/polishing, not command execution or direct answering.

Health Check
Last Commit

19 hours ago

Responsiveness

Inactive

Pull Requests (30d)
311
Issues (30d)
218
Star History
1,628 stars in the last 27 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Michael Han Michael Han(Cofounder of Unsloth), and
1 more.

Orpheus-TTS by canopyai

0.1%
6k
Open-source TTS for human-sounding speech, built on Llama-3b
Created 1 year ago
Updated 5 months ago
Feedback? Help us improve.