Discover and explore top open-source AI tools and projects—updated daily.
yan5xuPrivacy-first Chinese desktop voice workflow for intelligent text processing
Top 22.8% on SourcePulse
QuQu (蛐蛐) is an open-source, free, desktop-based voice workflow tool designed as an alternative to Wispr Flow, specifically optimized for Chinese users. It prioritizes user privacy through local data processing and offers advanced text manipulation capabilities by integrating local speech recognition models with configurable large language models. This makes it suitable for individuals seeking a cost-effective, privacy-conscious, and powerful voice input solution for content creation, coding, and communication.
How It Works
QuQu employs a unique "two-stage engine" workflow: first, highly accurate Automatic Speech Recognition (ASR) using Alibaba's FunASR Paraformer model runs locally on the user's machine, ensuring data privacy and understanding nuanced Chinese internet language. Second, a Large Language Model (LLM) optimizes the transcribed text, automatically filtering out filler words, correcting speech errors in real-time (e.g., resolving self-corrections), and formatting output based on user-defined instructions. This approach allows QuQu to not only transcribe but also "understand" and "reshape" spoken language into desired text formats.
Quick Start & Requirements
pnpm run dev after setup.uv (recommended for automatic Python/dependency management), system Python with virtual environments, or an embedded Python environment for isolation. The uv method involves git clone, pnpm install, uv sync, uv run python download_models.py, and pnpm run dev.Highlighted Details
Maintenance & Community
The project welcomes community contributions through GitHub Issues for suggestions and bug reports, and Pull Requests for code contributions. Specific community channels like Discord or Slack are not detailed in the README.
Licensing & Compatibility
QuQu is licensed under the Apache License 2.0. This license is permissive and generally allows for commercial use and integration into closed-source projects.
Limitations & Caveats
Initial setup requires installing both Node.js and Python environments, with multiple options potentially adding complexity for novice users. Downloading FunASR models may be slow or encounter network issues depending on the user's location. Some macOS users might need to manage SSL certificate issues by installing a specific urllib3 version. The project's primary focus on Chinese language and domestic models suggests potential limitations in support or performance for other languages.
3 months ago
Inactive
janhq
LianjiaTech