Transcribro  by soupslurpr

Android app for private, on-device speech recognition

Created 1 year ago
575 stars

Top 56.2% on SourcePulse

GitHubView on GitHub
Project Summary

Transcribro offers a private, on-device speech-to-text solution for Android, functioning as both a standalone keyboard and a system service. It targets Android users prioritizing data privacy and local processing for voice input, providing a secure alternative to cloud-based services.

How It Works

The system leverages whisper.cpp for efficient, on-device execution of OpenAI's Whisper models, enabling local speech recognition. Voice Activity Detection (VAD) is handled by Silero VAD, ensuring accurate segmentation of speech. This combination allows for real-time transcription directly on the device without sending audio data to external servers.

Quick Start & Requirements

  • Install: Download from Accrescent app store (recommended) or GitHub releases.
  • Prerequisites: Android device. APK verification via apksigner is recommended if downloading directly.
  • Resources: No specific hardware requirements beyond a functional Android device are listed, implying compatibility with a wide range of hardware.
  • Docs: https://github.com/soupslurpr/Transcribro

Highlighted Details

  • On-device processing for enhanced privacy.
  • Utilizes whisper.cpp for efficient model execution.
  • Functions as a system-wide speech-to-text input method.

Maintenance & Community

Licensing & Compatibility

  • The README does not explicitly state a license. Branding restrictions are in place for derivative works.

Limitations & Caveats

Currently, Transcribro only supports English language recognition, although multi-language support is planned. The branding restrictions may impact the creation of modified versions intended for public release.

Health Check
Last Commit

2 weeks ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
3
Star History
7 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Travis Fischer Travis Fischer(Founder of Agentic).

RealtimeSTT by KoljaB

0.5%
9k
Speech-to-text library for realtime applications
Created 2 years ago
Updated 2 months ago
Feedback? Help us improve.