whisperIME by woheller69

Android IME based on Whisper

Created 1 year ago

531 stars

Top 59.6% on SourcePulse

Project Summary

This project provides an Android Input Method Editor (IME) and system-wide voice input service powered by OpenAI's Whisper. It targets Android users seeking efficient, offline voice-to-text capabilities for seamless integration with other applications or as a standalone dictation tool, offering privacy and convenience.

How It Works

WhisperIME leverages TFLite models of OpenAI's Whisper for on-device, offline voice recognition. Users can select between a fast, English-only model or a slower, multilingual model. The system integrates as an Android RecognitionService, allowing it to be chosen as the default voice input method, and supports standard Android intents for speech recognition.

Quick Start & Requirements

Install via F-Droid or direct APK download.
Initial launch requires internet to download Whisper TFLite models (~435 MB).
Voice recognition functions entirely offline after model download.
For system-wide voice input, configure via Android settings: System > Languages > Speech > Voice Input.

Highlighted Details

Functions as both an IME and a system-wide voice input service.
Supports standalone app usage with translation to English.
Offline processing ensures privacy and convenience.
Offers a choice between English-only (faster) and multilingual (broader language support) Whisper models.

Maintenance & Community

The project is maintained by woheller69. Further details on community or roadmap are not explicitly provided in the README.

Licensing & Compatibility

Licensed under MIT. The project utilizes components also under MIT and Apache-2.0 licenses. Compatible with commercial use and closed-source applications.

whisperIME by woheller69

Explore Similar Projects

Transcribro by soupslurpr

LiveWhisper by Nikorasu

RuntimeSpeechRecognizer by gtreshchev

AIVoiceChat by KoljaB

pywhispercpp by absadiki

ollama-voice-mac by apeatling

voice-chat-pdf by run-llama

whisper_android by vilassn

transcribe-anything by zackees

Scriberr by rishikanthc

RTranslator by niedev

sherpa-onnx by k2-fsa