Android app for offline speech recognition using OpenAI Whisper
Top 63.4% on sourcepulse
This repository provides two Android applications for offline speech recognition using OpenAI's Whisper model via TensorFlow Lite. It targets Android developers seeking to integrate robust ASR capabilities directly into their applications, offering both Java and native C++ interfaces for flexibility and performance.
How It Works
The project leverages TensorFlow Lite to run quantized Whisper models on Android devices. It offers two implementations: one using the TensorFlow Lite Java API for straightforward integration within Java-based Android projects, and another using the TensorFlow Lite Native API (C++) for potentially higher performance and lower overhead. A Python script is included to convert Whisper models to the TFLite format.
Quick Start & Requirements
whisper_java
or whisper_native
folders in Android Studio and build/run on a device or emulator.demo_and_apk
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README does not specify the exact Whisper model sizes supported or provide performance benchmarks. It also lacks explicit licensing information, which may impact commercial adoption. Users need to ensure correct audio file formats (16K, mono, 16bits) for transcription.
5 months ago
Inactive