Discover and explore top open-source AI tools and projects—updated daily.
dineshsoudagarLocal LLM inference on Android
Top 91.5% on SourcePulse
This project offers an Android application enabling local execution of Large Language Models (LLMs) like Gemma and Qwen directly on mobile devices. It provides offline, private, real-time chat, question answering, and advanced input capabilities (voice, image, OCR, camera), targeting users prioritizing on-device AI processing and data privacy.
How It Works
The application utilizes LiteRT and ONNX Runtime backends for efficient on-device LLM inference. A minimal base APK is installed, with users downloading desired models (Gemma, Qwen) post-installation. This architecture ensures full offline operation after model acquisition, supporting streaming responses, persistent local chat history, and in-app model management (download, switch, delete). It integrates voice input, image analysis via OCR, and camera capture for enhanced user interaction.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The provided README does not detail specific contributors, community channels (e.g., Discord, Slack), sponsorships, or a public roadmap.
Licensing & Compatibility
Limitations & Caveats
3 weeks ago
Inactive