Discover and explore top open-source AI tools and projects—updated daily.
OpenBMBOn-device multimodal chat for mobile
Top 86.7% on SourcePulse
This project provides fully offline, on-device multimodal chat capabilities for iOS, Android, and HarmonyOS NEXT, enabling users to run MiniCPM-V family models directly on their mobile devices. It targets developers and power users seeking to integrate advanced AI features into mobile applications without relying on cloud infrastructure, offering enhanced privacy and potentially lower latency.
How It Works
The core of the project utilizes a fork of llama.cpp specifically adapted for MiniCPM-V models. This enables efficient execution of large multimodal language models directly on mobile hardware. The approach bypasses the need for server-side processing, allowing for complete local inference and offering a novel solution for deploying sophisticated AI on resource-constrained mobile platforms.
Quick Start & Requirements
git clone --recurse-submodules --shallow-submodules). Platform-specific builds require native IDEs: Xcode for iOS (after building llama.xcframework via ./scripts/build_xcframework.sh), Android Studio for Android (using ./gradlew assembleDebug), and DevEco Studio for HarmonyOS. Pre-built packages are available via DOWNLOAD.md.Highlighted Details
llama.cpp integration.Licensing & Compatibility
The license for the OpenBMB/MiniCPM-V-Apps repository itself is not explicitly stated in the provided README. It bundles llama.cpp, which is typically MIT licensed. Compatibility for commercial use or linking with closed-source applications would depend on the specific licenses of the app repository and the MiniCPM-V model weights.
Limitations & Caveats
Building the llama.xcframework for iOS may produce harmless warnings. Deployment to iOS devices might require an Apple Developer account. Specific NDK and CMake versions are pinned for Android builds. Device RAM requirements are critical; lower-spec devices may struggle, especially with larger models like V 2.6, and larger context windows increase RAM usage linearly. The project manages llama.cpp as a submodule, necessitating specific cloning and update procedures.
1 day ago
Inactive