cross-platform-llm-client by orailnoor

Cross-platform AI chat client for local and cloud LLM inference

Created 2 months ago

665 stars

Top 49.8% on SourcePulse

Project Summary

This project is a cross-platform AI chat client built with Flutter and described as production-ready. It runs LLMs locally on Android and iOS devices and can switch to cloud APIs when needed. Both modes share one interface, so users control where their data goes and where models run.

How It Works

The client uses Flutter for the UI and GetX for state management. Local inference on Android and iOS runs through a custom llama.cpp plugin (llama_flutter_android), with GPU acceleration via Vulkan on Android and Metal on iOS. It loads models in GGUF format, detects device RAM to choose a suitable configuration, and falls back to cloud APIs (OpenAI, Anthropic, Google Gemini, Kimi) on unsupported platforms or when more capable models are needed. A Services layer abstracts inference, cloud communication, and data persistence (Hive).
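The local-first, cloud-fallback behavior described above can be sketched as a small backend-selection routine. This is an illustrative sketch in Python, not the project's Dart code: `InferenceBackend`, `LocalBackend`, `CloudBackend`, and `pick_backend` are hypothetical names, and the `generate` methods return placeholder strings instead of calling a model or an API.

```python
from dataclasses import dataclass
from typing import Optional, Protocol, Sequence


class InferenceBackend(Protocol):
    """Hypothetical interface: anything that can turn a prompt into a reply."""

    name: str

    def is_available(self) -> bool: ...

    def generate(self, prompt: str) -> str: ...


@dataclass
class LocalBackend:
    """Stands in for on-device GGUF inference; no real model is loaded."""

    platform: str               # e.g. "android", "ios", "web"
    model_path: Optional[str]   # path to a downloaded GGUF file, if any
    name: str = "local"

    def is_available(self) -> bool:
        # The summary lists local inference for Android and iOS only.
        return self.platform in {"android", "ios"} and self.model_path is not None

    def generate(self, prompt: str) -> str:
        return f"[local] {prompt}"  # placeholder reply


@dataclass
class CloudBackend:
    """Stands in for a cloud provider; no network request is made."""

    provider: str               # e.g. "openai"
    api_key: Optional[str]
    name: str = "cloud"

    def is_available(self) -> bool:
        return bool(self.api_key)

    def generate(self, prompt: str) -> str:
        return f"[{self.provider}] {prompt}"  # placeholder reply


def pick_backend(backends: Sequence[InferenceBackend]) -> InferenceBackend:
    """Return the first available backend, in caller-supplied priority order."""
    for backend in backends:
        if backend.is_available():
            return backend
    raise RuntimeError("no inference backend is available")
```

Listing the local backend first gives on-device inference priority. On the web build, where local inference is unavailable, the same call returns the cloud backend.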

Quick Start & Requirements

Prerequisites: Flutter SDK >= 3.3.0, Android SDK (API 26+), JDK 17, NDK (bundled with Android SDK).
Android: run flutter pub get, then cd android and ./gradlew assembleDebug (or assembleRelease).
iOS: run flutter pub get, then cd ios, pod install, and flutter build ios. For iPad sideloading, download PrivateLM-iOS.zip from the Releases page and install the .ipa via AltStore, Sideloadly, or Xcode.
Web: run flutter pub get, then flutter build web --release.
Local Inference (iOS): Requires Metal GPU acceleration.
Local Inference (Web): Not currently supported. The web build is cloud-only, with local inference planned.

Highlighted Details

Local Inference: GGUF models run directly on Android (Vulkan) and iOS (Metal) with GPU acceleration, requiring no internet after download.
Cloud API Integration: Seamless switching between OpenAI, Anthropic, Google Gemini, and Kimi (Moonshot AI).
Multimodal Chat: Supports sending text and images, with vision working for local (Qwen2-VL) and cloud models.
Smart Auto-Configuration: Automatically detects device RAM to recommend optimal context size and token limits.
Task Management: Includes a dedicated view for structured AI-assisted workflows alongside free-form chat.
Data Persistence: All chats, tasks, and settings are stored locally using Hive.
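As an illustration of the Smart Auto-Configuration item above, RAM-based recommendation can be sketched as a lookup over memory tiers. The thresholds and values below are assumptions chosen for the example, not the project's actual numbers, and `InferenceConfig` and `recommend_config` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InferenceConfig:
    context_size: int  # tokens of context the model is given
    max_tokens: int    # upper bound on generated tokens per reply


# Illustrative tiers only: (minimum RAM in GiB, recommended config),
# ordered from largest to smallest. Not the project's real values.
_TIERS = [
    (12.0, InferenceConfig(context_size=8192, max_tokens=2048)),
    (8.0, InferenceConfig(context_size=4096, max_tokens=1024)),
    (6.0, InferenceConfig(context_size=2048, max_tokens=512)),
    (0.0, InferenceConfig(context_size=1024, max_tokens=256)),
]


def recommend_config(total_ram_bytes: int) -> InferenceConfig:
    """Map total device RAM to the first tier whose minimum it meets."""
    if total_ram_bytes < 0:
        raise ValueError("total_ram_bytes must be non-negative")
    ram_gib = total_ram_bytes / 2**30
    for min_gib, config in _TIERS:
        if ram_gib >= min_gib:
            return config
    return _TIERS[-1][1]  # unreachable with a 0.0 floor tier; kept for safety
```

A tier table keeps the policy in one place, so the thresholds can be tuned per device class without touching the lookup logic.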

Maintenance & Community

The README gives no details on contributors, sponsorships, or community channels such as Discord or Slack.

Licensing & Compatibility

License: MIT License.
Compatibility: The MIT License is permissive and allows commercial use.

Limitations & Caveats

The Web platform currently supports cloud APIs only, with local inference planned for the future. iPhone support is experimental; iPad is the recommended iOS target because of the RAM that local models require. Android release builds require configuring signing keys.
