Discover and explore top open-source AI tools and projects—updated daily.
HKUDSMobile AI agents for on-device smartphone interaction
Top 66.8% on SourcePulse
<2-3 sentences summarising what the project addresses and solves, the target audience, and the benefit.> OpenPhone tackles the challenges of deploying AI agents on smartphones by introducing a compact, 3B-parameter, on-device vision-language foundation model. It targets developers and researchers seeking privacy-preserving, low-latency, and cost-free mobile AI solutions, enabling agentic capabilities directly on smartphones without cloud dependency.
How It Works
OpenPhone-3B is a vision-language model engineered for edge devices, striking a balance between capability and deployability on mobile NPUs and consumer GPUs. Its core innovation lies in a novel two-stage training approach combining Supervised Fine-Tuning (SFT) with GRPO-style Reinforcement Learning, utilizing synthetic GUI data. This methodology allows the 3B model to achieve performance comparable to larger 7B-9B models, offering significant speed and power efficiency advantages crucial for mobile environments.
Quick Start & Requirements
Evaluation primarily uses the AndroidLab benchmark framework. Recommended setup involves AVD on Mac (arm64). Model deployment and inference leverage pre-configured vLLM scripts. API setup for cloud model credentials is required in evaluation scripts. Detailed guides for model training and data generation are available separately.
Highlighted Details
Maintenance & Community
The project acknowledges related open-source contributions (AndroidLab, R1-V, LLaMA Factory) but does not detail specific maintainers, community channels, or a roadmap within the provided text.
Licensing & Compatibility
This project is released under the MIT License, permitting commercial use and closed-source linking.
Limitations & Caveats
Cloud models still handle approximately 65% of execution steps for complex reasoning. Batch testing scripts require manual transfer of generated evaluation files to prevent path conflicts. API setup for evaluation is currently manual.
3 weeks ago
Inactive