OpenBMB
Autonomous multimodal mobile agent system
Top 62.0% on SourcePulse
AppCopilot addresses fundamental challenges in mobile agents: generalization across tasks, accuracy in on-screen interaction, long-horizon task completion, and efficiency on resource-constrained devices. It offers a general-purpose, on-device multimodal assistant capable of operating across applications and devices, benefiting researchers and developers aiming for sophisticated digital assistants.
How It Works
This system employs a multimodal, multi-agent architecture integrating foundation models with robust Chinese-English support. It leverages chain-of-thought reasoning, hierarchical task planning, and multi-agent collaboration for complex goal execution. The closed-loop system spans data collection, training, deployment, and efficient inference, with profiling-driven optimization for latency, memory, and energy across heterogeneous hardware.
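The hierarchical planning idea described above can be illustrated with a minimal sketch. This is not AppCopilot's code: `Subtask`, `Plan`, `plan_goal`, `execute_plan`, and the toy planner and actor are all hypothetical names, and the model calls a real system would make are replaced by plain Python functions. It only shows the general shape of a planner that decomposes a goal and an executor that retries each step.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Subtask:
    """One step of a decomposed goal (hypothetical structure)."""
    description: str
    done: bool = False


@dataclass
class Plan:
    """A goal together with its ordered subtasks (hypothetical structure)."""
    goal: str
    subtasks: List[Subtask] = field(default_factory=list)


def plan_goal(goal: str, planner: Callable[[str], List[str]]) -> Plan:
    """High-level step: ask the planner to split the goal into subtasks."""
    return Plan(goal, [Subtask(d) for d in planner(goal)])


def execute_plan(plan: Plan, actor: Callable[[str], bool],
                 max_retries: int = 2) -> bool:
    """Low-level step: attempt each subtask in order, retrying on failure.

    Returns True only if every subtask eventually succeeds.
    """
    for sub in plan.subtasks:
        for _ in range(max_retries + 1):
            if actor(sub.description):
                sub.done = True
                break
        if not sub.done:
            return False  # stop at the first subtask that never succeeds
    return True


def toy_planner(goal: str) -> List[str]:
    """Stand-in for a planning model: split the goal on the word 'then'."""
    return [step.strip() for step in goal.split(" then ")]


def toy_actor(step: str) -> bool:
    """Stand-in for an on-screen action model: always reports success."""
    return True


plan = plan_goal("open the notes app then create a note then type hello",
                 toy_planner)
ok = execute_plan(plan, toy_actor)
```

In a real agent the planner and actor would be backed by models and by feedback from the device screen; keeping them as injected callables here keeps the control flow visible and testable.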
Quick Start & Requirements
[...] ./YADB.
[...] pip install -r requirements.txt). Configure API keys and endpoints in ./wrappers/constants.py.
[...] run_agent.py (single-device) or cross_device_agent.py (multi-device).
Highlighted Details
Maintenance & Community
The primary source of information is the arXiv preprint arXiv:2509.02444. Contact email: qianc@sjtu.edu.cn. No community channels (Discord/Slack) are listed.
Licensing & Compatibility
The README does not specify a software license. Compatibility for commercial use or closed-source linking is undetermined.
Limitations & Caveats
The setup requires a dedicated server environment for vLLM model serving and complex network configurations (port forwarding), potentially posing an adoption barrier. The project is presented as a research artifact with a focus on demonstrating capabilities rather than a production-ready, easily deployable library.
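Because the setup depends on a model server reached through forwarded ports, a quick reachability check can save debugging time before launching the agent. The sketch below is an illustration only and is not part of the project: `port_is_open` is a hypothetical helper, and the host and port are placeholders to replace with the values in your own configuration.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # Placeholder endpoint (assumption): substitute your forwarded host and port.
    host, port = "127.0.0.1", 8000
    state = "reachable" if port_is_open(host, port) else "not reachable"
    print(f"{host}:{port} is {state}")
```

A successful TCP connection only shows that the forward is in place; it does not confirm that the model server behind it is healthy or serving the expected model.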