mobile-mcp  by mobile-next

MCP server for mobile automation via agentic workflows

Created 9 months ago
3,029 stars

Top 15.7% on SourcePulse

GitHubView on GitHub
Project Summary

This project provides a Model Context Protocol (MCP) server for automating iOS and Android mobile applications, targeting developers and AI agents. It offers a platform-agnostic interface to interact with native apps on simulators, emulators, and physical devices, simplifying mobile testing and data extraction.

How It Works

The server leverages native accessibility trees for efficient interaction, falling back to screenshot-based coordinate analysis when accessibility data is unavailable. This hybrid approach aims for deterministic tool application, reducing ambiguity compared to purely visual methods. It enables LLMs to interact with mobile UIs by evaluating screen content and executing actions based on structured data or visual analysis.

Quick Start & Requirements

  • Install/Run: npx -y @mobilenext/mobile-mcp@latest
  • Prerequisites: Xcode command line tools, Android Platform Tools, Node.js, MCP-supported foundational models (e.g., Claude, OpenAI Agent SDK), configured mobile SDKs (Xcode, Android SDK), and simulators/emulators or physical devices.
  • Setup: Integration with IDEs like Cursor, Claude, VS Code, or GitHub Copilot is configured via a JSON snippet.

Highlighted Details

  • Supports native app automation for testing and data entry.
  • Enables LLM-driven multi-step user journeys and agent-to-agent communication.
  • Extracts structured data from screen content.
  • Provides a "Visual Sense" fallback for interactions when accessibility data is missing.

Maintenance & Community

The project acknowledges contributors and provides a roadmap for future enhancements. Links to community resources are not explicitly provided in the README.

Licensing & Compatibility

The README does not specify a license. Compatibility for commercial use or closed-source linking is not detailed.

Limitations & Caveats

The project requires significant setup of mobile development environments and AI agent integrations. Specific details on performance benchmarks or unsupported device/OS versions are not provided.

Health Check
Last Commit

5 days ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
6
Star History
404 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Gregor Zunic Gregor Zunic(Cofounder of Browser Use).

droidrun by droidrun

0.8%
7k
Framework for controlling Android devices via LLM agents
Created 9 months ago
Updated 1 day ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Yaowei Zheng Yaowei Zheng(Author of LLaMA-Factory), and
5 more.

trae-agent by bytedance

0.7%
10k
LLM-powered CLI for software engineering tasks
Created 7 months ago
Updated 3 months ago
Feedback? Help us improve.