This project provides an iPhone agent powered by OpenAI's GPT-4.1 model, designed to automate tasks across multiple applications, mimicking human interaction without requiring a jailbreak. It's targeted at developers and power users looking to explore AI-driven mobile automation.
How It Works
The agent leverages Xcode's UI testing harness to inspect and interact with iOS applications and the system. It accesses the app's accessibility tree to perform actions like tapping, swiping, scrolling, and typing. A host app communicates with the UI test via a TCP server to trigger prompts, enabling the agent to execute commands like sending messages, downloading apps, or controlling device features.
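The prompt-triggering handshake described above can be sketched as a loopback demo in Python. This is an illustrative assumption of the shape of the exchange, not the project's actual protocol (which is implemented in Swift): the port, message framing, and "ack" reply are all hypothetical.

```python
import socket
import threading

def agent_server(port_box, ready):
    """Stand-in for the TCP server the UI test runs: accept one
    connection, read a prompt, and acknowledge it."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as srv:
        srv.bind(("127.0.0.1", 0))  # ephemeral port; the real port is project-defined
        srv.listen(1)
        port_box.append(srv.getsockname()[1])
        ready.set()
        conn, _ = srv.accept()
        with conn:
            prompt = conn.recv(1024).decode().strip()
            conn.sendall(f"ack: {prompt}\n".encode())

def send_prompt(port, prompt):
    """Stand-in for the host app: connect and trigger a prompt."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as c:
        c.connect(("127.0.0.1", port))
        c.sendall((prompt + "\n").encode())
        return c.recv(1024).decode().strip()

port_box, ready = [], threading.Event()
threading.Thread(target=agent_server, args=(port_box, ready), daemon=True).start()
ready.wait()
reply = send_prompt(port_box[0], "Send 'on my way' to Alice in Messages")
print(reply)  # ack: Send 'on my way' to Alice in Messages
```

In the real project the server side lives inside the UI test process, which is what lets a received prompt drive accessibility-tree actions on the device.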
Quick Start & Requirements
Run the testLoop function within PhoneAgentUITests.swift from Xcode to start the agent. Because prompts are executed against OpenAI's API, an OpenAI API key is required.
Highlighted Details
Maintenance & Community
This appears to be a personal project developed during an OpenAI hackathon, with no explicit mention of ongoing maintenance or community channels.
Licensing & Compatibility
The license is not specified in the README. Compatibility for commercial use or closed-source linking is not detailed.
Limitations & Caveats
The agent may struggle with UI elements during animations, doesn't always wait for long-running tasks, and currently lacks image recognition of the screen. Keyboard input and premature task abandonment are noted areas for improvement. This is experimental software, and user data is sent to OpenAI's API.