Discover and explore top open-source AI tools and projects—updated daily.
callstackincubatorCLI for AI agents to automate mobile, TV, and desktop app interactions
Top 20.4% on SourcePulse
Summary
agent-device is a CLI tool enabling AI agents to control mobile, TV, and desktop apps. It facilitates real-world app interaction, UI inspection, and evidence collection using token-efficient accessibility snapshots, not pixel screenshots. This empowers AI agents for automated QA, development, and testing, closing the loop from code generation to verified execution and feedback.
How It Works
The tool leverages accessibility snapshots for compact UI trees, allowing agents to interact via element references (e.g., @e3). It supports actions like touch and text input, plus evidence capture (screenshots, video, logs, performance metrics) triggered on demand. It integrates with platform backends like XCTest (iOS/tvOS) and ADB (Android) for unified device automation.
Quick Start & Requirements
Install via npm install -g agent-device@latest. Prerequisites include Node.js 22+, Xcode (Apple targets), Android SDK + ADB (Android), and macOS Accessibility permissions (desktop). See agent-device.dev for details.
Highlighted Details
.ad replay scripts for CI/CD and local execution, alongside e2e tests and debugging artifacts.Maintenance & Community
Developed by Callstack. Contributing guidelines are available via CONTRIBUTING.md. Project resources are at agent-device.dev.
Licensing & Compatibility
Released under the permissive MIT license, allowing free use in commercial and closed-source projects.
Limitations & Caveats
Known limitations are documented separately. Critical setup steps include macOS Accessibility permissions for desktop automation. The tool's focus is agentic workflows, implying potential areas for refinement in human-centric usability.
16 hours ago
Inactive