Discover and explore top open-source AI tools and projects—updated daily.
remorsesDesktop automation CLI for AI agents
Top 94.2% on SourcePulse
A fast, native desktop automation CLI designed for AI agents, usecomputer enables programmatic control of macOS, Linux (X11), and Windows desktops. It provides essential functionalities like taking screenshots, controlling the mouse (move, click, drag, scroll), and synthesizing keyboard input, all executed via a Zig binary for high performance without a Node.js runtime. This tool is invaluable for AI agents that need to interact with graphical user interfaces to perform complex tasks.
How It Works
The core of usecomputer is a native Zig binary, leveraging N-API for its command-line interface. This approach bypasses the need for a Node.js runtime, ensuring efficient execution. It exposes granular control over desktop interactions, including precise mouse movements and clicks, keyboard input, and comprehensive screenshot capabilities. A key design choice is its screenshot scaling and coordinate mapping (coord-map) system, which normalizes image sizes and translates screenshot-relative coordinates back to absolute screen coordinates, facilitating reliable AI-driven UI interactions.
Quick Start & Requirements
npm install -g usecomputernpx skills add remorses/usecomputerDISPLAY set (Wayland via XWayland is supported).Highlighted Details
coord-map system to accurately translate coordinates from screenshots to real screen positions, essential for reliable UI interaction.AGENT_GRAPHICS environment variable for direct image output to stdout, streamlining integration with AI agents.stdin.Licensing & Compatibility
The provided README does not specify a software license. This absence makes it difficult to assess compatibility for commercial use or closed-source linking without further clarification.
Limitations & Caveats
Desktop automation input is blocked on locked Windows sessions. The README does not explicitly detail any alpha/beta status, but its focus on integration examples suggests a stable, production-ready state for its core features.
2 weeks ago
Inactive
askui
DevAgentForge