computer-agent by suitedaces

Desktop app for AI computer control via Claude API

created 9 months ago
478 stars

Top 64.9% on sourcepulse

Project Summary

This project provides a self-hosted desktop application that lets an AI model, specifically Anthropic's Claude, control a computer's mouse and keyboard. It targets users who want to experiment with AI-driven automation of tasks such as browsing, coding, and general computer interaction by delegating them to the model.

How It Works

The application leverages Anthropic's computer use capability, enabling Claude to interpret natural language commands and execute them via simulated mouse and keyboard inputs. It uses Python with the PyQt framework for the desktop interface and relies on screenshots to understand the current UI state for accurate interaction.
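The screenshot-driven loop described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the Action schema, run_agent, and the three callables (take_screenshot, ask_model, perform) are hypothetical names introduced here, and the model call is replaced by a scripted stand-in so the sketch runs without a display or an API key.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Action:
    """One input event requested by the model (hypothetical schema)."""
    kind: str                      # e.g. "click", "type", "done"
    x: Optional[int] = None
    y: Optional[int] = None
    text: Optional[str] = None


def run_agent(
    goal: str,
    take_screenshot: Callable[[], bytes],
    ask_model: Callable[[str, bytes, List[Action]], Action],
    perform: Callable[[Action], None],
    max_steps: int = 20,
) -> List[Action]:
    """Observe the screen, ask the model for one action, apply it, repeat."""
    history: List[Action] = []
    for _ in range(max_steps):
        screenshot = take_screenshot()            # fresh view of the UI state
        action = ask_model(goal, screenshot, history)
        if action.kind == "done":                 # model reports the task is finished
            break
        perform(action)                           # simulated mouse or keyboard input
        history.append(action)
    return history


# Stand-ins (assumptions) so the loop can run in isolation.
def fake_screenshot() -> bytes:
    return b"<png bytes>"


def fake_model(goal: str, screenshot: bytes, history: List[Action]) -> Action:
    script = [
        Action("click", x=100, y=200),
        Action("type", text="hello"),
        Action("done"),
    ]
    return script[len(history)]


performed: List[Action] = []
steps = run_agent("say hello", fake_screenshot, fake_model, performed.append)
print([a.kind for a in steps])
```

Passing the screenshot, model, and input functions in as parameters keeps the loop itself independent of any particular GUI or API library; max_steps bounds the run in case the model never signals completion.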

Quick Start & Requirements

  • Install via pip install -r requirements.txt after setting up a Python 3.10+ virtual environment.
  • Requires an Anthropic API key, set in a .env file.
  • Installing Firefox is recommended for improved UI detection.
  • Official quick-start and setup instructions are available in the README.
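As a rough pre-flight check for the requirements above, the sketch below verifies the Python version and reads an API key from a .env file using only the standard library. The variable name ANTHROPIC_API_KEY and the helper names are assumptions made for illustration; the README is the authority on what the application actually expects.

```python
import sys
import tempfile
from pathlib import Path
from typing import Dict, Tuple


def meets_python_requirement(version: Tuple[int, int] = sys.version_info[:2]) -> bool:
    """True when the interpreter is Python 3.10 or newer."""
    return version >= (3, 10)


def parse_env_file(path: Path) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values: Dict[str, str] = {}
    for raw in path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("'\"")
    return values


def load_api_key(env_path: Path, key_name: str = "ANTHROPIC_API_KEY") -> str:
    """Return the key from the .env file or raise with a readable message."""
    if not env_path.is_file():
        raise RuntimeError(f"{env_path} not found")
    key = parse_env_file(env_path).get(key_name, "")
    if not key:
        raise RuntimeError(f"{key_name} is missing or empty in {env_path}")
    return key


# Demonstration with a throwaway file and a placeholder (not real) key.
with tempfile.TemporaryDirectory() as tmp:
    env_path = Path(tmp) / ".env"
    env_path.write_text("# local secrets\nANTHROPIC_API_KEY=placeholder-key\n", encoding="utf-8")
    print("Python OK:", meets_python_requirement())
    print("Key loaded:", load_api_key(env_path) == "placeholder-key")
```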

Highlighted Details

  • Lets the AI perform any task that can be carried out with a mouse and keyboard.
  • Supports macOS, Windows, and Linux platforms where Python runs.
  • Includes productivity keybindings for controlling agent actions.
  • Acknowledges that AI computer control is currently slow and experimental.

Maintenance & Community

The project is a recent creation, with the author open to contributions and ideas via Twitter (@ishanxnagpal). A formal roadmap is not yet established.

Licensing & Compatibility

Licensed under the Apache License 2.0, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

This is experimental software with potential for unintended actions. The AI may occasionally fail to take screenshots for validation, leading to incorrect input placement. Users are advised to monitor actions closely and be aware that Anthropic can view screenshots during operation.
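One possible mitigation for the missed-screenshot problem is to refuse input unless a recent screenshot backs it. The sketch below is purely illustrative and is not something the project is known to implement: ScreenshotGuard, its method names, and the 2-second freshness window are all hypothetical.

```python
import time
from typing import Callable, Optional


class ScreenshotGuard:
    """Allow an action only if a recent, unused screenshot exists (hypothetical helper)."""

    def __init__(
        self,
        max_age_seconds: float = 2.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._max_age = max_age_seconds
        self._clock = clock
        self._last_shot: Optional[float] = None

    def record_screenshot(self) -> None:
        """Call right after a screenshot is captured."""
        self._last_shot = self._clock()

    def allow_action(self) -> bool:
        """Return True only when a fresh screenshot validates this action."""
        if self._last_shot is None:
            return False
        fresh = (self._clock() - self._last_shot) <= self._max_age
        self._last_shot = None     # each screenshot validates at most one action
        return fresh


# Demonstration with a controllable clock instead of real time.
now = [0.0]
guard = ScreenshotGuard(max_age_seconds=2.0, clock=lambda: now[0])

print(guard.allow_action())        # no screenshot has been taken yet
guard.record_screenshot()
print(guard.allow_action())        # screenshot is fresh
print(guard.allow_action())        # same screenshot cannot be reused
guard.record_screenshot()
now[0] = 5.0                       # simulate a long delay before acting
print(guard.allow_action())        # screenshot is too old
```

A guard like this does not replace human monitoring; it only narrows the window in which the model can act on an outdated view of the screen.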

Health Check

  • Last commit: 6 months ago
  • Responsiveness: 1 day
  • Pull requests (30d): 1
  • Issues (30d): 0
  • Star history: 39 stars in the last 90 days
