PC-Agent by GAIR-NLP

AI framework for autonomous PC task completion

created 9 months ago
274 stars

Top 95.2% on sourcepulse

Project Summary

PC Agent is a framework for building autonomous digital agents by transferring human cognition, captured from real computer use, to AI models. It targets researchers and developers building agents that carry out complex, multi-step computer tasks independently on a user's machine.

How It Works

The system is a three-component pipeline: PC Tracker, which collects human-computer interaction data at scale; Cognition Completion, a post-processing stage that converts the raw data into cognitive trajectories; and a multi-agent system in which a planning agent handles decision-making and a grounding agent handles visual understanding. The aim is to give AI agents human-like cognitive processes so that task execution is more robust.
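The data flow through the last two stages can be sketched in a few lines of Python. This is only an illustration of the pipeline shape described above, not the project's code: every name here (RawEvent, CognitiveStep, complete_cognition, run_agent) and the event schema are hypothetical, and the model calls are replaced by plain functions passed in as arguments.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RawEvent:
    """One recorded human action, as a tracker might log it (hypothetical schema)."""
    action: str   # e.g. "click", "type"
    target: str   # description of the UI element acted on


@dataclass
class CognitiveStep:
    """A raw event paired with the reasoning reconstructed for it."""
    event: RawEvent
    thought: str


def complete_cognition(events: List[RawEvent],
                       explain: Callable[[RawEvent], str]) -> List[CognitiveStep]:
    """Turn raw events into a cognitive trajectory.

    `explain` stands in for the model call that reconstructs the thought
    behind each action (an assumption of this sketch).
    """
    return [CognitiveStep(event=e, thought=explain(e)) for e in events]


def run_agent(task: str,
              plan: Callable[[str, List[str]], str],
              ground: Callable[[str], tuple],
              max_steps: int = 10) -> List[tuple]:
    """Alternate planning and grounding until the planner returns "done".

    `plan` proposes the next action in words from the task and history;
    `ground` resolves that action to screen coordinates.
    """
    history: List[str] = []
    executed = []
    for _ in range(max_steps):
        action = plan(task, history)
        if action == "done":
            break
        executed.append((action, ground(action)))
        history.append(action)
    return executed
```

Passing the planner and grounder as functions keeps the loop testable with stubs and leaves open which models sit behind each role.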

Quick Start & Requirements

  • Requires Conda for Python environment setup.
  • Create the environment with conda env create -f environment.yml, then activate it with conda activate pcagent.
  • Building the PC Tracker executable requires Windows (.\package.ps1).
  • Cognition Completion requires an OpenAI API key.
  • See PC Tracker User Manual for detailed instructions.
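The two platform requirements above can be checked before starting. The following is a minimal sketch, not part of the project: the function name preflight is hypothetical, and it assumes the key is supplied through the OPENAI_API_KEY environment variable, which is what the OpenAI SDK reads by default. The project may expect the key elsewhere, so consult its manual.

```python
import os
import platform
from typing import List, Mapping, Optional


def preflight(env: Optional[Mapping[str, str]] = None,
              system: Optional[str] = None) -> List[str]:
    """Return a list of human-readable problems; an empty list means ready.

    `env` and `system` default to the real environment and OS, and can be
    overridden for testing.
    """
    env = os.environ if env is None else env
    system = platform.system() if system is None else system
    problems = []
    if system != "Windows":
        problems.append("PC Tracker executable build needs Windows")
    # Assumption: the key lives in OPENAI_API_KEY (the OpenAI SDK default).
    if not env.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set")
    return problems
```

Calling preflight() with no arguments inspects the current machine; printing the returned list shows which requirement is unmet.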

Highlighted Details

  • PC Tracker is presented as the first lightweight infrastructure for large-scale HCI data collection.
  • A demo showcases autonomous computer control on complex, multi-step tasks.
  • PC Agent-E, a new SOTA open-source model for Windows, has been released.

Maintenance & Community

  • Project is actively developed with recent updates and releases.
  • Citation details provided for academic referencing.

Licensing & Compatibility

  • License details are not explicitly stated in the README.
  • Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Building the PC Tracker executable requires Windows. Cognition Completion, a core processing step, relies on an OpenAI API key, which may add external cost and a third-party dependency.

Health Check

  • Last commit: 2 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 2

Star History

  • 41 stars in the last 90 days
