cua by trycua

AI agent framework for computer OS control in virtual containers

Created 7 months ago
9,672 stars

Top 5.3% on SourcePulse

Project Summary

c/ua (Computer-Use Agent) is a framework designed to enable AI agents to control full operating systems within high-performance virtual containers. It targets developers and researchers building AI agents that require interaction with desktop environments, offering near-native performance on Apple Silicon and flexible OS support.

How It Works

c/ua leverages Apple's Virtualization.framework for high-performance macOS and Linux virtualization on Apple Silicon, achieving up to 97% of native speed. It provides a Computer-Use Interface (CUI) that allows AI systems to observe and control these virtual environments, enabling interaction with applications, web browsing, and complex workflows. The framework integrates with various LLM providers for agentic capabilities.
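
The observe-and-control cycle described above can be illustrated with a minimal sketch. Everything in it is hypothetical and for illustration only: `Action`, `FakeDesktop`, `scripted_policy`, and `run_agent` are not part of the c/ua API, the "desktop" merely records actions instead of driving a virtual machine, and the policy is a fixed script standing in for an LLM call.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """A single step the agent wants to take (hypothetical shape)."""
    kind: str          # e.g. "click", "type", or "done"
    payload: str = ""  # target or text for the action


class FakeDesktop:
    """Stand-in for a virtual machine: records actions instead of performing them."""

    def __init__(self) -> None:
        self.log: List[Action] = []

    def observe(self) -> str:
        # A real interface would return a screenshot or accessibility tree.
        return f"screen after {len(self.log)} actions"

    def act(self, action: Action) -> None:
        self.log.append(action)


def scripted_policy(observation: str) -> Action:
    """Fixed script standing in for a model that maps observations to actions."""
    if observation == "screen after 0 actions":
        return Action("click", "Safari icon")
    if observation == "screen after 1 actions":
        return Action("type", "https://example.com")
    return Action("done")


def run_agent(env: FakeDesktop,
              policy: Callable[[str], Action],
              max_steps: int = 10) -> int:
    """Observe, decide, act; stop on a "done" action or after max_steps."""
    for step in range(max_steps):
        observation = env.observe()
        action = policy(observation)
        if action.kind == "done":
            return step
        env.act(action)
    return max_steps


if __name__ == "__main__":
    desktop = FakeDesktop()
    steps = run_agent(desktop, scripted_policy)
    print(steps, [a.kind for a in desktop.log])
```

The `max_steps` cap is a design choice worth keeping in any real loop: it bounds cost and prevents an agent from running indefinitely when the model never signals completion.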

Quick Start & Requirements

  • Install Lume CLI:
    /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/trycua/cua/main/libs/lume/scripts/install.sh)"
    
  • Pull macOS CUA image:
    lume pull macos-sequoia-cua:latest
    
  • Install Python libraries:
    pip install cua-computer "cua-agent[all]"
    
  • Prerequisites: Mac with Apple Silicon (M1/M2/M3/M4), macOS 15 (Sequoia) or newer, Python 3.10+.
  • Resources: 30GB+ disk space recommended.
  • Docs: Computer, Agent, Notebooks
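
Before installing, it may help to confirm that a machine meets the prerequisites listed above. The following is a rough sketch using only the Python standard library. The function names are made up for this example, and the thresholds (arm64, macOS 15, Python 3.10, 30 GB free) are taken directly from the list. One caveat: a Python interpreter running under Rosetta reports `x86_64` even on Apple Silicon, so a failed architecture check is not conclusive.

```python
from __future__ import annotations

import platform
import shutil
import sys


def check_prerequisites(machine: str,
                        macos_version: str,
                        python_version: tuple[int, int],
                        free_disk_gb: float) -> list[str]:
    """Return human-readable problems; an empty list means every check passed."""
    problems = []
    if machine != "arm64":
        problems.append(f"Apple Silicon (arm64) required, found {machine or 'unknown'}")
    # platform.mac_ver() yields an empty version string on non-macOS systems.
    major = int(macos_version.split(".")[0]) if macos_version else 0
    if major < 15:
        problems.append(f"macOS 15 or newer required, found {macos_version or 'none'}")
    if python_version < (3, 10):
        problems.append(f"Python 3.10+ required, found {python_version[0]}.{python_version[1]}")
    if free_disk_gb < 30:
        problems.append(f"30 GB free disk recommended, found {free_disk_gb:.0f} GB")
    return problems


def check_this_machine() -> list[str]:
    """Gather values from the running system and apply the checks."""
    return check_prerequisites(
        machine=platform.machine(),
        macos_version=platform.mac_ver()[0],
        python_version=sys.version_info[:2],
        free_disk_gb=shutil.disk_usage("/").free / 1e9,
    )


if __name__ == "__main__":
    issues = check_this_machine()
    for issue in issues:
        print("WARN:", issue)
    if not issues:
        print("All prerequisites met.")
```

Keeping the checks in a pure function separate from the system probes makes the thresholds easy to test or adjust without touching a real Mac.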

Highlighted Details

  • Near-native performance (up to 97%) on Apple Silicon via Virtualization.framework.
  • Supports both macOS and Linux virtual environments within a single framework.
  • Integrated Computer-Use Interface for AI agent interaction with OS.
  • Built-in LLM integration for OpenAI, Anthropic, and other providers.
  • Gradio UI available for interactive agent deployment.

Maintenance & Community

The project has a Discord community (https://discord.com/invite/mVnXXpdE85) and lists several active contributors.

Licensing & Compatibility

Licensed under the MIT License. Microsoft's OmniParser, used within the project, is licensed under CC-BY-4.0. Compatible with commercial use under MIT terms.

Limitations & Caveats

Requires macOS 15 (Sequoia) or newer on Apple Silicon, so Intel Macs and older macOS releases are not supported. The project is under active development, so features and stability may change between releases.

Health Check

  • Last commit: 1 day ago
  • Responsiveness: 1 day
  • Pull requests (30d): 41
  • Issues (30d): 15
  • Star history: 360 stars in the last 30 days

Explore Similar Projects

Starred by Peter Norvig (Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Zhen Lu (Cofounder of Runpod), and 1 more.

agents-towards-production by NirDiamant

2.2%
13k
Production-ready GenAI agent tutorials
Created 3 months ago
Updated 2 weeks ago
Starred by Elie Bursztein (Cybersecurity Lead at Google DeepMind), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 7 more.

SuperAGI by TransformerOptimus

0.1%
17k
Open-source framework for autonomous AI agent development
Created 2 years ago
Updated 7 months ago
Starred by Wes McKinney (Author of Pandas), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 22 more.

autogen by microsoft

0.5%
50k
Agentic framework for multi-agent AI applications
Created 2 years ago
Updated 18 hours ago