open-computer-use  by LLmHub-dev

Autonomous agents with real computer control

Created 1 month ago
262 stars

Top 97.4% on SourcePulse

GitHubView on GitHub
Project Summary

Autonomous AI agents capable of performing tasks on computers are enabled by Open Computer Use, an open-source framework designed for scalable, auditable, and self-hostable AI workflows. It empowers developers by providing agents control over browsers, terminals, and desktop applications, offering a production-ready solution for sophisticated AI-driven automation.

How It Works

The system utilizes specialized Browser, Terminal, and Desktop agents orchestrated by a multi-agent executor. Browser agents navigate the web via search APIs and form filling, Terminal agents execute commands and manage files in isolated environments, and Desktop agents interact with GUIs using computer vision. An AI planner decomposes complex tasks, assigns them to agents, manages context, and handles feedback within a Dockerized Ubuntu VM.

Quick Start & Requirements

  • Prerequisites: Node.js 20+, Python 3.10+, npm, pip, Docker, Docker Compose, Supabase account, and API keys for AI providers (e.g., OpenAI, Anthropic) and Google Search.
  • Installation: Clone repo, set up Supabase schema, configure .env files, install frontend (npm install) and backend (pip install -r requirements.txt) dependencies.
  • Running: Use docker-compose up --build for full setup or manual start for frontend (npm run dev) and backend (python main.py).
  • Demos & Docs: Available via the project website (https://llmhub.dev) and in-README demo links.

Highlighted Details

  • Multi-Provider AI Support: Integrates numerous AI providers (OpenAI, Anthropic, Google Gemini, etc.) with Bring Your Own Keys (BYOK).
  • Real-Time Streaming & Feedback: Provides live task progress, tool calls, streaming responses, detailed logs, and VM screenshots.
  • Advanced Task Planning: AI planner automatically analyzes, decomposes, assigns, and executes complex tasks across specialized agents.
  • Secure VM Isolation: Agent sessions run in isolated, ephemeral Docker containers with sandboxed execution and network isolation options.

Maintenance & Community

Contributions are encouraged via GitHub issues and pull requests. Community support and updates are available on Discord and X (Twitter). A roadmap details planned features through Q2 2026.

Licensing & Compatibility

Licensed under the permissive Apache License 2.0, suitable for commercial use and integration into closed-source applications without significant restrictions.

Limitations & Caveats

The Desktop Agent currently supports Linux desktop environments; Windows and macOS support are planned. While production-ready, the quick start focuses on development, and significant setup is required.

Health Check
Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
67 stars in the last 30 days

Explore Similar Projects

Starred by Andrew Ng Andrew Ng(Founder of DeepLearning.AI; Cofounder of Coursera; Professor at Stanford), Thomas Wolf Thomas Wolf(Cofounder of Hugging Face), and
4 more.

ag2 by ag2ai

0.7%
4k
AgentOS for building AI agents and facilitating multi-agent cooperation
Created 1 year ago
Updated 3 days ago
Starred by Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
7 more.

SuperAGI by TransformerOptimus

0.1%
17k
Open-source framework for autonomous AI agent development
Created 2 years ago
Updated 10 months ago
Feedback? Help us improve.