open-computer-use by LLmHub-dev

Autonomous agents with real computer control

Created 3 months ago

305 stars

Top 88.1% on SourcePulse

Project Summary

Autonomous AI agents capable of performing tasks on computers are enabled by Open Computer Use, an open-source framework designed for scalable, auditable, and self-hostable AI workflows. It empowers developers by providing agents control over browsers, terminals, and desktop applications, offering a production-ready solution for sophisticated AI-driven automation.

How It Works

The system utilizes specialized Browser, Terminal, and Desktop agents orchestrated by a multi-agent executor. Browser agents navigate the web via search APIs and form filling, Terminal agents execute commands and manage files in isolated environments, and Desktop agents interact with GUIs using computer vision. An AI planner decomposes complex tasks, assigns them to agents, manages context, and handles feedback within a Dockerized Ubuntu VM.

Quick Start & Requirements

Prerequisites: Node.js 20+, Python 3.10+, npm, pip, Docker, Docker Compose, Supabase account, and API keys for AI providers (e.g., OpenAI, Anthropic) and Google Search.
Installation: Clone repo, set up Supabase schema, configure .env files, install frontend (npm install) and backend (pip install -r requirements.txt) dependencies.
Running: Use docker-compose up --build for full setup or manual start for frontend (npm run dev) and backend (python main.py).
Demos & Docs: Available via the project website (https://llmhub.dev) and in-README demo links.

Highlighted Details

Multi-Provider AI Support: Integrates numerous AI providers (OpenAI, Anthropic, Google Gemini, etc.) with Bring Your Own Keys (BYOK).
Real-Time Streaming & Feedback: Provides live task progress, tool calls, streaming responses, detailed logs, and VM screenshots.
Advanced Task Planning: AI planner automatically analyzes, decomposes, assigns, and executes complex tasks across specialized agents.
Secure VM Isolation: Agent sessions run in isolated, ephemeral Docker containers with sandboxed execution and network isolation options.

Maintenance & Community

Contributions are encouraged via GitHub issues and pull requests. Community support and updates are available on Discord and X (Twitter). A roadmap details planned features through Q2 2026.

Licensing & Compatibility

Licensed under the permissive Apache License 2.0, suitable for commercial use and integration into closed-source applications without significant restrictions.

Limitations & Caveats

The Desktop Agent currently supports Linux desktop environments; Windows and macOS support are planned. While production-ready, the quick start focuses on development, and significant setup is required.

open-computer-use by LLmHub-dev

Explore Similar Projects

cyberdesk by cyberdesk-hq

hello-halo by openkursar

open-agent by AFK-surf

Agentarium by Thytu

toad by batrachianai

awesome-ai-sdks by e2b-dev

swarm by parruda

ryos by ryokun6

Claude-Cowork by DevAgentForge

Auto-GPT-ZH by kaqijiang

ag2 by ag2ai

SuperAGI by TransformerOptimus