open-computer-use  by e2b-dev

AI agent for computer control via LLMs

Created 10 months ago
1,558 stars

Top 26.8% on SourcePulse

GitHubView on GitHub
Project Summary

This project enables an AI agent to securely interact with a Linux desktop environment, controlled by open-source LLMs. It's designed for developers and researchers looking to build AI-powered automation tools that can perform complex tasks on a computer, offering a secure sandbox and flexible LLM integration.

How It Works

The system leverages the E2B Desktop Sandbox to provide a secure, isolated Linux environment. The AI agent controls the sandbox via keyboard, mouse, and shell commands, with its actions streamed live to the client. This approach allows for complex, multi-step operations without compromising the host system, and its modular design facilitates easy integration of various LLMs for vision, action, and grounding.

Quick Start & Requirements

  • Install: poetry install followed by poetry run start
  • Prerequisites: Python 3.10+, Git, ffmpeg, E2B API key, and API keys for chosen LLM providers (e.g., Groq, OpenAI, Anthropic). A Hugging Face token is required to bypass rate limits.
  • Setup: Requires cloning the repository, setting environment variables in a .env file, and installing dependencies via Poetry.
  • Docs: E2B Desktop Sandbox

Highlighted Details

  • Supports over 10 LLM providers including Groq, OpenAI, Anthropic, Gemini, and open-source models via Hugging Face Spaces (OS-Atlas, ShowUI).
  • Allows pausing the agent and providing new prompts at any time during execution.
  • Designed for Ubuntu but adaptable to other operating systems.
  • Live display streaming of the sandbox environment to the client.

Maintenance & Community

The project is maintained by e2b-dev. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the README. Compatibility for commercial use or closed-source linking would require clarification on the licensing terms.

Limitations & Caveats

The project requires specific API keys for E2B and LLM providers, which may incur costs. The setup involves managing environment variables and dependencies via Poetry. The README does not detail specific performance benchmarks or known limitations of the sandbox environment.

Health Check
Last Commit

3 months ago

Responsiveness

1 day

Pull Requests (30d)
1
Issues (30d)
0
Star History
121 stars in the last 30 days

Explore Similar Projects

Starred by Edward Z. Yang Edward Z. Yang(Research Engineer at Meta; Maintainer of PyTorch), Anton Osika Anton Osika(Cofounder of Lovable), and
3 more.

gptme by gptme

0.3%
4k
CLI tool for terminal agent workflows
Created 2 years ago
Updated 20 hours ago
Starred by Justin Spahr-Summers Justin Spahr-Summers(Cocreator of Model Context Protocol; MTS at Anthropic), Jason Knight Jason Knight(Director AI Compilers at NVIDIA; Cofounder of OctoML), and
5 more.

mindcraft by mindcraft-bots

0.7%
4k
LLM agent for Minecraft
Created 2 years ago
Updated 1 day ago
Starred by Gregor Zunic Gregor Zunic(Cofounder of Browser Use) and Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

agent-zero by agent0ai

0.3%
12k
Agentic framework for general-purpose assistance
Created 1 year ago
Updated 14 hours ago
Starred by Alex Yu Alex Yu(Research Scientist at OpenAI; Former Cofounder of Luma AI), Didier Lopes Didier Lopes(Founder of OpenBB), and
5 more.

eliza by elizaOS

0.5%
17k
Agentic framework for autonomous agents
Created 1 year ago
Updated 18 hours ago
Feedback? Help us improve.