agent  by trymeka

Autonomous browsing agent for state-of-the-art web task completion

created 1 month ago
318 stars

Top 85.0% on SourcePulse

GitHubView on GitHub
Project Summary

Meka Agent is an open-source, autonomous agent designed for state-of-the-art web browsing and computer interaction. It aims to mimic human-like interaction by relying purely on visual input and operating within a full computer context, making it suitable for researchers and developers building complex automation workflows.

How It Works

Meka Agent utilizes a vision-centric approach, processing visual information from the computer environment to understand and act. It supports a flexible architecture, allowing users to integrate various Large Language Models (LLMs) with strong visual grounding (e.g., OpenAI o3, Claude Sonnet 4, Claude Opus 4) and infrastructure providers that offer OS-level controls beyond browser screenshots. This OS-level access is crucial for interacting with elements like dropdowns, alerts, and file uploads, which are often rendered at the system level.

Quick Start & Requirements

  • Install: npm install @trymeka/core @trymeka/ai-provider-vercel @ai-sdk/openai @trymeka/computer-provider-anchor-browser playwright-core
  • Prerequisites: OpenAI API Key, Anchor Browser API Key.
  • Setup: Requires Node.js environment. Configuration involves creating a .env file with API keys.

Highlighted Details

  • Achieves 72.7% on the WebArena benchmark.
  • Supports a "Bring Your Own LLM" philosophy via Vercel's ai-sdk.
  • Designed for extensibility with custom tools and providers.
  • Written in TypeScript for a typesafe API.

Maintenance & Community

The project is open-source with a call for contributions. Links to contributing guidelines are available.

Licensing & Compatibility

Licensed under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The primary infrastructure provider mentioned is Anchor Browser, suggesting potential vendor lock-in or a need for specific VM-based environments for full functionality. While other providers are welcome, extensive testing is noted for OpenAI and Claude models.

Health Check
Last commit

5 days ago

Responsiveness

Inactive

Pull Requests (30d)
69
Issues (30d)
1
Star History
317 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Mckay Wrigley Mckay Wrigley(Founder of Takeoff AI), and
1 more.

chatGPTBox by ChatGPTBox-dev

0.1%
11k
Browser extension for ChatGPT integration
created 2 years ago
updated 4 days ago
Starred by Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
4 more.

composio by ComposioHQ

0.2%
26k
Toolset for AI agent integrations via function calling
created 1 year ago
updated 22 hours ago
Feedback? Help us improve.