EVAL  by corca-ai

Agentic framework for executing requests via search, code, and testing

Created 2 years ago
871 stars

Top 41.2% on SourcePulse

GitHubView on GitHub
Project Summary

EVAL is an AI agent designed to execute user requests by autonomously searching, coding, running, and testing on the internet. It targets users who need to automate complex tasks, from generating web applications to performing multimodal data analysis, by leveraging a flexible, self-evolving toolset.

How It Works

EVAL integrates LangChain, LlamaIndex, and Visual ChatGPT to create a versatile agent. It employs a modular architecture allowing it to dynamically create, modify, and execute code to build new tools as needed. This approach enables it to handle multimodal inputs (text, image, dataframe, with audio/video planned) and serve blocking processes like web applications.

Quick Start & Requirements

  • Install/Run: Use docker-compose up --build eval (CPU) or docker-compose up --build eval.gpu (GPU).
  • Prerequisites: Requires an OpenAI API key (OPENAI_API_KEY). GPU version has significant dependencies and is noted as unstable.
  • Setup: Docker installation is required.
  • Docs: Web GUI available at http://localhost:8000. API documentation via POST /api/execute.

Highlighted Details

  • Supports multimodal conversation including image understanding (BLIP) and generation (Stable Diffusion).
  • Can create and manage its own tools through code writing, patching, and deletion.
  • Includes built-in tools for terminal access, file operations, web search (Google, Bing, Wikipedia), and custom database lookups.
  • Offers both synchronous and asynchronous API execution endpoints.

Maintenance & Community

The project references LangChain, LlamaIndex, and Visual ChatGPT. No specific community channels or roadmap are detailed in the README.

Licensing & Compatibility

The README does not explicitly state a license. It references other projects with various licenses, implying potential compatibility considerations for commercial use.

Limitations & Caveats

The GPU version is described as "much heavier and unstable for now." Several features are marked as TODO, including GUI, memory saving, session management, and expanded multimodal support (audio, video).

Health Check
Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 30 days

Explore Similar Projects

Starred by Edward Z. Yang Edward Z. Yang(Research Engineer at Meta; Maintainer of PyTorch), Anton Osika Anton Osika(Cofounder of Lovable), and
3 more.

gptme by gptme

0.3%
4k
CLI tool for terminal agent workflows
Created 2 years ago
Updated 20 hours ago
Feedback? Help us improve.