ii-agent  by Intelligent-Internet

Agentic framework for building and deploying intelligent agents

created 3 months ago
2,776 stars

Top 17.5% on sourcepulse

GitHubView on GitHub
Project Summary

II-Agent is an open-source framework for building and deploying intelligent agents, primarily targeting developers and researchers interested in AI-driven workflow automation. It provides an agentic interface to Anthropic Claude models, enabling complex task execution across domains like research, content generation, data analysis, and software development.

How It Works

II-Agent employs a core agent architecture that interacts with Large Language Models (LLMs) using system prompting with dynamically tailored context and comprehensive interaction history management. It features intelligent context management to handle token limitations and systematic LLM invocation with capability selection, enabling iterative refinement through execution cycles. The framework supports structured reasoning for planning and reflection, problem decomposition, and transparent decision-making.

Quick Start & Requirements

  • Installation: Clone the repository, set up a Python virtual environment (python -m venv .venv, source .venv/bin/activate), and install the package (pip install -e .). Frontend setup requires cd frontend, npm install.
  • Prerequisites: Python 3.10+, Node.js 18+ (for frontend), Google Cloud project with Vertex AI API enabled or an Anthropic API key. API keys for search providers (Tavily, Jina, Firecrawl, SerpAPI) and OpenAI (for image/video) are also configurable via environment variables.
  • Setup: Requires configuring API keys and potentially Google Cloud credentials in a .env file.
  • Links: GAIA Benchmark Evaluation

Highlighted Details

  • Evaluated on the GAIA benchmark, demonstrating strong performance in complex reasoning, tool use, and multi-step planning despite identified benchmark issues.
  • Supports file system operations with intelligent code editing, secure command-line execution, and advanced web interaction.
  • Experimental support for multimodal capabilities including PDF, audio, image, video, and slides.
  • Features a WebSocket-based interface for interactive use with a React frontend, streaming operational events for responsive UX.

Maintenance & Community

  • The project acknowledges inspiration and component adaptation from AugmentCode, Manus, and Index Browser Use.
  • No specific community links (Discord/Slack) or roadmap are provided in the README.

Licensing & Compatibility

  • Licensed under Apache 2.0.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The project notes several issues encountered during GAIA benchmark evaluation, including annotation errors, outdated information, and language ambiguity, which may impact performance on specific benchmark tasks. Experimental multimodal features are noted.

Health Check
Last commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
16
Issues (30d)
8
Star History
2,874 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.