ii-agent by Intelligent-Internet

Agentic framework for building and deploying intelligent agents

Created 10 months ago

3,168 stars

Top 14.9% on SourcePulse

View on GitHub

2 Experts Love This Project

Vincent Weisser

Cofounder of Prime Intellect

Chip Huyen

Author of "AI Engineering", "Designing Machine Learning Systems"

Project Summary

II-Agent is an open-source framework for building and deploying intelligent agents, primarily targeting developers and researchers interested in AI-driven workflow automation. It provides an agentic interface to Anthropic Claude models, enabling complex task execution across domains like research, content generation, data analysis, and software development.

How It Works

II-Agent employs a core agent architecture that interacts with Large Language Models (LLMs) using system prompting with dynamically tailored context and comprehensive interaction history management. It features intelligent context management to handle token limitations and systematic LLM invocation with capability selection, enabling iterative refinement through execution cycles. The framework supports structured reasoning for planning and reflection, problem decomposition, and transparent decision-making.

Quick Start & Requirements

Installation: Clone the repository, set up a Python virtual environment (python -m venv .venv, source .venv/bin/activate), and install the package (pip install -e .). Frontend setup requires cd frontend, npm install.
Prerequisites: Python 3.10+, Node.js 18+ (for frontend), Google Cloud project with Vertex AI API enabled or an Anthropic API key. API keys for search providers (Tavily, Jina, Firecrawl, SerpAPI) and OpenAI (for image/video) are also configurable via environment variables.
Setup: Requires configuring API keys and potentially Google Cloud credentials in a .env file.
Links: GAIA Benchmark Evaluation

Highlighted Details

Evaluated on the GAIA benchmark, demonstrating strong performance in complex reasoning, tool use, and multi-step planning despite identified benchmark issues.
Supports file system operations with intelligent code editing, secure command-line execution, and advanced web interaction.
Experimental support for multimodal capabilities including PDF, audio, image, video, and slides.
Features a WebSocket-based interface for interactive use with a React frontend, streaming operational events for responsive UX.

Maintenance & Community

The project acknowledges inspiration and component adaptation from AugmentCode, Manus, and Index Browser Use.
No specific community links (Discord/Slack) or roadmap are provided in the README.

Licensing & Compatibility

Licensed under Apache 2.0.
Compatible with commercial use and closed-source linking.

Limitations & Caveats

The project notes several issues encountered during GAIA benchmark evaluation, including annotation errors, outdated information, and language ambiguity, which may impact performance on specific benchmark tasks. Experimental multimodal features are noted.

Health Check

Last Commit

3 weeks ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

74 stars in the last 30 days