Discover and explore top open-source AI tools and projects—updated daily.
AI agent toolkit for real-world web environments
Top 73.9% on SourcePulse
AGI SDK is a Python toolkit for building, evaluating, and benchmarking AI browser agents against real-world web applications. It targets AI researchers and developers seeking to test agent capabilities in realistic, complex environments, offering a standardized benchmark and a platform for comparing agent performance.
How It Works
The SDK provides high-fidelity, deterministic web application clones (e.g., Amazon, DoorDash) built with modern web stacks. It uses a harness that orchestrates agent interactions with these environments, providing structured observations (DOM, accessibility tree, screenshots) and accepting actions as function calls. This approach allows for reproducible evaluation and direct comparison of agents on standardized tasks.
Quick Start & Requirements
pip install agisdk
playwright install --force
OPENAI_API_KEY
(or other supported LLM providers)brew install --cask playwright
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 days ago
Inactive