Web agent framework for online development, training, and evaluation
Top 96.9% on sourcepulse
WebCanvas is an open-source framework designed for building, training, and evaluating web agents in dynamic, real-time online environments. It addresses the limitations of static or isolated web agent development by providing a comprehensive suite of tools for realistic interaction and assessment, targeting researchers and developers building sophisticated web-based AI agents.
How It Works
WebCanvas employs a "KEY-NODE" based approach for web trajectory annotation, enabling granular, phase-based assessment of agent performance. It integrates live web environments for realistic feedback, supporting dynamic evaluation functions and offering metrics like USD efficiency. The framework is built with plug-and-play modules for planning, observation, memory, reward, action execution, and evaluation, facilitating easy iteration on LLM-based web agents.
Quick Start & Requirements
conda create -n webcanvas python=3.11
, conda activate webcanvas
, pip install -r requirements.txt
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The framework is in early stages (v0.0.4) with several items still in the TODO list, including batch evaluation, captcha solving services, and integration with more benchmark datasets like WebArena. The README notes that experimental environment (e.g., Windows server, US-based servers) can significantly impact agent performance.
3 weeks ago
1 week