Framework for training LLM web agents using reinforcement learning
Top 70.1% on sourcepulse
WebRL provides a framework for training Large Language Model (LLM) web agents using a self-evolving online curriculum reinforcement learning technique. It targets the WebArena environment, enabling agents to learn complex web navigation and task completion skills. The project is suitable for researchers and developers focused on autonomous agents and LLM-powered web interaction.
How It Works
WebRL employs a reinforcement learning approach where the agent learns to interact with web environments. A key innovation is the self-evolving online curriculum, which dynamically adjusts the learning tasks to optimize agent progress. This curriculum generation, coupled with an Outcome-supervised Reward Model (ORM), allows for efficient and effective training of web agents.
Quick Start & Requirements
python==3.10
), activate it, cd WebRL
, and run pip install -e .
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is presented as a research preprint, and its stability, long-term maintenance, and production readiness are not yet established. Specific hardware requirements for training and inference are not detailed.
1 month ago
1 week