AgentGym-RL  by WooooDyy

Train LLM agents for long-horizon, multi-turn decision-making

Created 1 month ago
432 stars

Top 68.8% on SourcePulse

GitHubView on GitHub
Project Summary

AgentGym-RL provides a framework for training LLM agents in multi-turn, long-horizon decision-making tasks using Reinforcement Learning. It targets researchers and developers aiming to enhance LLM agent capabilities, offering substantial performance gains that match or surpass commercial models across diverse real-world scenarios. [1, 2]

How It Works

The framework features a modular design (Environment, Agent, Training). Environments offer diverse scenarios (WebArena, Deep Search, TextCraft, BabyAI, SciWorld) via HTTP. The Agent module handles reasoning, and the Training module supports RL algorithms (PPO, GRPO, REINFORCE++) and other methods (SFT, DPO). A key innovation, ScalingInter-RL, progressively scales interaction horizons during training. This method balances exploration/exploitation, enabling stable and efficient optimization for long-horizon tasks.

Quick Start & Requirements

Installation involves pip install -e . for the core library and pip install transformers==4.51.3. Recommended environment: CUDA 12.4, PyTorch 2.4, Python 3.10. Specific installation steps for flash-attn are provided. Users must also download the AgentGym-RL-Data-ID dataset from Hugging Face and launch a separate environment server. Links to the paper [1], project page [2], and dataset [3] are available.

Highlighted Details

Qwen2.5-7B models trained with AgentGym-RL and ScalingInter-RL match or surpass top proprietary models (GPT-4o, Gemini-2.5-Pro) on 27 tasks. The framework supports diverse environments including WebArena, Deep Search, TextCraft, BabyAI, and SciWorld. It integrates multiple RL algorithms and provides a visualized user interface for trajectory analysis.

Maintenance & Community

The README acknowledges underlying projects but offers no specific details on current maintenance status, active contributors, or community channels like Discord or Slack.

Licensing & Compatibility

The provided README does not specify the software license, leaving commercial use or closed-source integration compatibility unclear.

Limitations & Caveats

Setup requires specific, potentially recent versions of CUDA, PyTorch, and Python. Reliance on external environment servers and the complexity of the training pipeline may present adoption challenges.


[1] https://arxiv.org/abs/2509.08755 [2] https://github.com/WoooDyy/AgentGym-RL [3] https://huggingface.co/datasets/AgentGym-RL-Data-ID

Health Check
Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
2
Star History
197 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect).

GITM by OpenGVLab

0.2%
635
LLM agent for Minecraft open-world environments
Created 2 years ago
Updated 2 years ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Edward Sun Edward Sun(Research Scientist at Meta Superintelligence Lab).

Eureka by eureka-research

0.1%
3k
LLM-based reward design for reinforcement learning
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.