Discover and explore top open-source AI tools and projects—updated daily.
Surveying the landscape of agentic reinforcement learning for LLMs
New!
Top 49.5% on SourcePulse
This repository curates research papers on Agentic Reinforcement Learning (RL) for Large Language Models (LLMs), serving as a comprehensive survey. It categorizes advancements across key agent tasks like search, coding, and mathematics, offering researchers and practitioners a structured overview of LLM-agent RL techniques and their benefits.
How It Works
As a survey, this repository organizes and presents findings from academic literature on LLM agents enhanced by RL. It details various RL algorithms (PPO, DPO, GRPO families), their mechanisms, and objectives, categorized by task domains (Search, Code, Math, GUI, etc.). It also lists relevant environments and frameworks, providing links to the original research.
Quick Start & Requirements
This repository is a curated list of research resources, not a runnable project. Users should refer to the linked survey paper (https://arxiv.org/abs/2509.02547) and individual papers for setup and execution details of specific agentic LLM RL implementations.
Highlighted Details
Maintenance & Community
The provided README content does not contain information on maintainers, contributors, community channels, sponsorships, or a project roadmap.
Licensing & Compatibility
No license information is specified in the README. Licensing pertains to the individual research papers and projects linked within the survey.
Limitations & Caveats
This survey represents a snapshot of agentic LLM RL research. Some sections are marked "TO BE ADDED," indicating incompleteness. The field's rapid evolution means new techniques emerge frequently, and this survey may not capture the latest advancements.
1 week ago
Inactive