RL library for training LLM agents via GRPO
Top 12.7% on sourcepulse
OpenPipe ART is an open-source library designed to enhance the performance of multi-turn LLM agents through Reinforcement Learning (RL), specifically using the GRPO algorithm. It targets developers and researchers looking to fine-tune LLMs for agentic workflows with minimal code modification, offering a streamlined approach to RL training.
How It Works
ART employs a client-server architecture. The client integrates with existing agent codebases, routing LLM requests to the ART server. The server, running independently, manages model inference (via vLLM with LoRA) and the GRPO training loop. It collects agent trajectories, assigns rewards, trains the model, and updates the inference endpoint, creating a continuous feedback loop for agent improvement. This separation allows users to focus on agent logic while ART handles the RL complexities.
Quick Start & Requirements
pip install openpipe
Highlighted Details
Maintenance & Community
ART is under active development with contributions welcomed. Community interaction is encouraged via Discord.
Licensing & Compatibility
Limitations & Caveats
Gemma 3 models are explicitly listed as unsupported. The library is in active development, suggesting potential for breaking changes or evolving features.
1 day ago
1 day