RL-Factory  by Simple-Efficient

RL post-training framework for agentic learning

Created 3 months ago
1,505 stars

Top 27.5% on SourcePulse

GitHubView on GitHub
Project Summary

RL-Factory is an open-source framework designed for efficient Reinforcement Learning (RL) post-training of agent models, particularly for tool-use scenarios. It targets researchers and developers aiming to simplify the process of training agents that interact with tools, offering a 2x speedup in training time and a focus on ease of use through decoupled environment and reward function definitions.

How It Works

RL-Factory decouples the environment from the RL training process, allowing users to define tool configurations and reward functions with minimal code. It supports asynchronous, parallel tool-calling for enhanced training efficiency and features multi-turn tool-calling, model-based rewards, and native support for training with Qwen3 models and MCP tools. This architecture aims to let users focus on reward logic and tool setup while enabling hardcore developers to optimize training efficiency and model performance.

Quick Start & Requirements

  • Install: pip3 install -e . --no-deps (after installing other dependencies).
  • Prerequisites: CUDA >= 12.0 (12.4 recommended), Python >= 3.10 (3.10 recommended), vllm >= 0.8.3 (0.8.5 recommended for Qwen3).
  • Dependencies: Includes accelerate, bitsandbytes, deepspeed, flash-attn, peft, ray, torch, transformers, vllm, qwen-agent, llama_index, faiss-gpu-cu12 (optional), nvidia-cublas-cu12 (optional).
  • Setup: Requires significant dependency installation.
  • Docs: Tutorial, Installation, Framework Design.

Highlighted Details

  • Achieves 2x faster training compared to Search-R1, demonstrated with Qwen3 models.
  • Supports Qwen3 models out-of-the-box, showing superior tool-calling capabilities without SFT.
  • Enables flexible reward definition via rules, model-judging, or tools.
  • Facilitates seamless tool integration through configuration files.
  • Future plans include a WebUI for data processing, configuration, and project management.

Maintenance & Community

  • Fast release cycle planned for new features.
  • Contributions are welcomed via GitHub issues or direct contact (chaijiajun@meituan.com, gjyin@outlook.com).
  • Benefits from verl, Search-R1, and Qwen-Agent.

Licensing & Compatibility

  • The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Currently, only Qwen models have been tested. The framework is under rapid development, with a WebUI and broader model support planned for future releases. Optional dependencies like faiss-gpu-cu12 may be needed to resolve specific training issues.

Health Check
Last Commit

11 hours ago

Responsiveness

1 day

Pull Requests (30d)
37
Issues (30d)
5
Star History
154 stars in the last 30 days

Explore Similar Projects

Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research) and Will Brown Will Brown(Research Lead at Prime Intellect).

agent-lightning by microsoft

6.0%
2k
Train any AI agent with rollouts and feedback
Created 3 months ago
Updated 2 days ago
Starred by Evan Hubinger Evan Hubinger(Head of Alignment Stress-Testing at Anthropic), Jiayi Pan Jiayi Pan(Author of SWE-Gym; MTS at xAI), and
1 more.

rl by pytorch

0.4%
3k
PyTorch library for reinforcement learning research
Created 3 years ago
Updated 2 days ago
Starred by Hanlin Tang Hanlin Tang(CTO Neural Networks at Databricks; Cofounder of MosaicML), Amanpreet Singh Amanpreet Singh(Cofounder of Contextual AI), and
2 more.

coach by IntelLabs

0%
2k
Reinforcement learning framework for experimentation (discontinued)
Created 8 years ago
Updated 2 years ago
Feedback? Help us improve.