openrl by OpenRL-Lab

RL framework for single/multi-agent, offline RL, self-play, and NLP tasks

Created 2 years ago

803 stars

Top 44.0% on SourcePulse

Project Summary

OpenRL is a unified, PyTorch-based reinforcement learning framework designed for researchers and practitioners. It simplifies the training of diverse RL tasks, including single-agent, multi-agent, offline RL, self-play, and natural language processing, offering a flexible and efficient platform.

How It Works

OpenRL employs a modular design with high-level abstractions, enabling users to train various tasks through a consistent interface. It supports a wide range of algorithms (PPO, MAPPO, DQN, SAC, DDPG, etc.) and integrates seamlessly with popular environments like Gymnasium, PettingZoo, and SMACv2. The framework also incorporates advanced features such as DeepSpeed for training acceleration and Hugging Face for model/dataset imports.

Quick Start & Requirements

Install: pip install openrl or conda install -c openrl openrl. Source install: git clone ... && cd openrl && pip install -e .
Prerequisites: PyTorch. GPU acceleration is supported via Docker.
Resources: Training on CartPole-v1 takes seconds on a laptop.
Docs: OpenRL Documentation

Highlighted Details

Supports NLP/RLHF, Multi-agent, Self-Play, Offline RL, and DeepSpeed, differentiating it from libraries like Stable-Baselines3 and Ray/RLlib.
Features an "Arena" for competitive agent evaluation and Hugging Face integration for models and datasets.
Includes support for LSTM, GRU, Transformer models, and automatic mixed precision training.
Offers extensive environment support (Gymnasium, PettingZoo, SMACv2, Isaac Gym, etc.) and popular visualization tools (wandb, tensorboardX).

Maintenance & Community

Maintained by OpenRL-Lab, with active development and community contributions welcomed. Community channels include QQ, Slack, and Discord.

Licensing & Compatibility

Licensed under Apache 2.0, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The main branch is under active development; a stable branch is available for general use. The framework is still evolving, with ongoing documentation updates.

openrl by OpenRL-Lab

Explore Similar Projects

arena by diambra

machin by iffiX

pytorch-rl by navneet-nmk

alf by HorizonRobotics

stable-baselines by Stable-Baselines-Team

TextArena by TextArena

reinforcement-learning-algorithms by TianhongDai

pytorch-rl by jingweiz

DI-engine by opendilab

PettingZoo by Farama-Foundation

football by google-research

baselines by openai