PyTorch-RL by Khrylx

RL algorithms in PyTorch

Created 8 years ago

1,267 stars

Top 31.2% on SourcePulse

3 Experts Love This Project

Chenlin Meng

Cofounder of Pika

Jiaming Song

Chief Scientist at Luma AI

Jerry Tworek

VP Research at OpenAI

Project Summary

This repository provides PyTorch implementations of popular deep reinforcement learning algorithms: the policy gradient methods TRPO, PPO, and A2C, plus Generative Adversarial Imitation Learning (GAIL). It targets RL researchers and practitioners who need a baseline implementation of these algorithms. Its main selling points are fast sample collection and support for both discrete and continuous action spaces.

How It Works

The implementation uses PyTorch for its neural network components and collects samples from environments in parallel worker processes, which is reported to speed up sample collection by up to 8x. A notable feature is a fast Fisher vector product calculation for TRPO; TRPO relies on these products to compute its constrained policy update without building the full Fisher information matrix. The code is organized by algorithm and includes example scripts for running each one.
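The summary does not spell out how the Fisher vector product is computed. As a rough illustration of where such a product is used, TRPO typically solves F x = g for its step direction with conjugate gradient, which only needs a function that returns F v for a given v. The sketch below is pure Python and is not taken from the repository: conjugate_gradient, toy_fvp, and the 2x2 matrix are hypothetical names and values chosen for illustration.

```python
from typing import Callable, List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def conjugate_gradient(
    fvp: Callable[[Vector], Vector],
    g: Vector,
    iters: int = 10,
    tol: float = 1e-10,
) -> Vector:
    """Approximately solve F x = g using only products F v (never F itself)."""
    x = [0.0] * len(g)
    r = list(g)  # residual g - F x, which equals g because x starts at zero
    p = list(g)  # current search direction
    rs_old = dot(r, r)
    for _ in range(iters):
        if rs_old < tol:  # residual is already negligible
            break
        fp = fvp(p)
        alpha = rs_old / dot(p, fp)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * fi for ri, fi in zip(r, fp)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs_old) * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x


def toy_fvp(v: Vector) -> Vector:
    """Hypothetical stand-in for a Fisher vector product: F = diag(2, 4).

    A real TRPO implementation would compute F v from the policy's KL
    divergence rather than from an explicit matrix.
    """
    return [2.0 * v[0], 4.0 * v[1]]


# F x = g with g = [2, 4] has the exact solution x = [1, 1].
step_direction = conjugate_gradient(toy_fvp, [2.0, 4.0])
print(step_direction)
```

Because the solver only ever calls fvp, any speedup in the Fisher vector product translates directly into a faster TRPO update.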

Quick Start & Requirements

Install via pip install -r requirements.txt (after cloning).
Requires PyTorch 0.4 (a separate branch supports 0.3), mujoco-py, and gym.
For best performance, especially on Linux, run export OMP_NUM_THREADS=1 before launching a script.
Official examples: examples/trpo_gym.py, examples/ppo_gym.py, examples/a2c_gym.py, gail/gail_gym.py.
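If exporting the variable in the shell is inconvenient, the same OMP_NUM_THREADS setting can usually be applied from Python. This is a minimal sketch and an assumption on our part, not something the README describes; the assignment should run before any library that uses OpenMP (such as PyTorch) is imported.

```python
import os

# Assumption: OpenMP-backed libraries read this variable when they are first
# loaded, so set it before importing torch or numpy in the same process.
os.environ["OMP_NUM_THREADS"] = "1"

print(os.environ["OMP_NUM_THREADS"])
```

Child processes started afterwards inherit the environment, so worker processes used for sample collection would see the same value.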

Highlighted Details

Implements TRPO, PPO, A2C, and GAIL.
Supports discrete and continuous action spaces.
Achieves up to 8x speedup using multiprocessing for sample collection.
Features a fast Fisher vector product calculation for TRPO.
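The multiprocessing speedup listed above comes from running environment rollouts in several processes at once and merging the results into one batch. The sketch below shows that general pattern with Python's standard multiprocessing module and a toy one-dimensional random walk in place of a gym environment. It is not the repository's code: rollout_worker, collect_samples, and the reward rule are hypothetical.

```python
import multiprocessing as mp
import random
from typing import List, Tuple

Transition = Tuple[int, int, float]  # (state, action, reward)


def rollout_worker(job: Tuple[int, int]) -> List[Transition]:
    """Collect num_steps transitions from a toy random-walk environment."""
    seed, num_steps = job
    rng = random.Random(seed)  # per-worker seed keeps rollouts reproducible
    state = 0
    batch: List[Transition] = []
    for _ in range(num_steps):
        action = rng.choice([-1, 1])           # stand-in for sampling a policy
        reward = -float(abs(state + action))   # toy reward: stay near zero
        batch.append((state, action, reward))
        state += action
    return batch


def collect_samples(
    total_steps: int, num_workers: int, base_seed: int = 0
) -> List[Transition]:
    """Split total_steps across workers and merge their transitions."""
    steps_per_worker = total_steps // num_workers
    jobs = [(base_seed + i, steps_per_worker) for i in range(num_workers)]
    if num_workers == 1:
        chunks = [rollout_worker(jobs[0])]  # no extra process needed
    else:
        with mp.Pool(processes=num_workers) as pool:
            chunks = pool.map(rollout_worker, jobs)
    # Flatten the per-worker lists into a single training batch.
    return [t for chunk in chunks for t in chunk]
```

When running with more than one worker, call collect_samples from inside an if __name__ == "__main__": guard so that worker processes can import the module safely. The achievable speedup depends on how expensive each environment step is relative to the cost of passing results between processes.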

Maintenance & Community

The repository is a personal project by Khrylx. There are no explicit mentions of community channels, active development, or partnerships in the README.

Licensing & Compatibility

The README does not explicitly state a license. The code references openai/baselines, which is MIT licensed, but this does not guarantee the license of this specific repository. Compatibility for commercial use is not specified.

Limitations & Caveats

The code is noted to work with PyTorch 0.4, with a separate branch for PyTorch 0.3, so it may need changes to run on newer PyTorch releases. The project's maintenance status and community support are unclear from the README.

Health Check

Last Commit

4 years ago

Responsiveness

Inactive

Star History

6 stars in the last 30 days