PyTorch library for deep reinforcement learning algorithms
Top 11.4% on sourcepulse
This repository provides PyTorch implementations of classic and state-of-the-art deep reinforcement learning algorithms, targeting researchers and practitioners seeking clear code examples for learning and experimentation. It aims to offer a comprehensive collection of algorithms, including DQN, AC, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, and TD3, with ongoing development and additions of new methods.
How It Works
The project implements various deep reinforcement learning algorithms using PyTorch, a popular deep learning framework. It focuses on providing clear, understandable code for each algorithm, facilitating learning and modification. The implementations cover a range of approaches, from value-based methods like DQN to policy-based methods like REINFORCE and actor-critic methods, offering a broad spectrum of RL techniques.
Quick Start & Requirements
pip install -r requirements.txt
(after installing PyTorch and gym separately).tensorboardX
, gym
>= 0.10, PyTorch >= 0.4. Note: Python 3.7 is not supported due to TensorFlow compatibility.python TD3_BipedalWalker-v2.py --mode test
Highlighted Details
pytorch_MountainCar-v0.py
and TD3_BipedalWalker-v2.py
.Maintenance & Community
The repository is marked as "Active (under active development, breaking changes may occur)". No specific community links (Discord, Slack) or notable contributors are mentioned in the README.
Licensing & Compatibility
The README does not explicitly state a license. Given the nature of open-source projects and the lack of explicit mention, users should assume standard open-source licensing or inquire with the maintainer. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project is under active development, meaning breaking changes may occur. It requires older Python versions (<= 3.6) and PyTorch versions (>= 0.4), which might pose compatibility challenges with newer environments. Some implementations (SAC, TD3) are noted as not being from the original paper authors.
2 years ago
1+ week