chainerrl by chainer

Deep RL library for algorithm experimentation

created 8 years ago
1,197 stars

Top 33.4% on sourcepulse

Project Summary

ChainerRL is a Python library for deep reinforcement learning, offering a comprehensive suite of state-of-the-art algorithms and techniques. It targets researchers and practitioners in RL, providing a flexible framework built on Chainer for developing and experimenting with agents.

How It Works

ChainerRL implements a wide array of RL algorithms, including DQN variants, DDPG, A3C, PPO, and SAC, supporting both discrete and continuous action spaces, recurrent models, and batch/asynchronous training where applicable. It leverages Chainer's flexibility for defining neural network architectures and training loops, enabling efficient implementation and customization of RL agents.
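The value-based agents listed above all approximate the same temporal-difference target that tabular Q-learning computes exactly. As a framework-free sketch of that core update (illustrative names only, not ChainerRL's API):

```python
# Minimal tabular Q-learning update, illustrating the TD target that
# DQN-style agents approximate with neural networks.
# Framework-free sketch; not ChainerRL's actual API.

def q_learning_update(q, state, action, reward, next_state, done,
                      alpha=0.5, gamma=0.99):
    """Update a dict-of-dicts Q table in place and return the TD error."""
    best_next = 0.0 if done else max(q[next_state].values())
    target = reward + gamma * best_next          # r + gamma * max_a' Q(s', a')
    td_error = target - q[state][action]
    q[state][action] += alpha * td_error
    return td_error

# Tiny two-state example: from state 0, action "right" leads to state 1,
# where "right" is already known to be worth 1.0.
q = {0: {"left": 0.0, "right": 0.0}, 1: {"left": 0.0, "right": 1.0}}
err = q_learning_update(q, state=0, action="right", reward=0.0,
                        next_state=1, done=False)
# TD error is 0.99; Q(0, "right") moves halfway toward it (alpha=0.5).
```

ChainerRL replaces the table with a Chainer-defined network and the exact `max` with the network's greedy action, but the target being regressed toward is the same.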

Quick Start & Requirements
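ChainerRL is published on PyPI, so installation follows the usual pattern (Chainer is pulled in as a dependency; exact version pins may matter since the project is no longer updated):

```shell
# Install ChainerRL from PyPI; this also installs Chainer.
# Note: the project is inactive, so pinned dependency versions may be old.
pip install chainerrl
```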

Highlighted Details

  • Implements advanced techniques like NoisyNet, Prioritized Experience Replay, Dueling Networks, and Normalized Advantage Function.
  • Supports visualization tools for inspecting and debugging agent behavior.
  • Compatible with any environment adhering to the OpenAI Gym interface.
  • Offers implementations for both synchronous (A2C) and asynchronous (A3C) training variants.
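The Gym-compatibility point above means any object exposing `reset()` and `step(action)` with the standard return shape can be driven by an agent. A minimal, dependency-free toy environment in that shape (an illustration, not part of the library):

```python
import random

class CoinFlipEnv:
    """Minimal OpenAI-Gym-style environment: guess a coin flip.

    Anything exposing reset()/step() with this signature follows the
    Gym interface the README targets. This toy class is illustrative,
    not part of ChainerRL.
    """

    def __init__(self, seed=0):
        self._rng = random.Random(seed)
        self._coin = None

    def reset(self):
        """Start a new one-step episode; return the (dummy) observation."""
        self._coin = self._rng.randint(0, 1)
        return 0

    def step(self, action):
        """Return (observation, reward, done, info) per the Gym contract."""
        reward = 1.0 if action == self._coin else 0.0
        done = True  # every episode is a single guess
        return 0, reward, done, {}

env = CoinFlipEnv(seed=42)
obs = env.reset()
_, reward, done, _ = env.step(1)  # done is True; reward is 0.0 or 1.0
```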

Maintenance & Community

The project is associated with the Chainer deep learning framework. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Permissive MIT license allows for commercial use and integration with closed-source projects.

Limitations & Caveats

The library is built on Chainer, whose development has ended: the framework is in maintenance-only mode, and its developers have shifted to PyTorch (with CuPy continuing as a standalone array library). While ChainerRL itself remains functional, the underlying framework's status limits long-term support and integration with newer deep learning ecosystems.

Health Check
Last commit

4 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
11 stars in the last 90 days
