rainbow-is-all-you-need by Curt-Park

Deep reinforcement learning tutorial from DQN to Rainbow

Created 7 years ago

2,025 stars

Top 21.1% on SourcePulse

View on GitHub

3 Experts Love This Project

Benjamin Bolte

Cofounder of K-Scale Labs

Chaoyu Yang

Founder of Bento

Lukas Biewald

Cofounder of Weights & Biases

Project Summary

This repository offers a comprehensive, step-by-step tutorial series designed to demystify Deep Reinforcement Learning (DRL) algorithms, guiding users from the foundational Deep Q-Networks (DQN) all the way to the sophisticated Rainbow agent. It is meticulously crafted for engineers, researchers, and power users aiming to gain a deep, practical understanding of state-of-the-art DRL techniques. The project distinguishes itself by pairing clear theoretical backgrounds with robust, object-oriented Python implementations, enabling direct learning and hands-on experimentation. A key benefit is the option for cloud-based execution via molab, which eliminates the need for complex local setup, thereby accelerating the learning curve and fostering immediate engagement with the material.

How It Works

The project meticulously structures learning into distinct, digestible chapters, each dedicated to a specific DRL component or algorithm. This progressive approach covers essential milestones: DQN, Double DQN, Prioritized Experience Replay, Dueling Networks, Noisy Networks, Categorical DQN, N-step Learning, and culminates in the integrated Rainbow and Rainbow IQN agents. Each chapter thoughtfully integrates theoretical explanations with corresponding, well-organized object-oriented Python code. The implementation prominently features marimo, a novel reactive Python notebook framework. marimo enhances reproducibility, simplifies version control through better Git diffing, and provides an intuitive interactive UI, all while maintaining the simplicity of running as a pure .py file. This design philosophy ensures that complex DRL concepts are broken down into manageable, executable, and easily understandable modules.

Quick Start & Requirements

To commence, users must first install mise (e.g., via curl https://mise.run | sh). Subsequently, clone the repository using git clone https://github.com/Curt-Park/rainbow-is-all-you-need.git and navigate into the project directory. The project setup is streamlined through make init and make setup commands, which automate Python environment creation and the installation of necessary packages. Experimentation with specific algorithms is straightforward: execute make run notebook=XX_algorithm.py, substituting XX_algorithm.py with the desired notebook file (e.g., 01_dqn.py). For users preferring to avoid local configuration, cloud execution is readily available via "Run in molab" links presented on preview pages, abstracting away hardware and software dependencies.

Highlighted Details

Effortless Cloud Execution: Seamlessly run all tutorials directly in the cloud via molab, eliminating the burden of local setup and configuration.
Advanced Notebook Technology: Leverages marimo for reactive, highly reproducible, and interactive Python notebooks that enhance the learning experience.
Comprehensive Algorithm Progression: Offers a detailed, step-by-step journey through the evolution of DRL algorithms, from foundational DQN to the state-of-the-art Rainbow agent.
Rich Theoretical Context: Provides direct links to seminal research papers for each covered algorithm, enabling deeper dives into the underlying theory.

Maintenance & Community

The project actively acknowledges its contributors, listing six individuals via a shield and adhering to the "all-contributors" specification, which encourages broad participation. Contributions of any nature are explicitly welcomed, and users are invited to submit issues or pull requests to foster project improvement and community engagement.

Licensing & Compatibility

Crucially, the repository's README does not specify a software license. This omission renders the licensing terms and any associated compatibility notes for commercial use or integration into closed-source projects undefined, posing a significant adoption blocker.

Limitations & Caveats

This project is fundamentally designed as an educational tutorial series, prioritizing learning over production deployment. Explicit hardware requirements, such as specific GPU models or CUDA versions, are not detailed, although such resources are often beneficial for accelerating DRL training. The most critical limitation for potential adopters is the absence of a declared software license, which prevents a clear understanding of usage rights, distribution, and modification permissions.

Health Check

Last Commit

3 weeks ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

2 stars in the last 30 days