RL algorithms implementation with research-friendly features
Top 7.0% on sourcepulse
CleanRL provides high-quality, single-file implementations of popular Deep Reinforcement Learning algorithms, targeting researchers and practitioners who need clear, understandable, and reproducible code. It offers a research-friendly environment with features like Tensorboard logging, local reproducibility, and cloud integration, enabling efficient experimentation and prototyping.
How It Works
CleanRL's core philosophy is to encapsulate each algorithm variant within a single, standalone Python file. This approach prioritizes clarity and ease of understanding over modularity, allowing users to grasp all implementation details without navigating complex class hierarchies. This design choice facilitates rapid prototyping and debugging of advanced features.
Quick Start & Requirements
poetry install
or pip install -r requirements/requirements.txt
(with optional dependencies for specific environments like Atari, MuJoCo, Procgen, etc.).poetry run python cleanrl/ppo.py --env-id CartPole-v0 --total-timesteps 50000
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
CleanRL is not designed as a modular library and involves code duplication across algorithm implementations. The project is migrating to Gymnasium, with ongoing progress tracked in issue #277. Some optimizations, like envpool
for Atari, are Linux-specific.
3 weeks ago
1 day