CORL  by tinkoff-ai

Offline RL library with single-file implementations of SOTA algorithms

created 2 years ago
1,234 stars

Top 32.6% on sourcepulse

GitHubView on GitHub
Project Summary

CORL is a Python library providing single-file, research-friendly implementations of state-of-the-art Offline and Offline-to-Online Reinforcement Learning algorithms. It aims to simplify experimentation and reproducibility for researchers and practitioners in offline RL, offering a clean codebase inspired by the popular cleanrl library.

How It Works

CORL implements algorithms as self-contained Python files, promoting clarity and ease of modification. Each implementation is designed for reproducibility and includes integration with Weights and Biases for experiment tracking. The library covers a wide range of algorithms, including conservative Q-learning (CQL), implicit Q-learning (IQL), decision transformers (DT), and more, with benchmarks provided on standard datasets like D4RL.

Quick Start & Requirements

  • Install via pip install -r requirements/requirements_dev.txt or use Docker.
  • Requires Python and potentially CUDA-enabled GPUs for training.
  • Official documentation and benchmarks are available.

Highlighted Details

  • Single-file implementations for 13 SOTA ORL algorithms.
  • Benchmarked performance on Gym-MuJoCo, Maze2d, Antmaze, and Adroit datasets.
  • Weights and Biases integration for experiment tracking.
  • Includes both offline and offline-to-online variants for several algorithms.

Maintenance & Community

The project is maintained by Tinkoff AI. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The library is released under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

While comprehensive, the README notes that benchmark results can vary significantly between papers and implementations, suggesting users verify results. The project also points to a separate library, Katakomba, for discrete control tasks.

Health Check
Last commit

2 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
40 stars in the last 90 days

Explore Similar Projects

Starred by John Yang John Yang(Author of SWE-bench, SWE-agent), Lysandre Debut Lysandre Debut(Chief Open-Source Officer at Hugging Face), and
3 more.

cleanrl by vwxyzjn

0.5%
8k
RL algorithms implementation with research-friendly features
created 6 years ago
updated 3 weeks ago
Feedback? Help us improve.