CORL by tinkoff-ai

Offline RL library with single-file implementations of SOTA algorithms

Created 3 years ago

1,302 stars

Top 30.6% on SourcePulse

Project Summary

CORL is a Python library providing single-file, research-friendly implementations of state-of-the-art Offline and Offline-to-Online Reinforcement Learning algorithms. It aims to simplify experimentation and reproducibility for researchers and practitioners in offline RL, offering a clean codebase inspired by the popular cleanrl library.

How It Works

CORL implements algorithms as self-contained Python files, promoting clarity and ease of modification. Each implementation is designed for reproducibility and includes integration with Weights and Biases for experiment tracking. The library covers a wide range of algorithms, including conservative Q-learning (CQL), implicit Q-learning (IQL), decision transformers (DT), and more, with benchmarks provided on standard datasets like D4RL.

Quick Start & Requirements

Install via pip install -r requirements/requirements_dev.txt or use Docker.
Requires Python and potentially CUDA-enabled GPUs for training.
Official documentation and benchmarks are available.

Highlighted Details

Single-file implementations for 13 SOTA ORL algorithms.
Benchmarked performance on Gym-MuJoCo, Maze2d, Antmaze, and Adroit datasets.
Weights and Biases integration for experiment tracking.
Includes both offline and offline-to-online variants for several algorithms.

Maintenance & Community

The project is maintained by Tinkoff AI. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The library is released under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

While comprehensive, the README notes that benchmark results can vary significantly between papers and implementations, suggesting users verify results. The project also points to a separate library, Katakomba, for discrete control tasks.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

17 stars in the last 30 days