CORL  by tinkoff-ai

Offline RL library with single-file implementations of SOTA algorithms

Created 3 years ago
1,255 stars

Top 31.5% on SourcePulse

GitHubView on GitHub
Project Summary

CORL is a Python library providing single-file, research-friendly implementations of state-of-the-art Offline and Offline-to-Online Reinforcement Learning algorithms. It aims to simplify experimentation and reproducibility for researchers and practitioners in offline RL, offering a clean codebase inspired by the popular cleanrl library.

How It Works

CORL implements algorithms as self-contained Python files, promoting clarity and ease of modification. Each implementation is designed for reproducibility and includes integration with Weights and Biases for experiment tracking. The library covers a wide range of algorithms, including conservative Q-learning (CQL), implicit Q-learning (IQL), decision transformers (DT), and more, with benchmarks provided on standard datasets like D4RL.

Quick Start & Requirements

  • Install via pip install -r requirements/requirements_dev.txt or use Docker.
  • Requires Python and potentially CUDA-enabled GPUs for training.
  • Official documentation and benchmarks are available.

Highlighted Details

  • Single-file implementations for 13 SOTA ORL algorithms.
  • Benchmarked performance on Gym-MuJoCo, Maze2d, Antmaze, and Adroit datasets.
  • Weights and Biases integration for experiment tracking.
  • Includes both offline and offline-to-online variants for several algorithms.

Maintenance & Community

The project is maintained by Tinkoff AI. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The library is released under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

While comprehensive, the README notes that benchmark results can vary significantly between papers and implementations, suggesting users verify results. The project also points to a separate library, Katakomba, for discrete control tasks.

Health Check
Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
16 stars in the last 30 days

Explore Similar Projects

Starred by Hanlin Tang Hanlin Tang(CTO Neural Networks at Databricks; Cofounder of MosaicML), Amanpreet Singh Amanpreet Singh(Cofounder of Contextual AI), and
2 more.

coach by IntelLabs

0%
2k
Reinforcement learning framework for experimentation (discontinued)
Created 8 years ago
Updated 2 years ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Pawel Garbacki Pawel Garbacki(Cofounder of Fireworks AI), and
6 more.

tianshou by thu-ml

0.2%
9k
PyTorch RL library for algorithm development and application
Created 7 years ago
Updated 1 week ago
Feedback? Help us improve.