stable-baselines  by Stable-Baselines-Team

RL algorithm implementations, a fork of OpenAI Baselines (maintenance mode)

Created 6 years ago
302 stars

Top 88.4% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides a fork of OpenAI Baselines, offering refactored and improved implementations of reinforcement learning algorithms. It targets researchers and practitioners seeking to easily replicate, refine, and benchmark RL algorithms, with a focus on usability and clear documentation.

How It Works

Stable-Baselines features a unified structure for all algorithms, adhering to PEP8 standards and offering extensive documentation and testing. It introduces additional algorithms like SAC and TD3, along with HER support for several algorithms, enhancing its capabilities beyond the original OpenAI Baselines.

Quick Start & Requirements

Highlighted Details

  • Implements state-of-the-art RL methods including A2C, PPO2, SAC, TD3, and more.
  • Offers support for various action spaces (Box, Discrete, MultiDiscrete, MultiBinary).
  • Provides a scikit-learn-like API for ease of use.
  • Includes a collection of over 100 pre-trained RL agents in the RL Baselines Zoo.

Maintenance & Community

This package is in maintenance mode, with a recommendation to use Stable-Baselines3 (SB3). Key maintainers include Ashley Hill, Antonin Raffin, and others.

Licensing & Compatibility

The repository is available under the MIT license, permitting commercial use and linking with closed-source projects.

Limitations & Caveats

The package is in maintenance mode and does not support TensorFlow 2. Users are directed to Stable-Baselines3 for up-to-date implementations and support.

Health Check
Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 30 days

Explore Similar Projects

Starred by Deepak Pathak Deepak Pathak(Cofounder of Skild AI; Professor at CMU), Anastasis Germanidis Anastasis Germanidis(Cofounder of Runway), and
1 more.

deer by VinF

0%
489
Deep reinforcement learning framework
Created 9 years ago
Updated 2 months ago
Starred by Hanlin Tang Hanlin Tang(CTO Neural Networks at Databricks; Cofounder of MosaicML), Amanpreet Singh Amanpreet Singh(Cofounder of Contextual AI), and
2 more.

coach by IntelLabs

0%
2k
Reinforcement learning framework for experimentation (discontinued)
Created 8 years ago
Updated 2 years ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Gabriel Almeida Gabriel Almeida(Cofounder of Langflow), and
5 more.

stable-baselines3 by DLR-RM

0.4%
12k
PyTorch library for reinforcement learning algorithm implementations
Created 5 years ago
Updated 5 days ago
Feedback? Help us improve.