PyTorch implementation of TD3+BC, an offline RL method
This repository provides a minimalist PyTorch implementation of TD3+BC, a simple yet effective variant of the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for offline reinforcement learning. It is designed for researchers and practitioners seeking a straightforward approach to offline RL without complex architectural changes or hyperparameter tuning.
How It Works
TD3+BC extends standard TD3 with two modifications: a weighted behavior cloning term is added to the policy update, and states are normalized using statistics of the offline dataset. The behavior cloning term regularizes the policy toward the actions in the dataset, countering the value overestimation on out-of-distribution actions that hampers offline RL, while TD3 itself supplies a stable learner. No changes to the underlying network architecture or hyperparameters are required.
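As a rough illustration, the sketch below shows both changes in PyTorch. The names (`actor`, `critic`) and the weighting scheme with `alpha = 2.5` follow the TD3+BC paper; they are not necessarily this repository's exact API.

```python
import torch
import torch.nn.functional as F

def normalize_states(states, eps=1e-3):
    # Normalize each state feature with the offline dataset's mean and std.
    mean = states.mean(0, keepdim=True)
    std = states.std(0, keepdim=True) + eps
    return (states - mean) / std

def td3_bc_actor_loss(actor, critic, states, actions, alpha=2.5):
    # TD3+BC policy loss: maximize Q on the policy's actions while
    # regressing those actions toward the dataset actions (behavior cloning).
    pi = actor(states)
    q = critic(states, pi)
    # lambda rescales the Q term so it stays on the same scale as the BC term.
    lmbda = alpha / q.abs().mean().detach()
    return -lmbda * q.mean() + F.mse_loss(pi, actions)
```

Everything else (twin critics, target-policy smoothing, delayed policy updates) is unchanged from TD3.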
Quick Start & Requirements
Run all experiments with the provided script:

```bash
./run_experiments.sh
```
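To launch a single experiment, the original TD3+BC release exposes a `main.py` entry point; the flags below are assumptions based on that code rather than a documented interface of this repository:

```bash
# Hypothetical single run; entry point and flags assumed from the original TD3+BC code.
python main.py --env halfcheetah-medium-v0 --seed 0
```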
Limitations & Caveats
The implementation pins old dependency versions (MuJoCo 1.50, mujoco-py 1.50.1.1, OpenAI gym 0.17.0, PyTorch 1.4.0, Python 3.6), which may make setup difficult on newer toolchains.