hindsight-experience-replay by TianhongDai

PyTorch implementation of Hindsight Experience Replay (HER)

Created 7 years ago

439 stars

Top 68.0% on SourcePulse

Project Summary

This repository provides a PyTorch implementation of Hindsight Experience Replay (HER), a technique designed to improve sample efficiency in reinforcement learning, particularly for sparse reward tasks. It targets researchers and practitioners working with robotic manipulation environments, offering a way to accelerate learning by relabeling past experiences.

How It Works

HER enhances standard off-policy RL algorithms by allowing agents to learn from failed attempts. When an episode finishes without achieving the intended goal, HER replays the trajectory, but with a different, achieved state designated as the new "desired goal." This strategy effectively turns failures into learning opportunities, enabling the agent to learn from states it actually visited, even if the original goal was not met.

Quick Start & Requirements

Install via pip (specific commands not provided, but dependencies are listed).
Requirements: Python 3.5.2, openai-gym 0.12.5, mujoco-py 1.50.1.56, pytorch 1.0.0, mpi4py.
GPU acceleration is supported via a --cuda flag but not recommended without a powerful machine.
Setup involves installing dependencies and potentially downloading pre-trained models from Google Drive.

Highlighted Details

Implements HER for OpenAI Gym's Fetch robotic environments (Reach, Push, PickAndPlace, Slide).
Supports multi-environment execution per MPI process for faster training.
Includes plotting and demo capabilities for visualizing training performance and agent behavior.
Pre-trained models are available for download.

Maintenance & Community

The project appears to be a personal implementation by TianhongDai.
No explicit community channels (Discord, Slack) or roadmap are mentioned in the README.

Licensing & Compatibility

The README does not explicitly state a license. Given the dependencies and typical RL research practices, it's likely intended for research use. Commercial use would require clarification.

Limitations & Caveats

The README notes that GPU usage is not recommended without a powerful machine.
Specific versions of mujoco-py and pytorch are recommended to avoid potential bugs and data type errors, suggesting potential compatibility issues with newer versions.

Health Check

Last Commit

4 years ago

Responsiveness

Inactive

Pull Requests (30d)

1

Issues (30d)

0

Star History

1 stars in the last 30 days

Explore Similar Projects

Starred by

Jerry Tworek

Jerry Tworek(VP Research at OpenAI).

pytorch-rl by navneet-nmk

PyTorch SDK for deep reinforcement learning algorithms

Created 7 years ago

Updated 6 years ago

Starred by

Shizhe Diao

Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA) and

Wing Lian

Wing Lian(Founder of Axolotl AI).

MLGym by facebookresearch

Gym environment for ML research agents

Created 10 months ago

Updated 5 months ago

alf by HorizonRobotics

Agent Learning Framework (ALF) is a PyTorch RL framework

Created 6 years ago

Updated 1 month ago

pg_travel by reinforcement-learning-kr

PyTorch implementations of Policy Gradient reinforcement learning algorithms

Created 7 years ago

Updated 6 years ago

gym-unrealcv by zfw1226

Gym environment for visual reinforcement learning research

Created 8 years ago

Updated 10 months ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity) and

Evan Hubinger

Evan Hubinger(Head of Alignment Stress-Testing at Anthropic).

coinrun by openai

RL research environment and training script

Created 7 years ago

Updated 2 years ago

simple_rl by david-abel

RL framework for experimenting with reinforcement learning in Python

Created 9 years ago

Updated 1 year ago

lets-do-irl by reinforcement-learning-kr

PyTorch implementations of inverse reinforcement learning algorithms

Created 7 years ago

Updated 2 years ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity),

Evan Hubinger

Evan Hubinger(Head of Alignment Stress-Testing at Anthropic), and

2 more.

random-network-distillation by openai

RL research paper code

Created 7 years ago

Updated 5 years ago

Starred by

Eric Ciarla

Eric Ciarla(Cofounder of Firecrawl).

snake-ai by linyiLYi

AI agent for playing the game "Snake"

Created 2 years ago

Updated 1 year ago

Starred by

Gregor Zunic

Gregor Zunic(Cofounder of Browser Use) and

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind).

rl-baselines3-zoo by DLR-RM

Training framework for Stable Baselines3 RL agents

Created 5 years ago

Updated 2 weeks ago

Starred by

Cristóbal Valenzuela

Cristóbal Valenzuela(Cofounder of Runway),

Andrew Trask

Andrew Trask(Research Scientist at Google DeepMind), and

3 more.

deep-reinforcement-learning by udacity

Educational resource for deep reinforcement learning algorithms

Created 7 years ago

Updated 2 years ago

Feedback? Help us improve.