DDPG by floodsung

DDPG implementation for continuous control tasks

Created 9 years ago

572 stars

Top 56.3% on SourcePulse

1 Expert Loves This Project

truell20

Cofounder of Cursor

Project Summary

This repository provides a Python reimplementation of the Deep Deterministic Policy Gradient (DDPG) algorithm, a popular deep reinforcement learning method for continuous control tasks. It is designed for researchers and practitioners working with OpenAI Gym environments and TensorFlow.

How It Works

The implementation leverages TensorFlow for building and training the actor and critic networks. It follows the DDPG paper's architecture, with a key detail being the successful application of Batch Normalization to the actor network, though its implementation on the critic network is noted as problematic.

Quick Start & Requirements

Primary install / run command:

git clone https://github.com/songrotek/DDPG.git
cd DDPG
python gym_ddpg.py

Prerequisites: OpenAI Gym, TensorFlow.
To change environments, modify ENV_NAME in gym_ddpg.py. To change network architecture, adjust imports in ddpg.py.

Highlighted Details

Implements DDPG for continuous control tasks.
Utilizes OpenAI Gym for environment simulation.
Batch Normalization is successfully applied to the actor network.

Maintenance & Community

No specific information on contributors, sponsorships, or community channels is provided in the README.

Licensing & Compatibility

The README does not explicitly state a license.

Limitations & Caveats

Batch Normalization on the critic network is reported as problematic. Several Mujoco environments (InvertedPendulum, InvertedDoublePendulum, Hopper) are noted as unsolved within the context of this implementation.

Health Check

Last Commit

4 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

0 stars in the last 30 days

Explore Similar Projects

self-imitation-learning by junhyukoh

TensorFlow implementation of Self-Imitation Learning (ICML 2018) research paper

Created 7 years ago

Updated 5 years ago

Starred by

Soumith Chintala

Soumith Chintala(Coauthor of PyTorch).

pytorch-REINFORCE by chingyaoc

PyTorch implementation of REINFORCE for control tasks

Created 8 years ago

Updated 8 years ago

Starred by

Jerry Tworek

Jerry Tworek(VP Research at OpenAI).

pytorch-rl by navneet-nmk

PyTorch SDK for deep reinforcement learning algorithms

Created 7 years ago

Updated 6 years ago

Starred by

Deshraj Yadav

Deshraj Yadav(Cofounder of Mem0).

PyTorch-ActorCriticRL by vy007vikas

PyTorch implementation of DDPG for continuous RL

Created 8 years ago

Updated 4 years ago

Starred by

Michael Truell

Michael Truell(Cofounder of Cursor).

ddpg-aigym by stevenpjg

DDPG implementation for continuous control in OpenAI Gym

Created 9 years ago

Updated 7 years ago

Multi-Agent-Deep-Deterministic-Policy-Gradients by philtabor

MADDPG implementation in PyTorch

Created 4 years ago

Updated 4 years ago

Starred by

Adam Paszke

Adam Paszke(Coauthor of PyTorch),

Ross Wightman

Ross Wightman(Author of timm; CV at Hugging Face), and

2 more.

pytorch-cpp-rl by Omegastick

RL framework using the PyTorch C++ frontend

Created 6 years ago

Updated 5 years ago

Reinforcement_Learning by pythonlessons

Reinforcement learning tutorials using TensorFlow

Created 6 years ago

Updated 2 years ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity) and

Chenlin Meng

Chenlin Meng(Cofounder of Pika).

pfrl by pfnet

PyTorch library for deep reinforcement learning research

Created 5 years ago

Updated 3 weeks ago

Starred by

Evan Hubinger

Evan Hubinger(Head of Alignment Stress-Testing at Anthropic) and

Jesse Clark

Jesse Clark(Cofounder of Marqo).

deep-rl by pemami4911

Deep reinforcement learning algorithms collection

Created 10 years ago

Updated 6 years ago

Starred by

Nathan Lambert

Nathan Lambert(Research Scientist at AI2),

Phil Wang

Phil Wang(Prolific Research Paper Implementer), and

1 more.

TD3 by sfujim

PyTorch implementation of TD3 for OpenAI gym tasks

Created 8 years ago

Updated 2 years ago

Starred by

Théophile Gervet

Théophile Gervet(Cofounder of Genesis AI),

Joshua Achiam

Joshua Achiam(Head of Mission Alignment at OpenAI), and

10 more.

pytorch-a2c-ppo-acktr-gail by ikostrikov

PyTorch implementations of reinforcement learning algorithms

Created 8 years ago

Updated 3 years ago

Feedback? Help us improve.