pytorch-ddpg by ghliu

PyTorch implementation of Deep Deterministic Policy Gradient (DDPG)

Created 8 years ago
618 stars

Top 53.3% on SourcePulse

Project Summary

This repository provides a PyTorch implementation of the Deep Deterministic Policy Gradient (DDPG) algorithm, a reinforcement learning method for continuous action spaces. It is suitable for researchers and practitioners looking to experiment with or apply DDPG in environments like Pendulum-v0 and MountainCarContinuous-v0.

How It Works

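The implementation follows the DDPG algorithm and uses an actor-critic architecture: a deterministic actor maps states to continuous actions, and a critic estimates the value of state-action pairs. A replay buffer provides experience replay and a random process supplies exploration noise; the utility functions for both are adapted from the keras-rl repository. Training on minibatches sampled from the replay buffer, rather than on consecutive transitions, decorrelates the updates, which helps stabilize learning in continuous control tasks.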
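The two supporting pieces named above, a replay buffer and an exploration noise process, can be sketched in a few lines of standard-library Python. This is a minimal illustration, not code taken from the repository or from keras-rl: the class names `ReplayBuffer` and `OrnsteinUhlenbeckNoise`, the helper `noisy_action`, and all default parameter values are assumptions chosen for the example. The Ornstein-Uhlenbeck process is used here because it is a common choice of random process for DDPG exploration.

```python
import math
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size FIFO store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity):
        # A bounded deque silently drops the oldest transition once full.
        self.storage = deque(maxlen=capacity)

    def append(self, state, action, reward, next_state, done):
        self.storage.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement breaks the temporal
        # correlation between consecutive transitions.
        return random.sample(list(self.storage), batch_size)

    def __len__(self):
        return len(self.storage)


class OrnsteinUhlenbeckNoise:
    """Temporally correlated noise (hypothetical defaults):
    x <- x + theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
    """

    def __init__(self, size, theta=0.15, mu=0.0, sigma=0.2, dt=1e-2, seed=None):
        self.size = size
        self.theta = theta
        self.mu = mu
        self.sigma = sigma
        self.dt = dt
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        # Restart the process at its long-run mean.
        self.x = [self.mu] * self.size

    def sample(self):
        self.x = [
            x
            + self.theta * (self.mu - x) * self.dt
            + self.sigma * math.sqrt(self.dt) * self.rng.gauss(0.0, 1.0)
            for x in self.x
        ]
        return list(self.x)


def noisy_action(policy_action, noise, low=-1.0, high=1.0):
    """Add exploration noise to the actor's output and clip to the action bounds."""
    return [min(high, max(low, a + n)) for a, n in zip(policy_action, noise.sample())]
```

In a training loop, each environment step would store one transition with `append`, and each learning step would draw a minibatch with `sample` once the buffer holds at least `batch_size` transitions.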

Quick Start & Requirements

Highlighted Details

  • Implements Deep Deterministic Policy Gradient (DDPG).
  • Includes utility functions for replay buffer and random process from keras-rl.
  • Demonstrates training on Pendulum-v0 and MountainCarContinuous-v0 environments.
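
For readers unfamiliar with the algorithm, DDPG as described in the original paper trains the critic toward a bootstrapped target and lets slowly moving target networks track the learned ones. The sketch below shows those two formulas on plain Python numbers. It is an illustration only, not code from this repository: the function names `td_target` and `soft_update` are hypothetical, parameters are represented as flat lists of floats instead of PyTorch tensors, and the defaults for `gamma` and `tau` are illustrative values.

```python
def td_target(reward, next_q, done, gamma=0.99):
    """Critic regression target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` stands for the target critic's value of the target actor's
    action in the next state; terminal transitions do not bootstrap.
    """
    return reward + gamma * (0.0 if done else 1.0) * next_q


def soft_update(target_params, source_params, tau=0.001):
    """Polyak averaging: target <- (1 - tau) * target + tau * source."""
    return [(1.0 - tau) * t + tau * s for t, s in zip(target_params, source_params)]
```

A small `tau` makes the target parameters change slowly, so the regression target for the critic does not shift abruptly between updates.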

Maintenance & Community

  • Contributions are welcome.

Licensing & Compatibility

  • License: Not specified in the README.
  • Compatibility: The project targets PyTorch 0.1.9, a very old release; the code may not run on modern PyTorch or Python versions without changes.

Limitations & Caveats

The project relies on a significantly outdated version of PyTorch (0.1.9), which is likely incompatible with current Python versions and may lack features or performance improvements found in modern PyTorch. The README also does not specify a license, creating uncertainty for commercial use or integration into other projects.

Health Check

  • Last commit: 7 years ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 3 stars in the last 30 days

Explore Similar Projects

Starred by Deepak Pathak (Cofounder of Skild AI; Professor at CMU), Anastasis Germanidis (Cofounder of Runway), and 1 more.

deer by VinF

0%
489
Deep reinforcement learning framework
Created 9 years ago
Updated 2 months ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Gabriel Almeida (Cofounder of Langflow), and 5 more.

stable-baselines3 by DLR-RM

0.4%
12k
PyTorch library for reinforcement learning algorithm implementations
Created 5 years ago
Updated 5 days ago