PyTorch implementation of Deep Deterministic Policy Gradient (DDPG)
Top 54.3% on sourcepulse
This repository provides a PyTorch implementation of the Deep Deterministic Policy Gradient (DDPG) algorithm, a reinforcement learning method for continuous action spaces. It is suitable for researchers and practitioners looking to experiment with or apply DDPG in environments like Pendulum-v0 and MountainCarContinuous-v0.
How It Works
The implementation follows the DDPG algorithm, utilizing an actor-critic architecture. It employs a replay buffer for experience replay and a random process for exploration, with utility functions adapted from the keras-rl repository. This approach allows for stable learning in continuous control tasks by decoupling the gradient updates.
Quick Start & Requirements
./main.py
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project relies on a significantly outdated version of PyTorch (0.1.9), which is likely incompatible with current Python versions and may lack features or performance improvements found in modern PyTorch. The README also does not specify a license, creating uncertainty for commercial use or integration into other projects.
7 years ago
1 week