PyTorch implementation of REINFORCE for control tasks
Top 96.9% on sourcepulse
This repository provides a PyTorch implementation of the REINFORCE algorithm, a foundational policy gradient method for reinforcement learning. It is designed for researchers and practitioners looking to experiment with or apply REINFORCE to both discrete and continuous control tasks, specifically within OpenAI Gym environments.
How It Works
The implementation utilizes a neural network to approximate the policy function. For discrete action spaces, it typically outputs probabilities for each action, while for continuous spaces, it outputs parameters (e.g., mean and standard deviation) of a distribution from which actions are sampled. The REINFORCE algorithm then updates the policy network's weights by backpropagating the discounted return, scaled by the log-probability of the taken actions.
Quick Start & Requirements
python main.py --env_name [name of environment]
Highlighted Details
Maintenance & Community
No information on contributors, sponsorships, or community channels is available in the README.
Licensing & Compatibility
The README does not specify a license.
Limitations & Caveats
The project requires Python 2.7, which is end-of-life and may present compatibility issues with modern libraries. The lack of a specified license could restrict commercial use or integration into closed-source projects.
8 years ago
Inactive