PyTorch implementation of Evolution Strategies for Markov Decision Processes
Top 79.9% on sourcepulse
This repository provides a PyTorch implementation of Evolution Strategies (ES), a black-box optimization algorithm for training neural networks in reinforcement learning tasks. It is suitable for researchers and practitioners looking for an alternative to policy gradient methods, offering potential for efficient parallelization.
How It Works
The implementation leverages ES to optimize neural network parameters directly, bypassing traditional reinforcement learning techniques like policy gradients. It uses a population-based approach where multiple perturbed versions of a network are evaluated, and gradient updates are derived from the performance of these perturbations. The use of the SELU nonlinearity is noted as a computationally efficient alternative to virtual batch normalization.
Quick Start & Requirements
pip install -r requirements.txt
(after cloning).python3 main.py --small-net --env-name CartPole-v1
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The implementation is based on PyTorch version 0.2.0, which is significantly outdated. The README also mentions an unsupported slow_version
branch for managing threads, indicating potential stability or performance issues with certain configurations.
8 years ago
Inactive