evolution-strategies-starter by openai

Research paper code for distributed evolution strategies

Created 8 years ago

1,620 stars

Top 25.8% on SourcePulse

7 Experts Love This Project

Aravind Srinivas

Cofounder of Perplexity

Junxiao Song

Research Scientist at DeepSeek

Wei-Lin Chiang

Cofounder of LMArena

James Bradbury

Head of Compute at Anthropic

and 3 more!

Project Summary

This repository provides a distributed implementation of Evolution Strategies (ES), the code accompanying the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning." It is aimed at researchers and engineers exploring optimization techniques for complex control problems, and it uses a master-worker architecture to parallelize computation across many machines.

How It Works

The implementation uses a master-worker architecture. At each iteration, the master broadcasts the current parameters to the workers. Each worker evaluates perturbed copies of those parameters and sends the resulting returns back to the master, which uses them to update the parameters. Because workers report returns rather than gradients, the design lends itself to large-scale parallelization.
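The loop described above can be sketched in a single process. This is a minimal illustration, not the repository's code: the function names (`evaluate`, `worker`, `master_update`, `run_es`), the toy objective, the hyperparameters, and the use of per-worker seeds to rebuild the noise on the master are all assumptions made for the example.

```python
import numpy as np

def evaluate(theta):
    """Toy stand-in for an episode return: higher is better, peak at theta == 3."""
    return -float(np.sum((theta - 3.0) ** 2))

def worker(theta, sigma, seed):
    """Perturb the broadcast parameters and report (seed, return), not a gradient."""
    eps = np.random.default_rng(seed).standard_normal(theta.shape)
    return seed, evaluate(theta + sigma * eps)

def master_update(theta, results, sigma, lr):
    """Rebuild each worker's noise from its seed and take one ES step."""
    seeds, returns = zip(*results)
    returns = np.asarray(returns)
    # Standardize returns so the step size does not depend on reward scale.
    weights = (returns - returns.mean()) / (returns.std() + 1e-8)
    noise = np.stack(
        [np.random.default_rng(s).standard_normal(theta.shape) for s in seeds]
    )
    # Return-weighted average of the perturbations, scaled by 1 / sigma.
    grad_estimate = weights @ noise / (len(seeds) * sigma)
    return theta + lr * grad_estimate

def run_es(iterations=300, n_workers=50, sigma=0.1, lr=0.02, dim=5):
    """Simulate the master loop; workers are plain function calls here."""
    theta = np.zeros(dim)
    for it in range(iterations):
        results = [
            worker(theta, sigma, seed=it * n_workers + k) for k in range(n_workers)
        ]
        theta = master_update(theta, results, sigma, lr)
    return theta

if __name__ == "__main__":
    print(run_es())  # parameters should end up close to 3.0 in every dimension
```

In this sketch each worker sends back only a seed and a scalar return, so the traffic per worker stays small regardless of how many parameters the policy has.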

Quick Start & Requirements

Install/Run: Build custom AMIs using scripts/packer.json and launch experiments with scripts/launch.py.
Prerequisites: AWS account, MuJoCo (user-provided license and binaries in scripts/dependency.sh), Packer.
Setup: Building AMIs and configuring launch scripts requires manual steps.

Highlighted Details

Master-worker architecture for distributed computation.
Resilient to worker termination, suitable for spot instances.
Code used for the humanoid scaling experiment in the cited paper.
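One plausible reason this kind of algorithm tolerates worker termination is that every perturbation is evaluated independently, so the master can update from whichever results arrive. The sketch below simulates that idea; it is an assumption about the mechanism rather than a description of the repository's code, and the names `es_step_with_dropouts`, `run_with_dropouts`, and `drop_prob` are hypothetical.

```python
import numpy as np

def es_step_with_dropouts(theta, objective, rng, n_workers=40,
                          sigma=0.1, lr=0.02, drop_prob=0.3):
    """One ES step where each simulated worker may vanish before reporting."""
    noise, returns = [], []
    for _ in range(n_workers):
        eps = rng.standard_normal(theta.shape)
        if rng.random() < drop_prob:
            continue  # worker terminated (e.g. spot instance reclaimed): no result
        noise.append(eps)
        returns.append(objective(theta + sigma * eps))
    if len(returns) < 2:
        return theta, len(returns)  # too few results to compare; skip the update
    returns = np.asarray(returns)
    weights = (returns - returns.mean()) / (returns.std() + 1e-8)
    # Average over the results that actually arrived, not over n_workers.
    step = weights @ np.stack(noise) / (len(returns) * sigma)
    return theta + lr * step, len(returns)

def run_with_dropouts(iterations=300, dim=3, seed=0):
    """Optimize a toy objective while roughly 30% of workers drop each iteration."""
    rng = np.random.default_rng(seed)
    objective = lambda x: -float(np.sum((x - 1.0) ** 2))
    theta = np.zeros(dim)
    reported = []
    for _ in range(iterations):
        theta, n = es_step_with_dropouts(theta, objective, rng)
        reported.append(n)
    return theta, objective(theta), reported

if __name__ == "__main__":
    theta, final_return, reported = run_with_dropouts()
    print(theta, final_return, min(reported), max(reported))
```

Losing workers in this setup only makes the update noisier for that iteration; no single worker's result is required for the master to proceed.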

Maintenance & Community

This project is archived and no longer actively maintained or updated.

Licensing & Compatibility

The repository's license is not explicitly stated in the README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is archived and provided as-is, with no expected updates. It requires a user-provided MuJoCo license and manual AMI building, which means significant setup overhead and potential compatibility issues with modern environments.

Health Check

Last Commit: 6 years ago
Responsiveness: Inactive

Star History

3 stars in the last 30 days