coinrun  by openai

RL research environment and training script

Created 6 years ago
401 stars

Top 72.2% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides the environments and training scripts for the paper "Quantifying Generalization in Reinforcement Learning." It is targeted at researchers and practitioners in reinforcement learning who need to reproduce or extend the CoinRun experiments, offering a standardized benchmark for evaluating generalization capabilities.

How It Works

The project implements custom game environments for reinforcement learning agents, specifically designed to test generalization. It utilizes a procedurally generated maze-like structure with varying difficulty and visual elements. The core training mechanism employs Proximal Policy Optimization (PPO), a popular and effective on-policy reinforcement learning algorithm, integrated with MPI for parallel training.

Quick Start & Requirements

  • Install: pip install -e . after cloning the repository and installing dependencies.
  • Prerequisites: Python 3.6, TensorFlow 1.12.0 (or tensorflow-gpu), mpich, build-essential, qt5-default, pkg-config (Linux) or qt, open-mpi, pkg-config, git (Mac).
  • Setup: Requires compilation of the environment upon first import. A Dockerfile is provided.
  • Links: Blog Post, Paper

Highlighted Details

  • Codebase for the CoinRun RL environment and associated research.
  • Supports training agents using PPO with MPI for parallelization.
  • Includes scripts for interactive play, training, and evaluation.
  • Offers options for training on specific level sets or continuing from checkpoints.

Maintenance & Community

  • Status: Archived; no updates expected.
  • Authors: Karl Cobbe, Oleg Klimov, Chris Hesse, Taehoon Kim, John Schulman (OpenAI).

Licensing & Compatibility

  • License: Not explicitly stated in the README, but asset licenses are in LICENSES. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is archived and will not receive updates. It requires specific, older versions of TensorFlow (1.12.0) and Python (3.6), which may pose compatibility challenges with modern systems. The environment compilation process might require manual adjustments to the Makefile on some systems.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
3 stars in the last 30 days

Explore Similar Projects

Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research) and Will Brown Will Brown(Research Lead at Prime Intellect).

agent-lightning by microsoft

6.0%
2k
Train any AI agent with rollouts and feedback
Created 3 months ago
Updated 2 days ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Evan Hubinger Evan Hubinger(Head of Alignment Stress-Testing at Anthropic), and
2 more.

random-network-distillation by openai

0%
909
RL research paper code
Created 7 years ago
Updated 5 years ago
Starred by Hanlin Tang Hanlin Tang(CTO Neural Networks at Databricks; Cofounder of MosaicML), Amanpreet Singh Amanpreet Singh(Cofounder of Contextual AI), and
2 more.

coach by IntelLabs

0%
2k
Reinforcement learning framework for experimentation (discontinued)
Created 8 years ago
Updated 2 years ago
Feedback? Help us improve.