pg_travel by reinforcement-learning-kr

PyTorch implementations of Policy Gradient reinforcement learning algorithms

Created 7 years ago
370 stars

Top 76.3% on SourcePulse

Project Summary

This repository provides PyTorch implementations of key Policy Gradient (PG) reinforcement learning algorithms, including REINFORCE, NPG, TRPO, and PPO. It targets researchers and practitioners in reinforcement learning, offering a unified framework for experimenting with and comparing these advanced PG methods on standard benchmarks.

How It Works

The project implements four PG algorithms: Vanilla Policy Gradient (REINFORCE), Truncated Natural Policy Gradient (NPG), Trust Region Policy Optimization (TRPO), and Proximal Policy Optimization (PPO). Models are defined and trained in PyTorch. The implementations are modular, so switching algorithms or tuning hyperparameters does not require restructuring the code. Agents are trained on standard RL benchmarks, MuJoCo (via mujoco-py) and Unity ml-agents environments, which makes results easier to reproduce and to compare across algorithms.
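The "truncated" in Truncated Natural Policy Gradient usually refers to solving F x = g (Fisher matrix F, policy gradient g) approximately with a few conjugate-gradient iterations instead of inverting F. The following is a minimal pure-Python sketch of that idea, not the repository's code: the names `conjugate_gradient` and `fisher_vector_product`, the iteration count, and the 2x2 matrix standing in for a Fisher matrix are all assumptions made for illustration.

```python
def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def conjugate_gradient(matvec, b, iters=10, tol=1e-10):
    """Approximately solve A x = b using only products A @ v (matvec)."""
    x = [0.0] * len(b)
    r = list(b)  # residual b - A x, which equals b because x starts at 0
    p = list(b)  # current search direction
    rs_old = dot(r, r)
    for _ in range(iters):
        if rs_old < tol:  # residual is small enough, stop early
            break
        Ap = matvec(p)
        alpha = rs_old / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs_old) * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x


# Hypothetical stand-in for a Fisher matrix: small, symmetric, positive definite.
F = [[4.0, 1.0], [1.0, 3.0]]


def fisher_vector_product(v):
    """Return F @ v; a real implementation would avoid forming F explicitly."""
    return [dot(row, v) for row in F]


g = [1.0, 2.0]  # hypothetical policy gradient
step_dir = conjugate_gradient(fisher_vector_product, g)  # approximates F^-1 g
```

In practice the matrix-vector product is typically computed from gradients of the KL divergence, so the full Fisher matrix never has to be stored; TRPO then adds a line search along `step_dir` to respect the trust-region constraint.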

Quick Start & Requirements

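  • MuJoCo:
    • Install: pip install -r requirements.txt (run from within pg_travel/mujoco)
    • Prerequisites: mujoco-py, Python 3.x. MuJoCo releases contemporary with this code required a license key; MuJoCo has been freely available since DeepMind acquired it in 2021.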
    • Run: python main.py (defaults to PPO on Hopper-v2)
    • Docs: Mujoco-py
  • Unity ml-agents:
    • Install: Download prebuilt environments and place in pg_travel/unity/env.
    • Prerequisites: Unity ml-agents, Python 3.x.
    • Run: python main.py --train (within pg_travel/unity)
    • Docs: Unity ml-agents

Highlighted Details

  • Implements Vanilla PG, NPG, TRPO, and PPO algorithms.
  • Supports both mujoco-py and custom Unity ml-agents environments.
  • Includes TensorboardX integration for visualizing training progress.
  • Provides options for continuing training from checkpoints and testing pre-trained models.
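
Of the algorithms listed above, PPO is the one the MuJoCo entry point runs by default. As a rough illustration of what sets it apart, here is a pure-Python sketch of the clipped surrogate objective from the PPO paper. It is not taken from the repository: the function name `ppo_clip_loss`, the list-based inputs, and the clip range of 0.2 are assumptions chosen for the example.

```python
import math


def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Negative clipped surrogate objective, averaged over samples.

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy; advantages: advantage estimates.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(lp_new - lp_old)  # pi_new(a|s) / pi_old(a|s)
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return -total / len(advantages)


# Hypothetical batch of two samples, each collected with probability 0.5.
old_logp = [math.log(0.5), math.log(0.5)]
new_logp = [math.log(0.75), math.log(0.25)]  # ratios of about 1.5 and 0.5
advantages = [1.0, -1.0]

# Both ratios fall outside [0.8, 1.2], so both samples use the clipped term.
loss = ppo_clip_loss(new_logp, old_logp, advantages)
```

Once a sample's ratio leaves the clip range in the direction the advantage favors, the clipped term stops changing with the policy, which removes the incentive to move further from the data-collecting policy in a single update.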

Maintenance & Community

The repository is maintained by reinforcement-learning-kr. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the README. Compatibility for commercial use or closed-source linking would require clarification of the licensing terms.

Limitations & Caveats

The project targets PyTorch v0.4.0, a 2018 release, so the code may need changes before it runs on current PyTorch releases and their companion libraries. The README lists trained agents and Unity ml-agents environment source files as "soon to be available," so those assets may be missing.

Health Check

  • Last Commit: 6 years ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 0 stars in the last 30 days
