tensorflow-reinforce  by yukezhu

TensorFlow implementations of reinforcement learning models

Created 9 years ago
487 stars

Top 63.2% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides implementations of various reinforcement learning algorithms using TensorFlow 1.0, targeting researchers and students interested in deep RL. It offers educational code examples for learning and experimenting with RL models, evaluated against OpenAI Gym environments.

How It Works

The project implements classic and deep reinforcement learning algorithms, including Cross-Entropy Method (CEM), Q-Learning, Deep Q-Networks (DQN), Double DQN, REINFORCE, Actor-Critic, and Deep Deterministic Policy Gradient (DDPG). Models are built using TensorFlow 1.0, leveraging its graph computation capabilities for efficient training and inference.

Quick Start & Requirements

  • Install: pip install -r requirements.txt
  • Prerequisites: TensorFlow 1.0, Python 2.7 or 3.5, OpenAI Gym.
  • Links: OpenAI Gym

Highlighted Details

  • Implements a range of RL algorithms from tabular methods to policy gradients.
  • Models are evaluated in standard OpenAI Gym environments.
  • Codebase supports both Python 2.7 and 3.5.

Maintenance & Community

The project is maintained by yukezhu. Contributions and feedback are welcomed.

Licensing & Compatibility

  • License: MIT
  • Compatibility: Permissive MIT license allows for commercial use and integration into closed-source projects.

Limitations & Caveats

The implementations are primarily for educational purposes and may require modifications to work effectively on specific RL problems. The project uses TensorFlow 1.0, which is an older version and may not be compatible with newer TensorFlow ecosystems or hardware acceleration features.

Health Check
Last Commit

7 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 30 days

Explore Similar Projects

Starred by Deepak Pathak Deepak Pathak(Cofounder of Skild AI; Professor at CMU), Anastasis Germanidis Anastasis Germanidis(Cofounder of Runway), and
1 more.

deer by VinF

0%
489
Deep reinforcement learning framework
Created 9 years ago
Updated 2 months ago
Feedback? Help us improve.