TensorFlow v2 implementation of the PILCO algorithm
Top 83.8% on sourcepulse
This repository provides a modern TensorFlow 2 implementation of the Probabilistic Inference for Learning Control (PILCO) algorithm, targeting researchers and practitioners in Bayesian Reinforcement Learning. It offers a clean, GPU-scalable approach to learning control policies by leveraging Gaussian Processes for probabilistic modeling.
How It Works
PILCO utilizes Gaussian Processes (GPs) for regression, enabling it to model system dynamics and uncertainty. This probabilistic approach allows for sample-efficient learning and robust control, especially in scenarios with limited data. The implementation leverages TensorFlow 2 for automatic differentiation and GPU acceleration, and GPflow 2 for GP regression, offering a significant advantage over older, MATLAB-based implementations by improving scalability and ease of integration.
Quick Start & Requirements
git clone https://github.com/nrontsis/PILCO && cd PILCO && python setup.py develop
.python examples/inverted_pendulum.py
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project requires specific, older versions of OpenAI gym and mujoco-py, which may pose installation challenges or compatibility issues with newer environments. The absence of an explicit license could impact commercial use or integration into closed-source projects.
4 years ago
1 day