Imitation learning algorithms research paper (SAC base)
Top 59.5% on sourcepulse
This repository provides a pragmatic implementation of various deep imitation learning algorithms, primarily built upon the Soft Actor-Critic (SAC) framework. It targets researchers and practitioners in reinforcement learning and robotics who need to compare and experiment with different imitation learning techniques for learning policies from expert demonstrations. The library offers a flexible and modular approach to evaluating algorithms like GAIL, DRIL, AdRIL, and others, facilitating reproducible research.
How It Works
The core of the library leverages the Soft Actor-Critic (SAC) off-policy reinforcement learning algorithm as a foundation. Various imitation learning methods are integrated by modifying the reward signal or incorporating discriminator networks. For instance, GAIL uses a discriminator to distinguish between agent and expert trajectories, while DRIL employs a disagreement-regularized discriminator. The implementation supports features like state-only imitation, absorbing state indicators, and mixing expert data with agent data for improved sample efficiency.
Quick Start & Requirements
pip install -r requirements.txt
python train.py algorithm=<ALG> env=<ENV>
(e.g., python train.py algorithm=GAIL env=hopper
)Highlighted Details
Maintenance & Community
The project acknowledges contributions from Kai Arulkumaran and Dan Ogawa Lillrank, with citations to relevant research papers and GitHub repositories. There is no explicit mention of active community channels like Discord or Slack, nor a public roadmap.
Licensing & Compatibility
The repository does not explicitly state a license in the provided README. This requires further investigation for commercial use or integration into closed-source projects.
Limitations & Caveats
The README mentions that v1.0 contained on-policy algorithms, implying v2.0 (the current version) focuses on off-policy methods. The absence of an explicit license is a significant caveat for adoption. The project's primary focus is on MuJoCo environments, and compatibility with other simulation platforms may require modifications.
4 months ago
1 day