RL research paper code
Top 40.9% on sourcepulse
This repository provides the code for the paper "Exploration by Random Network Distillation" (RND). It enables reinforcement learning agents to explore novel states in environments by rewarding them for encountering states that their internal "random network" predicts poorly. This is particularly beneficial for sparse-reward environments like Montezuma's Revenge.
How It Works
The core of RND involves two neural networks: a fixed, randomly initialized target network and a predictor network. The predictor network is trained to mimic the output of the target network for states encountered by the agent. The difference between the target and predictor network outputs serves as an intrinsic reward signal, encouraging the agent to visit states where the predictor network is less accurate, thus driving exploration.
Quick Start & Requirements
python run_atari.py --gamma_ext 0.999
mpiexec -n 8 python run_atari.py --num_env 128 --gamma_ext 0.999
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is archived and will not receive further updates. The license is not specified, which may pose a barrier to commercial adoption or integration into closed-source projects.
4 years ago
1+ week