MADDPG implementation in PyTorch
Top 80.3% on sourcepulse
This repository provides a PyTorch implementation of the Multi-Agent Deep Deterministic Policy Gradients (MADDPG) algorithm, targeting researchers and practitioners in multi-agent reinforcement learning. It enables training agents in cooperative-competitive environments, as detailed in the MADDPG paper.
How It Works
The implementation follows the MADDPG algorithm, which extends DDPG to multi-agent settings. It utilizes an actor-critic architecture where each agent has its own actor and critic. The critic takes the observations and actions of all agents as input, allowing it to learn a centralized value function. This centralized critic aids in training decentralized actors, addressing the non-stationarity inherent in multi-agent RL.
Quick Start & Requirements
https://github.com/openai/multiagent-particle-envs
.make_env
function from the MAPE package.https://youtu.be/tZTQ6S9PfkE
Highlighted Details
https://arxiv.org/pdf/1706.02275.pdf
).Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
4 years ago
1 day