MADDPG research paper implementation
Top 24.0% on sourcepulse
This repository provides the implementation of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm, designed for mixed cooperative-competitive environments. It is primarily intended for researchers and practitioners in multi-agent reinforcement learning, offering a framework to experiment with and apply MADDPG to scenarios like the Multi-Agent Particle Environments (MPE).
How It Works
MADDPG is an actor-critic algorithm that extends DDPG to multi-agent settings. Each agent has its own actor and critic. The critic for each agent takes the observations and actions of all agents as input, allowing it to learn a value function that accounts for the actions of others. This centralized critic training, combined with decentralized execution, enables agents to learn effective policies in complex, dynamic environments.
Quick Start & Requirements
pip install -e .
python experiments/train.py --scenario <scenario_name>
(e.g., python experiments/train.py --scenario simple
)Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The repository is archived, meaning no further updates or bug fixes are expected. The code structure has been modified since the original paper, and results may vary from those reported. The strict dependency on older library versions (TensorFlow 1.8.0) is a significant barrier to adoption for current projects.
1 year ago
1 week