PyTorch implementation of the MADDPG multi-agent RL algorithm
Top 52.7% on sourcepulse
This repository provides a PyTorch implementation of the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm, designed for training agents in mixed cooperative-competitive environments. It is suitable for researchers and practitioners exploring multi-agent reinforcement learning strategies.
How It Works
The implementation follows the MADDPG framework, which extends DDPG to multi-agent settings by using a centralized critic that observes the actions and states of all agents, while each agent maintains its own decentralized actor. This approach allows for stable learning in complex, non-stationary environments where individual agent policies are constantly changing.
Quick Start & Requirements
python main.py --help
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The implementation is a personal fork and may not reflect the latest advancements or best practices. Features like ensemble training, inferring other agents' policies, and mixed continuous/discrete action spaces are explicitly noted as not implemented. The specified dependency versions are from the time of use and may not be strict requirements.
5 years ago
1 day