Multi-agent learning via sequence modeling using Transformers
Top 70.8% on sourcepulse
This repository provides the official implementation of Multi-Agent Transformer (MAT), a novel neural network designed for cooperative Multi-Agent Reinforcement Learning (MARL). MAT bridges MARL and sequence modeling by casting MARL problems into a Transformer-based encoder-decoder architecture, enabling the application of powerful sequence models to MARL challenges. It is an online RL method trained through trial and error, offering superior performance and generalization on benchmarks like StarCraftII, Multi-Agent MuJoCo, and Google Research Football.
How It Works
MAT frames cooperative MARL as a sequence modeling problem, leveraging an encoder-decoder architecture. It utilizes the multi-agent advantage decomposition theorem to achieve linear time complexity for multi-agent scenarios, ensuring monotonic performance improvements. This approach differs from offline methods by employing online reinforcement learning, allowing for adaptation and learning through interaction.
Quick Start & Requirements
pip install -r requirements.txt
mujoco-py
and multiagent_mujoco
, with specific LD_LIBRARY_PATH
and LD_PRELOAD
environment variables.bash install_sc2.sh
or manual installation.scripts
folder (e.g., ./train_mujoco.sh
) with algo="mat"
or algo="mat_dec"
. Configuration can be modified in shell files or config.py
.Highlighted Details
Maintenance & Community
The project is associated with PKU-MARL. Further community or maintenance details are not explicitly provided in the README.
Licensing & Compatibility
The README does not explicitly state the license. The citation format suggests it is based on academic research. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The README does not detail specific limitations, unsupported platforms, or known bugs. The setup for certain environments (MuJoCo, StarCraft II) involves multiple external dependencies and specific configuration steps.
1 year ago
1 week