MARL research codebase for fast experimentation in JAX
Top 44.0% on sourcepulse
Mava is a research-focused codebase for multi-agent reinforcement learning (MARL) in JAX, designed for rapid experimentation and scalability. It provides researchers with fast, single-file implementations of state-of-the-art MARL algorithms, enabling quick iteration on new ideas.
How It Works
Mava leverages JAX for its high-performance, automatic differentiation, and compilation capabilities, allowing for end-to-end JIT compilation of MARL training loops. It supports two distribution architectures: Anakin for JAX-based environments, enabling full JIT compilation, and Sebulba for non-JAX environments, facilitating interaction with multiple CPU cores. This approach results in significantly faster experiment runtimes compared to non-JAX alternatives.
Quick Start & Requirements
uv sync
or pip install -e .
.python mava/systems/ppo/anakin/ff_ippo.py
. Configuration is managed via Hydra, allowing overrides from the terminal.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
Mava is not designed as a modular library and is intended to be used directly from the cloned repository. While it supports various environments, adding new ones requires using existing wrappers as a guide.
2 weeks ago
1 day