Multi-Agent-Deep-Deterministic-Policy-Gradients by philtabor

MADDPG implementation in PyTorch

Created 4 years ago

372 stars

Top 76.1% on SourcePulse

Project Summary

This repository provides a PyTorch implementation of the Multi-Agent Deep Deterministic Policy Gradients (MADDPG) algorithm, targeting researchers and practitioners in multi-agent reinforcement learning. It enables training agents in cooperative-competitive environments, as detailed in the MADDPG paper.

How It Works

The implementation follows the MADDPG algorithm, which extends DDPG to multi-agent settings. It utilizes an actor-critic architecture where each agent has its own actor and critic. The critic takes the observations and actions of all agents as input, allowing it to learn a centralized value function. This centralized critic aids in training decentralized actors, addressing the non-stationarity inherent in multi-agent RL.

Quick Start & Requirements

Install the Multi Agent Particle Environment (MAPE) from https://github.com/openai/multiagent-particle-envs.
Recommended PyTorch version: 1.4.0 (later versions may have issues with in-place operations).
Clone this repository into the same directory as the MAPE.
The main file requires the make_env function from the MAPE package.
Tutorial video: https://youtu.be/tZTQ6S9PfkE

Highlighted Details

PyTorch implementation of MADDPG.
Designed for mixed cooperative-competitive environments.
Based on the paper "Multi Agent Actor Critic for Mixed Cooperative-Competitive Environments" (https://arxiv.org/pdf/1706.02275.pdf).

Maintenance & Community

The repository is a personal implementation by philtabor.
No explicit community channels or roadmap are mentioned.

Licensing & Compatibility

The repository does not explicitly state a license. The underlying MAPE environment has no explicit license mentioned in its README.

Limitations & Caveats

Requires a specific, older PyTorch version (1.4.0) due to potential in-place operation issues in newer versions.
Dependencies for the MAPE are described as "somewhat out of date," potentially requiring manual dependency management.
The project appears to be a single author's implementation without extensive community support or ongoing maintenance signals.

Health Check

Last Commit

4 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

2 stars in the last 30 days

Explore Similar Projects

DI-sheep by opendilab

RL agent for "3 Tiles" game

Created 3 years ago

Updated 10 months ago

Starred by

Wing Lian

Wing Lian(Founder of Axolotl AI),

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect), and

3 more.

LlamaGym by KhoomeiK

SDK for fine-tuning LLM agents with online reinforcement learning

Created 1 year ago

Updated 1 year ago

pytorch-rl by bentrevett

PyTorch tutorials for reinforcement learning algorithms

Created 6 years ago

Updated 5 years ago

agent-q by sentient-engineering

Autonomous AI agent for web task completion

Created 1 year ago

Updated 1 year ago

maddpg-pytorch by shariqiqbal2810

PyTorch implementation of the MADDPG multi-agent RL algorithm

Created 8 years ago

Updated 6 years ago

Reinforcement_Learning by pythonlessons

Reinforcement learning tutorials using TensorFlow

Created 6 years ago

Updated 2 years ago

Starred by

Michael Truell

Michael Truell(Cofounder of Cursor).

DDPG by floodsung

DDPG implementation for continuous control tasks

Created 9 years ago

Updated 4 years ago

MAAC by shariqiqbal2810

Research paper code for multi-agent reinforcement learning

Created 7 years ago

Updated 3 years ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity),

Guillaume Lample

Guillaume Lample(Cofounder of Mistral), and

3 more.

pytorch-a3c by ikostrikov

PyTorch implementation of A3C reinforcement learning algorithm

Created 9 years ago

Updated 6 years ago

Starred by

Luca Antiga

Luca Antiga(CTO of Lightning AI).

maddpg by openai

MADDPG research paper implementation

Created 8 years ago

Updated 1 year ago

Starred by

Théophile Gervet

Théophile Gervet(Cofounder of Genesis AI),

Joshua Achiam

Joshua Achiam(Head of Mission Alignment at OpenAI), and

10 more.

pytorch-a2c-ppo-acktr-gail by ikostrikov

PyTorch implementations of reinforcement learning algorithms

Created 8 years ago

Updated 3 years ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect).

Deep-Reinforcement-Learning-Hands-On by PacktPublishing

Code samples for a deep reinforcement learning book

Created 7 years ago

Updated 3 weeks ago

Feedback? Help us improve.