Self-play RL for multiplayer environments
This project enables training AI agents for custom multiplayer environments using self-play reinforcement learning, specifically Proximal Policy Optimization (PPO). It's designed for researchers and developers interested in multi-agent AI and game development, offering a structured approach to evolving AI opponents.
How It Works
The core innovation is a wrapper that transforms multiplayer environments into single-player ones for PPO. It manages opponent turn-taking and delays the reward signal until all players have acted. New policy versions are periodically added to a bank of frozen networks, creating a constantly evolving training landscape in which agents learn against increasingly sophisticated versions of themselves.
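As a rough illustration of that mechanism, here is a minimal sketch of such a wrapper. The names (`SelfPlayWrapper`, `opponent_bank`, `n_players`, `current_player`, `act`) are assumptions for the example, not the project's actual API:

```python
import random


class SelfPlayWrapper:
    """Sketch: expose an N-player turn-based env as single-player.

    The learning agent controls seat 0; the remaining seats are played
    by frozen policies sampled from a bank of earlier checkpoints.
    Hypothetical API, not the repo's implementation.
    """

    def __init__(self, env, opponent_bank):
        self.env = env
        self.opponent_bank = opponent_bank  # frozen past versions of the agent

    def reset(self):
        # Sample one frozen opponent per non-learning seat each episode,
        # so the agent faces a mix of its own past selves.
        self.opponents = [random.choice(self.opponent_bank)
                          for _ in range(self.env.n_players - 1)]
        obs = self.env.reset()
        obs, _, _, _ = self._advance_opponents(obs)
        return obs

    def step(self, action):
        # Apply the agent's move, then let every opponent take a turn.
        # The reward only becomes meaningful once the full round has
        # resolved, so it is accumulated across the opponents' turns.
        obs, reward, done, info = self.env.step(action)
        if not done:
            obs, delayed, done, info = self._advance_opponents(obs)
            reward += delayed
        return obs, reward, done, info

    def _advance_opponents(self, obs):
        # Step frozen opponents until it is the agent's turn again,
        # summing any reward credited to the learning agent on the way.
        total, done, info = 0.0, False, {}
        while not done and self.env.current_player != 0:
            opponent = self.opponents[self.env.current_player - 1]
            action = opponent.act(self.env.observation, self.env.legal_actions)
            obs, reward, done, info = self.env.step(action)
            total += reward
        return obs, total, done, info
```

Sampling opponents from the whole bank, rather than always using the latest checkpoint, is what keeps the curriculum diverse and guards against the agent overfitting to a single opponent.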
Quick Start & Requirements
Run the Docker container (`docker-compose up -d`), install a specific environment (e.g., `bash ./scripts/install_env.sh sushigo`), and start self-play training (e.g., `mpirun -np 10 python3 train.py -e sushigo`).
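Collected as a single shell session for convenience (the clone step is an assumption; the summary does not name the repository URL):

```bash
# Clone the project and enter it (repository URL not given here)
git clone <repo-url> && cd <repo-dir>

# Build and start the Docker container in the background
docker-compose up -d

# Install one of the bundled environments, e.g. Sushi Go
bash ./scripts/install_env.sh sushigo

# Launch self-play PPO training across 10 MPI worker processes
mpirun -np 10 python3 train.py -e sushigo
```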
Highlighted Details
Custom environments must implement a standard interface (`step`, `reset`, `render`, `observation`, `legal_actions`); a hypothetical skeleton is sketched below. The repo provides `test.py` for playing against trained agents or baselines and `train.py` for self-play training.
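The skeleton below shows what such an environment might look like. Only the five method/property names come from the summary; the class name, game state, and toy game logic are invented for illustration:

```python
import numpy as np


class CustomGameEnv:
    """Toy two-player environment implementing the interface above."""

    def __init__(self):
        self.n_players = 2
        self.n_actions = 4
        self.reset()

    def reset(self):
        # Clear the game state and return the opening observation.
        self.board = np.zeros(self.n_actions, dtype=np.float32)
        self.current_player = 0
        return self.observation

    def step(self, action):
        # Apply the current player's move, advance the turn, and return
        # the usual (observation, reward, done, info) tuple.
        assert action in self.legal_actions
        self.board[action] = 1.0
        done = bool(self.board.all())  # game ends when every slot is taken
        reward = 1.0 if done else 0.0  # toy reward for the finishing move
        self.current_player = (self.current_player + 1) % self.n_players
        return self.observation, reward, done, {}

    def render(self):
        print(f"player {self.current_player} to move, board={self.board}")

    @property
    def observation(self):
        # Feature vector describing the state from the current player's view.
        return self.board.copy()

    @property
    def legal_actions(self):
        # Indices of the moves still available to the current player.
        return [i for i in range(self.n_actions) if self.board[i] == 0.0]
```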
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The documentation for `test.py` and `train.py` command-line arguments is noted as incomplete, with further documentation planned.