Framework for LLM-based multi-agent reinforced training and inference
Summary
MARTI is an open-source framework for training LLM-based Multi-Agent Systems (MAS) with Reinforcement Learning (RL). It addresses the scalability and context-tracking limitations of single-agent LLMs by using RL to shape structured, collaborative agent behavior. The framework targets researchers and developers who want to advance LLM reasoning capabilities and foster collective intelligence.
How It Works
MARTI combines centralized multi-agent interaction with distributed policy training: agent interactions and rewards are managed centrally, while each agent's policy is trained separately. Its core modules are the Multi-Agent World, Centralized Rewarding, and the Single Agent Trainer. This design supports scalable, adaptive workflows that pair the language capabilities of LLMs with the robustness of multi-agent systems and the adaptability of RL for complex tasks.
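To make the split between centralized interaction/rewarding and per-agent training concrete, here is a minimal, self-contained Python sketch. The class and method names echo the three modules described above but are purely illustrative assumptions, not MARTI's actual API; real agents would be LLM calls (e.g. served with vLLM) and the trainer update would be an RL step (e.g. PPO via OpenRLHF).

```python
# Illustrative sketch only: names echo the described modules (Multi-Agent World,
# Centralized Rewarding, Single Agent Trainer) but do not reflect MARTI's real API.
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Trajectory:
    agent_id: int
    messages: List[str] = field(default_factory=list)
    reward: float = 0.0


class MultiAgentWorld:
    """Centralized interaction: runs one collaborative episode over all agents."""

    def __init__(self, agents: List[Callable[[List[str]], str]]):
        self.agents = agents  # each agent maps the shared context to a reply

    def rollout(self, task: str) -> List[Trajectory]:
        context = [task]
        trajectories = []
        for agent_id, agent in enumerate(self.agents):
            reply = agent(context)            # in practice an LLM call (e.g. vLLM)
            context.append(reply)
            trajectories.append(Trajectory(agent_id, [reply]))
        return trajectories


class CentralizedRewarding:
    """Scores the joint outcome and distributes credit to individual agents."""

    def __init__(self, verifier: Callable[[str, List[Trajectory]], float]):
        self.verifier = verifier              # e.g. rule-based answer checking

    def assign(self, task: str, trajectories: List[Trajectory]) -> List[Trajectory]:
        team_score = self.verifier(task, trajectories)
        for traj in trajectories:
            traj.reward = team_score          # simplest shared-credit scheme
        return trajectories


class SingleAgentTrainer:
    """Placeholder for a per-agent RL update (e.g. a PPO step in OpenRLHF)."""

    def update(self, trajectory: Trajectory) -> None:
        print(f"agent {trajectory.agent_id}: reward={trajectory.reward:.2f}")


def train_step(world, rewarder, trainers, task):
    # Interaction and rewarding are centralized; policy updates happen per agent.
    trajectories = rewarder.assign(task, world.rollout(task))
    for traj in trajectories:
        trainers[traj.agent_id].update(traj)


if __name__ == "__main__":
    agents = [lambda ctx, i=i: f"agent-{i} reply to: {ctx[-1][:30]}" for i in range(3)]
    world = MultiAgentWorld(agents)
    rewarder = CentralizedRewarding(lambda task, trajs: 1.0)   # dummy verifier
    trainers = [SingleAgentTrainer() for _ in agents]
    train_step(world, rewarder, trainers, "Solve: 12 * 7 = ?")
```

The point of the sketch is the separation of concerns: one component owns the shared conversation and reward assignment, while each agent's policy update stays local, which is what allows the training side to scale out.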
Quick Start & Requirements
Installation requires cloning the repository (git clone https://github.com/TsinghuaC3I/MARTI.git), navigating into the directory (cd MARTI), and installing dependencies (pip install -r requirements.txt). Key prerequisites include OpenRLHF, Ray, and vLLM. Training with three Qwen2.5-3B agents requires a minimum of approximately 6x80GB GPUs. Example scripts for inference and training are provided.
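Before launching the example scripts, a short pre-flight check along the following lines can confirm that the prerequisites are importable and that enough GPUs are visible. The import names (ray, vllm, openrlhf) and the GPU-count threshold are assumptions inferred from the listed requirements, not part of MARTI itself.

```python
# Minimal pre-flight check before running the provided example scripts.
# Import names (ray, vllm, openrlhf) and the GPU threshold are assumptions
# based on the listed prerequisites and the ~6x80GB three-agent requirement.
import importlib.util

import torch

for pkg in ("ray", "vllm", "openrlhf"):
    status = "found" if importlib.util.find_spec(pkg) else "MISSING"
    print(f"{pkg}: {status}")

gpus = torch.cuda.device_count()
print(f"visible GPUs: {gpus}")
if gpus < 6:
    print("Warning: the three-agent Qwen2.5-3B training example expects ~6x80GB GPUs.")
```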
Highlighted Details
Maintenance & Community
Developed by Tsinghua University and Shanghai AI Lab, MARTI is in an early experimental stage with active development. The project welcomes collaborations to advance LLM-based multi-agent RL. Key contacts include Kaiyan Zhang and Biqing Qi.
Licensing & Compatibility
The provided README does not specify a software license. Users should verify terms for integration, especially for commercial use.
Limitations & Caveats
MARTI is at a very early experimental stage, so APIs and behavior may be unstable. While it aims to address known limitations of LLM agent systems, such as poor role adherence and unreliable inter-agent communication, these issues may still arise. Third-party integrations (AutoGen, CAMEL) and generative reward models remain experimental features.