MARTI  by TsinghuaC3I

Framework for LLM-based multi-agent reinforced training and inference

Created 5 months ago
296 stars

Top 89.4% on SourcePulse

GitHubView on GitHub
Project Summary

Summary

MARTI is an open-source framework for training LLM-based Multi-Agent Systems (MAS) with Reinforcement Learning (RL). It addresses scalability and context tracking limitations of single-agent LLMs by enabling structured, collaborative agent behavior via RL. The framework targets researchers and developers aiming to advance LLM reasoning capabilities and foster collective intelligence.

How It Works

MARTI employs centralized multi-agent interaction with distributed policy training, managing interactions and rewards centrally while training policies distributively. Its core modules are the Multi-Agent World, Centralized Rewarding, and Single Agent Trainer. This approach facilitates scalable, adaptive workflows, combining LLM power with MAS robustness and RL learning for complex tasks.

Quick Start & Requirements

Installation requires cloning the repository (git clone https://github.com/TsinghuaC3I/MARTI.git), navigating into the directory (cd MARTI), and installing dependencies (pip install -r requirements.txt). Key prerequisites include OpenRLHF, Ray, and vLLM. Training with three Qwen2.5-3B agents necessitates a minimum of approximately 6x80GB GPUs. Example scripts for inference and training are provided.

Highlighted Details

  • Unified framework for LLM-based multi-agent inference and RL training.
  • Supports graph-based workflows (debate, chain-of-agents, mixture-of-agents) with experimental AutoGen/CAMEL integration.
  • Accommodates heterogeneous models and features built-in credit assignment/reward shaping.
  • Integrates diverse RL algorithms (PPO, GRPO, REINFORCE++, TTRL) with vLLM/Hybrid Engines.
  • Achieves state-of-the-art on reasoning benchmarks, including a 66.7 AIME score with Multi-Agent Debates.
  • Introduces asynchronous tool use and workflow support for enhanced modularity and efficiency.

Maintenance & Community

Developed by Tsinghua University and Shanghai AI Lab, MARTI is in an early experimental stage with active development. The project welcomes collaborations to advance LLM-based multi-agent RL. Key contacts include Kaiyan Zhang and Biqing Qi.

Licensing & Compatibility

The provided README does not specify a software license. Users should verify terms for integration, especially for commercial use.

Limitations & Caveats

MARTI is in a very early experimental stage, suggesting potential instability. It aims to address known LLM agent system limitations like poor role adherence and communication, but these may persist. Experimental features include third-party integrations (AutoGen, CAMEL) and generative reward models.

Health Check
Last Commit

3 days ago

Responsiveness

Inactive

Pull Requests (30d)
2
Issues (30d)
1
Star History
53 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.