MAST by multi-agent-systems-failure-taxonomy

Taxonomy for multi-agent system failures

Created 11 months ago

341 stars

Top 81.4% on SourcePulse

Project Summary

This repository provides the code and data for a study on Multi-Agent Systems (MAS) failures, introducing the MAST taxonomy. It's targeted at researchers and practitioners in AI and MAS who need to understand and mitigate common failure modes in complex agent interactions. The project offers a structured approach to analyzing MAS failures, enabling more robust system design.

How It Works

The project introduces the Multi-Agent Systems Failure Taxonomy (MAST), a framework for categorizing and analyzing failures in MAS. It leverages a dataset of annotated MAS traces, including those annotated by LLM-as-a-Judge and human annotators, to systematically identify and classify failure patterns. This data-driven approach allows for a comprehensive understanding of the root causes of MAS malfunctions.

Quick Start & Requirements

Install required libraries: pip install huggingface_hub pandas
Download dataset: Use provided Python snippets to download from Hugging Face Hub (mcemri/MAD).
Prerequisites: Python 3.x, Hugging Face Hub access.

Highlighted Details

Presents the first comprehensive study and taxonomy (MAST) of MAS challenges.
Offers a dataset with over 1,000 annotated MAS traces.
Includes traces annotated by both LLM-as-a-Judge and human annotators.
Provides a bibtex citation for the associated paper "Why Do Multi-Agent LLM Systems Fail?".

Maintenance & Community

No specific community channels or maintenance details are provided in the README.

Licensing & Compatibility

The README does not specify a license. The code and data are presented for research purposes, and citation is requested.

Limitations & Caveats

The repository focuses on failure analysis and does not provide tools for MAS development or simulation. The dataset annotation process, particularly LLM-as-a-Judge, may introduce biases or inaccuracies inherent to the models used.

MAST by multi-agent-systems-failure-taxonomy

Explore Similar Projects

Agents_Failure_Attribution by ag2ai

LLM-Agent-Based-Modeling-and-Simulation by tsinghua-fib-lab

spoon-core by XSpoonAi

Awesome-Agent-Papers by luo-junyu

trpc-agent-go by trpc-group

designing-multiagent-systems by victordibia

awesome-multi-agent-papers by kyegomez

Awesome-AI-Agents by Jenqyang

awesome-llm-powered-agent by hyp1231

lagent by InternLM

Agent-Skills-for-Context-Engineering by muratcankoylan

hive by aden-hive