CoELA  by UMass-Embodied-AGI

Research paper source code for cooperative embodied agents using LLMs

Created 2 years ago
270 stars

Top 95.2% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides the source code for the ICLR 2024 paper "Building Cooperative Embodied Agents Modularly with Large Language Models." It enables the creation of modular, LLM-powered embodied agents capable of complex cooperative tasks in simulated environments, targeting researchers and developers in embodied AI and multi-agent systems.

How It Works

The project leverages large language models (LLMs) to control embodied agents, breaking down complex tasks into modular components. Agents interact within simulated environments like ThreeDWorld Multi-Agent Transport (TDW-MAT) and Communicative Watch-And-Help (C-WAH), utilizing LLMs for decision-making, communication, and task execution. This modular LLM-driven approach facilitates emergent cooperative behaviors and simplifies the development of sophisticated embodied AI.

Quick Start & Requirements

  • TDW-MAT: Requires Python 3.9, PyTorch, and TDW platform setup. Installation involves pip install -e . within the tdw_mat directory. A demo scene can be run with python demo/demo_scene.py.
  • C-WAH: Requires Python 3.8, VirtualHome simulator, and API. Installation involves cloning the VirtualHome repository, downloading the simulator, and running pip install -r requirements.txt within the cwah directory.
  • Dependencies: Specific LLM models (e.g., GPT-4) are used for agent control, requiring API access. TDW setup may need an X server on remote Linux machines.

Highlighted Details

  • Implements two distinct multi-agent environments: TDW-MAT for object transport and C-WAH for household chores.
  • TDW-MAT features configurable settings (Enough Container vs. Rare Container) and metrics like Transport Rate (TR) and Efficiency Improvements (EI).
  • C-WAH supports five household tasks with predicate-based subgoals and metrics like Average Steps (L) and EI.
  • Includes example scripts for running experiments with baseline and CoELA agents.

Maintenance & Community

The project is associated with UMass-Embodied-AGI and has recent updates addressing navigation issues in TDW-MAT. The paper was published in ICLR 2024.

Licensing & Compatibility

The repository itself does not explicitly state a license. The included VirtualHome repository is under the MIT license. Compatibility for commercial use depends on the licenses of the underlying LLMs and simulators used.

Limitations & Caveats

The setup for both environments can be complex, requiring specific simulator versions and configurations. The reliance on external LLM APIs (like GPT-4) introduces potential costs and dependencies. The project is presented as research code, and stability for production use is not guaranteed.

Health Check
Last Commit

5 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
5 stars in the last 30 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
4 more.

chatarena by Farama-Foundation

0%
2k
Multi-agent environment for LLM research
Created 2 years ago
Updated 1 month ago
Starred by Andrew Ng Andrew Ng(Founder of DeepLearning.AI; Cofounder of Coursera; Professor at Stanford), Thomas Wolf Thomas Wolf(Cofounder of Hugging Face), and
4 more.

ag2 by ag2ai

0.9%
4k
AgentOS for building AI agents and facilitating multi-agent cooperation
Created 10 months ago
Updated 2 days ago
Feedback? Help us improve.