Agentic framework for interactive planning using LLMs in open-world environments
Top 93.5% on sourcepulse
This project implements an interactive planning agent for open-world multi-task scenarios, specifically within Minecraft. It targets researchers and developers working on embodied AI, LLM-driven agents, and reinforcement learning, offering a framework to enable agents to understand, plan, and execute complex tasks in dynamic environments.
How It Works
The agent utilizes a "Describe, Explain, Plan and Select" (DEPS) framework, leveraging Large Language Models (LLMs) to generate task plans. It interacts with a modified Minecraft simulator, receiving observations and using LLMs to break down high-level goals into actionable steps. The system is designed for interactive planning, allowing for human feedback or LLM-driven refinement of plans.
Quick Start & Requirements
conda create -n planner python=3.9
), activating it, and installing dependencies via requirements.txt
and a Git repository.torch==2.0.0.dev20230208+cu117
with CUDA 11.7), numpy
, MineCLIP
, and a modified MC-Simulator
repository. OpenAI API keys are mandatory for LLM interaction.MC-Simulator
, preparing controller checkpoints, and configuring OpenAI API keys.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project's reliance on specific, potentially deprecated OpenAI APIs (Codex) and the need for a modified simulator fork indicate potential maintenance challenges and compatibility issues with current OpenAI offerings. The lack of explicit community support or recent updates raises concerns about its long-term viability.
2 years ago
1+ week