MineDojo  by MineDojo

AI research framework for embodied agents in Minecraft

created 3 years ago
1,998 stars

Top 22.5% on sourcepulse

GitHubView on GitHub
Project Summary

MineDojo is an AI research framework for developing open-ended, generally capable embodied agents within a Minecraft simulation. It offers a vast benchmark suite with over 3,000 diverse tasks, ranging from programmatic challenges to creative building and full game progression, leveraging internet-scale knowledge bases derived from YouTube, Wikipedia, and Reddit to enhance agent learning.

How It Works

MineDojo provides a Gym-style interface for agent interaction, featuring multimodal observations (RGB, compass, voxels) and a compound action space. Agents can learn from a massive dataset of 730K YouTube videos, 7K Wiki pages, and 340K Reddit posts, enabling them to acquire skills and strategies from human player data. The framework supports programmatic task scoring and offers a unique "playthrough" task to defeat the Ender Dragon.

Quick Start & Requirements

  • Install via pip: pip install minedojo
  • Requires Python ≥ 3.9. Tested on Ubuntu 20.04 and Mac OS X.
  • Prerequisite: JDK 8 for Minecraft backend.
  • A Docker image is available for easier installation.
  • Verification script: python minedojo/scripts/validate_install.py (use xvfb-run or MINEDOJO_HEADLESS=1 for headless environments).
  • Documentation: Docs

Highlighted Details

  • Outstanding Paper award at NeurIPS.
  • Features 3142 tasks categorized into Programmatic (Survival, Harvest, Tech Tree, Combat), Creative, and Playthrough.
  • Includes MineCLIP reward model and agent code.
  • Open-sourced creative task labeling UI for custom task curation.
  • Internet-scale knowledge base includes 730K YouTube videos, 7K Wiki pages, and 340K Reddit posts.

Maintenance & Community

  • Project team includes researchers from various institutions.
  • Website and Arxiv Paper available.

Licensing & Compatibility

  • Codebase: MIT License.
  • YouTube Database: CC BY 4.0.
  • Wiki Database: CC BY-NC-SA 3.0.
  • Reddit Database: CC BY 4.0.
  • Licenses permit commercial use for code and YouTube/Reddit data, but Wiki data is non-commercial.

Limitations & Caveats

The Wiki database has a non-commercial use restriction. While the framework is extensive, achieving complex tasks may require significant computational resources and agent training time.

Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
61 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.