Discover and explore top open-source AI tools and projects—updated daily.
H-EmbodVisEmbodied agents for parallel task execution
Top 78.8% on SourcePulse
This project addresses the limitations of existing embodied AI task scheduling datasets by incorporating Operations Research (OR) principles and 3D spatial grounding. It introduces the ORS3D task and the GRANT model, enabling embodied agents to understand natural language instructions, ground actions in 3D environments, and optimize task execution by leveraging parallelizable subtasks to minimize completion time. The target audience includes researchers in embodied AI, robotics, and AI task planning.
How It Works
The project introduces the ORS3D task, which requires agents to perform complex, multi-step tasks in 3D environments while optimizing for efficiency. It leverages GRANT, an embodied multi-modal large language model, which incorporates a novel scheduling token mechanism. This mechanism allows the model to identify and exploit parallelizable subtasks, leading to more efficient overall task completion compared to sequential execution.
Quick Start & Requirements
conda environment management, installing specific dependencies (openblas-devel, openjdk=11, torch-scatter, peft), and compiling C++ extensions (MinkowskiEngine, pointnet2).bash scripts/train.sh and bash scripts/eval.sh.Highlighted Details
Maintenance & Community
The project is based on foundational works like Grounded 3D-LLM, SG3D, and LEO. Specific community channels or active maintenance team details are not provided in the README.
Licensing & Compatibility
The code is licensed under Apache 2.0. No specific compatibility notes for commercial use or closed-source linking are mentioned.
Limitations & Caveats
The setup process requires precise version management for Python, PyTorch, and CUDA, along with manual compilation of C++ extensions, indicating a potentially complex and fragile build environment. No explicit limitations regarding unsupported platforms, specific task complexities, or performance bounds are detailed.
1 month ago
Inactive
OpenBMB
BloopAI
microsoft
eyaltoledano