3D scene graph for language-grounded robot navigation research
Top 83.1% on sourcepulse
HOV-SG provides an official implementation for hierarchical open-vocabulary 3D scene graphs, enabling language-grounded robot navigation in complex indoor environments. It targets researchers and engineers in robotics and AI, offering a structured, compact representation of 3D scenes that surpasses dense maps in efficiency and semantic accuracy.
How It Works
HOV-SG constructs a hierarchical scene graph (floor, room, object) from RGB-D data. It leverages OpenCLIP for open-vocabulary feature extraction and SAM for class-agnostic segmentation, creating detailed, multi-level scene representations. This hierarchical approach allows for efficient storage and enables navigation across multiple floors using a cross-floor Voronoi graph.
Quick Start & Requirements
environment.yaml
), install habitat-sim
(conda install habitat-sim -c conda-forge -c aihabitat
), install HOV-SG package (pip install -e .
).Highlighted Details
Maintenance & Community
Initial release in July 2024, with updates in August 2024 adding dataset generation and evaluation code. No community links (Discord/Slack) are provided in the README.
Licensing & Compatibility
MIT license for academic usage. Commercial use requires contacting the authors.
Limitations & Caveats
The README mentions a recommendation of 128 GB RAM for compiling ground truth data, indicating a potentially high resource requirement for dataset preparation. Specific scenes are listed for evaluation, suggesting broader dataset compatibility may require further investigation.
1 week ago
Inactive