Research paper and code for robot navigation using visual-language spatial mapping
Top 60.3% on sourcepulse
VLMaps enables robots to navigate using natural language commands by fusing pre-trained visual-language model features into 3D reconstructions of the environment. This approach allows zero-shot spatial goal navigation and landmark localization without additional data collection or model fine-tuning. The project is aimed at robotics researchers and developers.
How It Works
VLMaps builds spatial maps by integrating visual-language features from pre-trained models into a 3D reconstruction of the environment. Anchoring these features spatially enables natural-language indexing, allowing robots to understand and act on text-based navigation goals. The system uses the Matterport3D dataset and the Habitat simulator to generate and test these maps.
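To make the natural-language indexing concrete, the sketch below scores every cell of a top-down feature map against a text query embedding and picks the best match as a navigation goal. It is a minimal illustration with placeholder arrays, not the repository's code; in practice the cell features and the text embedding come from the same pre-trained visual-language model.

```python
# Minimal sketch of natural-language indexing over a VLMap-style feature grid.
# The feature grid and text embedding are random placeholders; in the real
# pipeline they come from a pre-trained visual-language model fused into the map.
import numpy as np

H, W, C = 200, 200, 512                 # top-down map resolution and feature dimension
cell_feats = np.random.randn(H, W, C)   # placeholder for fused visual-language features
text_embed = np.random.randn(C)         # placeholder for the embedding of e.g. "sofa"

# Cosine similarity between every map cell and the language query.
cell_norm = cell_feats / np.linalg.norm(cell_feats, axis=-1, keepdims=True)
text_norm = text_embed / np.linalg.norm(text_embed)
scores = cell_norm @ text_norm          # (H, W) relevance heatmap

# The highest-scoring cell becomes a candidate goal for "navigate to the sofa".
row, col = np.unravel_index(np.argmax(scores), scores.shape)
print("candidate goal cell:", (row, col))
```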
Quick Start & Requirements
Create and activate the conda environment, then run the install script:

conda create -n vlmaps python=3.8
conda activate vlmaps
bash install.bash

To try the demo, check out the demo branch and open the notebook:

git checkout demo
jupyter notebook demo.ipynb
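The demo relies on Habitat and Matterport3D assets. Below is a minimal, hypothetical sketch of loading a Matterport3D scene in habitat-sim; the scene path is a placeholder and attribute names vary slightly across habitat-sim versions, so treat it as an assumption rather than the repository's setup code.

```python
# Hypothetical sketch of loading a Matterport3D scene in habitat-sim for the demo.
# The scene path is a placeholder and attribute names vary across habitat-sim
# versions, so this is an assumption, not the repository's setup code.
import habitat_sim

backend_cfg = habitat_sim.SimulatorConfiguration()
backend_cfg.scene_id = "data/mp3d/<scene>/<scene>.glb"  # placeholder Matterport3D scene

rgb_spec = habitat_sim.CameraSensorSpec()               # one RGB camera on the agent
rgb_spec.uuid = "rgb"
rgb_spec.sensor_type = habitat_sim.SensorType.COLOR

agent_cfg = habitat_sim.agent.AgentConfiguration()
agent_cfg.sensor_specifications = [rgb_spec]

sim = habitat_sim.Simulator(habitat_sim.Configuration(backend_cfg, [agent_cfg]))
obs = sim.reset()                                       # dict keyed by sensor uuid
print(obs["rgb"].shape)
sim.close()
```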
Highlighted Details
Maintenance & Community
The project is associated with ICRA 2023 and seeks community contributions to improve the navigation stack.
Licensing & Compatibility
MIT License, permitting commercial use and integration with closed-source systems.
Limitations & Caveats
The current navigation stack's reliance on a covisibility graph built from obstacle maps can lead to navigation issues in complex environments. The project is seeking community contributions to address these limitations and integrate with real-world robot sensors.
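For intuition, the style of planning involved, a graph derived from a 2-D obstacle map, can be illustrated with a toy grid and a breadth-first search over free cells. This is only an illustration with a made-up map, not the repository's covisibility-graph implementation.

```python
# Toy illustration of planning over a graph derived from a 2-D obstacle map
# (not the repository's implementation); cells marked 1 are obstacles.
from collections import deque

import numpy as np

obstacle_map = np.array([
    [0, 0, 0, 1, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 0, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
])

def bfs_path(grid, start, goal):
    """Breadth-first search over free cells, 4-connected neighbourhood."""
    rows, cols = grid.shape
    parents = {start: None}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            # Walk parent pointers back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parents[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr, nc] == 0 \
                    and (nr, nc) not in parents:
                parents[(nr, nc)] = cell
                queue.append((nr, nc))
    return None  # goal unreachable through free space

print(bfs_path(obstacle_map, start=(0, 0), goal=(4, 4)))
```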
Last updated about a year ago; the repository is currently inactive.