vlmaps  by vlmaps

Spatial mapping research paper for robot navigation using visual language

created 2 years ago
534 stars

Top 60.3% on sourcepulse

GitHubView on GitHub
Project Summary

VLMaps enables robots to navigate using natural language commands by fusing pre-trained visual-language model features into 3D reconstructions of the environment. This approach allows for zero-shot spatial goal navigation and landmark localization without additional data collection or model fine-tuning, targeting robotics researchers and developers.

How It Works

VLMaps represents spatial maps by integrating visual-language features from pre-trained models into a 3D reconstruction. This spatial anchoring of features enables natural language indexing, allowing robots to understand and act upon text-based navigation goals. The system leverages Matterport3D dataset and Habitat simulator for generating and testing these maps.

Quick Start & Requirements

  • Install via conda create -n vlmaps python=3.8 and conda activate vlmaps, followed by bash install.bash.
  • Requires Python 3.8, Jupyter, and potentially the Matterport3D dataset.
  • Demo available at git checkout demo and running jupyter notebook demo.ipynb.
  • Official docs and dataset generation details are provided within the README.

Highlighted Details

  • Enables zero-shot spatial goal navigation using natural language.
  • Fuses visual-language features into 3D reconstructions.
  • Supports both object-goal and spatial-goal navigation tasks.
  • Provides detailed instructions for dataset generation and custom data integration.

Maintenance & Community

The project is associated with ICRA2023 and seeks community contributions for improving the navigation stack.

Licensing & Compatibility

MIT License, permitting commercial use and integration with closed-source systems.

Limitations & Caveats

The current navigation stack's reliance on a covisibility graph built from obstacle maps can lead to navigation issues in complex environments. The project is seeking community contributions to address these limitations and integrate with real-world robot sensors.

Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
67 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.