Research code for ETPNav: vision-language navigation in continuous environments
ETPNav addresses vision-language navigation (VLN) in continuous environments, a challenging task requiring agents to plan long-range paths and avoid obstacles based on visual observations and textual instructions. It targets researchers and engineers working on embodied AI and robotics, offering a robust framework that improves on prior state-of-the-art methods by roughly 10% to 20% on benchmark datasets.
How It Works
ETPNav employs a two-stage approach: topological planning and obstacle-avoiding control. It constructs an online topological map by self-organizing waypoints encountered during traversal, enabling high-level planning independent of prior environmental knowledge. A transformer-based cross-modal planner generates navigation sequences from this map and instructions. Low-level control is handled by a trial-and-error heuristic to avoid static and dynamic obstacles. This modular design separates planning from control, enhancing robustness and adaptability.
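The split described above can be illustrated with a minimal Python sketch. Everything here is a hypothetical stand-in: `TopoMap`, `plan_subgoal`, and `tryout_control` are invented names, a random score replaces the actual transformer-based cross-modal planner, and the collision check is simulated. It shows the planning/control interface, not ETPNav's implementation.

```python
import random
from dataclasses import dataclass, field

@dataclass
class TopoMap:
    """Online topological map: nodes are self-organized waypoints (hypothetical)."""
    nodes: list = field(default_factory=list)   # waypoint positions (x, y)
    edges: dict = field(default_factory=dict)   # adjacency: node index -> neighbors

    def add_waypoints(self, observed, merge_radius=1.0):
        """Self-organize: only keep a waypoint if no existing node is nearby."""
        for wp in observed:
            near = [i for i, n in enumerate(self.nodes)
                    if (n[0] - wp[0]) ** 2 + (n[1] - wp[1]) ** 2 < merge_radius ** 2]
            if not near:
                self.nodes.append(wp)
                self.edges.setdefault(len(self.nodes) - 1, set())

def plan_subgoal(topo_map, instruction):
    """Stand-in for the cross-modal planner: score each map node against the
    instruction and pick the best. A random score replaces real attention."""
    scores = [random.random() for _ in topo_map.nodes]
    return max(range(len(scores)), key=scores.__getitem__)

def tryout_control(agent_pos, subgoal, step=0.25, max_tries=4):
    """Trial-and-error low-level controller: attempt a step toward the subgoal;
    on a (simulated) collision, perturb the target and retry."""
    for _ in range(max_tries):
        dx, dy = subgoal[0] - agent_pos[0], subgoal[1] - agent_pos[1]
        blocked = random.random() < 0.3   # simulated obstacle check
        if not blocked:
            norm = max((dx ** 2 + dy ** 2) ** 0.5, 1e-6)
            return (agent_pos[0] + step * dx / norm, agent_pos[1] + step * dy / norm)
        subgoal = (subgoal[0] + random.uniform(-0.5, 0.5),
                   subgoal[1] + random.uniform(-0.5, 0.5))
    return agent_pos  # give up this step

# One navigation step: map -> plan -> control
m = TopoMap()
m.add_waypoints([(0.0, 0.0), (1.2, 0.4), (2.5, 1.0)])
goal = plan_subgoal(m, "walk past the sofa and stop at the door")
pos = tryout_control((0.0, 0.0), m.nodes[goal])
print("moved to", pos)
```

The point of the split is that collisions are absorbed entirely by the low-level loop, so the planner can reason over the graph as if it were obstacle-free.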
Quick Start & Requirements
gym==0.21.0 is a critical dependency; see requirements.txt for the full dependency list.
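Since the pinned gym version is easy to get wrong, a small sanity check at startup can catch a mismatch early. This is a generic sketch, not part of the ETPNav codebase; it assumes only that gym exposes `__version__`:

```python
import gym

# The setup expects exactly gym 0.21.0 (compatible with Habitat-lab v0.1.7).
if gym.__version__ != "0.21.0":
    raise RuntimeError(
        f"Found gym {gym.__version__}; install the pinned version with "
        "`pip install gym==0.21.0` (see requirements.txt)."
    )
```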
Highlighted Details
Maintenance & Community
The project is associated with authors from multiple institutions. Contact information for key contributors is provided. The README mentions inspirations from CWP, Sim2Sim, and DUET.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project is marked as "Official repo" and has achieved significant benchmark improvements, suggesting a stable implementation. However, specific details on community support, ongoing maintenance, or potential deprecations are not provided in the README. The setup requires a specific, older version of gym (0.21.0) due to compatibility issues with Habitat-lab v0.1.7.
Last updated 4 months ago; repository activity is marked inactive.