JanusVLN  by MIV-XJTU

Vision-language navigation framework with dual implicit memory

Created 2 months ago
276 stars

Top 93.9% on SourcePulse

GitHubView on GitHub
Project Summary

JanusVLN is a novel Vision-Language Navigation (VLN) framework designed to overcome 2D semantics-dominant limitations. It targets researchers and developers building embodied AI agents, enabling next-generation spatial agents through a focus on 3D spatial-semantic synergy, inspired by human cognitive processes.

How It Works

The core innovation is a dual implicit memory architecture that mimics human navigation by integrating semantic understanding (left-brain) with spatial cognition (right-brain). This design constructs two complementary, fixed-size neural memories, steering VLN research towards a crucial 3D spatial-semantic synergy for enhanced agent perception and decision-making.

Quick Start & Requirements

Installation requires cloning the repo, setting up a Python 3.9 Conda environment, and installing specific versions of habitat-sim (0.2.4) and habitat-lab (v0.2.4). PyTorch 2.5.1 with CUDA 12.4 is mandatory. Extensive data preparation involves downloading MP3D/HM3D scenes, VLN-CE episodes (R2R, RxR, ScaleVLN), and trajectory data from ModelScope. The primary install command is pip install -e . after environment setup.

Highlighted Details

  • First VLN framework featuring dual implicit memory.
  • Decouples semantics and spatiality, inspired by human cognition.
  • Provides pre-trained JanusVLN_Base and JanusVLN_Extra models.
  • Includes extensive pre-collected trajectory data for training.

Maintenance & Community

Affiliated with Amap, Alibaba Group, and Xi’an Jiaotong University. No specific community channels or active maintenance signals are detailed in the provided README.

Licensing & Compatibility

The license type and compatibility notes for commercial or closed-source use are not specified in the provided README content.

Limitations & Caveats

Recent issues reported with incorrect weights for the JanusVLN_Extra model. Setup involves complex data preparation and specific dependency versions. The project builds upon multiple other codebases.

Health Check
Last Commit

3 weeks ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
10
Star History
62 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.