3D reconstruction research paper using spatial memory
Top 36.5% on sourcepulse
This project provides Spann3R, a 3D reconstruction system that leverages spatial memory for improved accuracy and handling of dynamic scenes. It is targeted at researchers and developers in computer vision and robotics working on scene reconstruction from RGB-D data. The system offers enhanced reconstruction quality and supports both static and dynamic environments.
How It Works
Spann3R employs a novel spatial memory approach, building upon established techniques like NeRF and 3D Gaussian Splatting. It integrates a learned camera pose estimation module and a robust data preprocessing pipeline. The spatial memory allows the system to efficiently store and recall scene information, leading to more consistent and detailed reconstructions, particularly in dynamic scenarios where traditional methods might struggle.
Quick Start & Requirements
conda create -n spann3r python=3.9 cmake=1.14.0
), install PyTorch with CUDA 11.8, and then pip install -r requirements.txt
. Note: Open3D version 0.16.0 has a bug; use the development version. Compile CUDA kernels for RoPE.python demo.py
.--save_ori
to save scaled intrinsics for training with ns-train splatfacto
.python app.py
for an interactive demo.Highlighted Details
Maintenance & Community
The project is actively updated, with recent additions including support for Nerfstudio and camera parameter estimation. It acknowledges support from UKRI/EPSRC and Cisco Research.
Licensing & Compatibility
The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The README notes that Habitat sequences were removed from training due to performance issues on synthetic data. The Open3D installation requires using the development version to avoid a known bug.
5 months ago
1 day