Discover and explore top open-source AI tools and projects—updated daily.
sharinka0715Open-vocabulary 3D scene understanding via Gaussian Splatting
Top 99.6% on SourcePulse
Summary
Semantic Gaussians, the official implementation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting," addresses challenges in open-vocabulary 3D scene understanding for applications like embodied agents and augmented reality. It offers a novel approach by distilling pre-trained 2D semantic features directly into 3D Gaussian Splatting representations, enabling versatile scene analysis without requiring additional NeRF training. This method provides significant improvements in semantic segmentation tasks and supports diverse downstream applications.
How It Works
The core innovation lies in distilling 2D semantic features into 3D Gaussians. A versatile projection method maps features from pre-trained image encoders into a novel semantic component of 3D Gaussians. For efficient inference, a dedicated 3D semantic network directly predicts these semantic components from raw 3D Gaussians, bypassing the need for extensive retraining. This approach leverages existing 2D vision-language models to enrich 3D scene representations.
Quick Start & Requirements
git clone --recursive), create a Conda environment (conda env create -f environment.yaml), activate it (conda activate sega), and install pip dependencies (pip install -r requirements.txt). Compilation of MinkowskiEngine is also required, with specific instructions provided for Anaconda/CUDA 11.x.Highlighted Details
Maintenance & Community
The project released its initial implementation in May 2024 and has since addressed dependency issues. No specific community channels (e.g., Discord, Slack), roadmap links, or notable contributor information are detailed in the README.
Licensing & Compatibility
The repository's license is not specified in the README, which is a critical omission for evaluating commercial use or integration into closed-source projects. CUDA is required, limiting compatibility to systems with NVIDIA GPUs.
Limitations & Caveats
The code has not been evaluated on Windows machines and may not be supported. macOS is explicitly not supported due to CUDA dependencies.
1 year ago
Inactive
microsoft