3D scene understanding research paper using open-vocabulary queries
OpenScene provides a zero-shot framework for 3D scene understanding using open-vocabulary queries, enabling tasks like semantic segmentation, rare object search, and image-based object detection. It targets researchers and practitioners in 3D computer vision who need flexible scene analysis beyond predefined categories.
How It Works
OpenScene leverages multi-view 2D image features fused onto 3D point clouds. This approach combines the rich semantic understanding of large-scale 2D vision models (like OpenSeg or LSeg) with the geometric context of 3D data. By projecting 2D features onto 3D points, it achieves open-vocabulary capabilities, allowing queries based on arbitrary text descriptions, properties, or activities.
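The fusion-and-query idea described above can be illustrated with a minimal NumPy sketch. This is not OpenScene's actual code: the function names `fuse_multiview_features` and `query_points`, the pinhole camera model, nearest-pixel sampling, plain averaging across views, and the absence of any occlusion test are all assumptions made for illustration. In practice the per-pixel features and text embeddings would come from a 2D vision-language model such as OpenSeg or LSeg; here they are toy arrays.

```python
import numpy as np


def fuse_multiview_features(points, feature_maps, intrinsics, world_to_cams):
    """Average per-pixel 2D features over every view in which a 3D point projects.

    points:        (N, 3) world-space coordinates
    feature_maps:  list of (H, W, C) per-pixel feature arrays, one per view
    intrinsics:    (3, 3) pinhole matrix K (assumed shared by all views)
    world_to_cams: list of (4, 4) world-to-camera extrinsics, one per view
    Returns (fused, seen): (N, C) features and an (N,) bool mask of covered points.
    Hypothetical helper; no occlusion handling.
    """
    n = points.shape[0]
    c = feature_maps[0].shape[2]
    summed = np.zeros((n, c))
    counts = np.zeros(n)
    homog = np.hstack([points, np.ones((n, 1))])  # (N, 4) homogeneous coords

    for fmap, w2c in zip(feature_maps, world_to_cams):
        h, w, _ = fmap.shape
        cam = (w2c @ homog.T).T[:, :3]            # points in camera frame
        z = cam[:, 2]
        in_front = z > 1e-6                        # discard points behind the camera
        safe_z = np.where(in_front, z, 1.0)        # avoid division by zero
        pix = (intrinsics @ cam.T).T               # (fx*x + cx*z, fy*y + cy*z, z)
        u = np.round(pix[:, 0] / safe_z).astype(int)
        v = np.round(pix[:, 1] / safe_z).astype(int)
        visible = in_front & (u >= 0) & (u < w) & (v >= 0) & (v < h)
        summed[visible] += fmap[v[visible], u[visible]]
        counts[visible] += 1

    fused = np.zeros_like(summed)
    seen = counts > 0
    fused[seen] = summed[seen] / counts[seen, None]
    return fused, seen


def query_points(point_features, text_embeddings):
    """Assign each point the index of the most similar text embedding (cosine)."""
    pf = point_features / (np.linalg.norm(point_features, axis=1, keepdims=True) + 1e-8)
    tf = text_embeddings / (np.linalg.norm(text_embeddings, axis=1, keepdims=True) + 1e-8)
    similarity = pf @ tf.T                         # (N, num_queries)
    return similarity.argmax(axis=1), similarity


# Toy example: one 2x2 view with 2-channel features, camera at the origin.
fmap = np.zeros((2, 2, 2))
fmap[0, 0] = [1.0, 0.0]   # pixel whose feature resembles query 0
fmap[0, 1] = [0.0, 1.0]   # pixel whose feature resembles query 1
points = np.array([[0.0, 0.0, 1.0], [1.0, 0.0, 1.0], [0.0, 0.0, -1.0]])
fused, seen = fuse_multiview_features(points, [fmap], np.eye(3), [np.eye(4)])
labels, _ = query_points(fused, np.array([[1.0, 0.0], [0.0, 1.0]]))
print(labels[seen])
```

Because the fused point features live in the same embedding space as the text queries, the set of queries can be changed at inference time without retraining, which is what makes the segmentation open-vocabulary. Points that no view covers (the third point in the toy example) get a zero feature and should be masked out with `seen` rather than trusted.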
Quick Start & Requirements
Installation instructions are in installation.md.
Data is downloaded with scripts/download_dataset.sh and scripts/download_fused_features.sh. These downloads can be substantial (e.g., 234.8G for ScanNet features).
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats