Language-grounded NeRF for scene editing and object search
Top 49.8% on sourcepulse
LERF (Language Embedded Radiance Fields) enables users to interactively query and manipulate 3D scenes represented by Neural Radiance Fields (NeRFs) using natural language. It targets researchers and developers in computer vision and graphics who want to explore semantic control over 3D scene generation and editing. The primary benefit is enabling intuitive, text-based scene exploration and modification.
How It Works
LERF integrates with the Nerfstudio framework, extending its NeRF models with a language field. It leverages pre-trained vision-language models (like CLIP or DINO) to embed textual descriptions into the 3D scene representation. This allows for semantic querying, where specific regions or objects in the NeRF can be identified and visualized based on text prompts, facilitating a new paradigm for 3D scene interaction.
Quick Start & Requirements
tinycudann
. Clone the repo (git clone https://github.com/kerrj/lerf
) and install as a package (python -m pip install -e .
). Verify with ns-train -h
.ns-train lerf --data <data_folder>
. Connect to the viewer via the provided link.Highlighted Details
lerf-lite
for reduced memory footprint and lerf-big
with ViT-L/14 for larger models.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project has a "TODO" for integrating command-line video rendering with custom prompts. Visualization code may change as Nerfstudio features evolve, requiring users to check for updates when forking. High-resolution rendering (above 256px) may be slow.
1 year ago
1+ week