3D scene editor for interactive manipulation via LLM-driven chat
Top 86.0% on sourcepulse
This project provides an interactive 3D scene editing framework, CE3D, that leverages large language models (LLMs) and a suite of over 20 visual models to enable users to modify 3D scenes using text prompts. It is designed for researchers and developers working with 3D content creation and manipulation, offering a ChatGPT-like interface for intuitive scene editing.
How It Works
CE3D integrates multiple specialized AI models for tasks such as segmentation, image captioning, text-to-image generation, visual question answering, and depth estimation. These models work in concert, orchestrated by an LLM, to interpret user text commands and apply corresponding edits to a 3D scene. The framework supports a modular approach, allowing for flexible configuration of the required visual models based on available hardware resources.
Quick Start & Requirements
pip install -r requirements.txt
and specific dependencies like tiny-cuda-nn
.export OPENAI_API_KEY={Your_Private_Openai_Key}
followed by make run-all
or make run-small-instruct
.Highlighted Details
Maintenance & Community
The project is associated with ECCV2024 and lists authors from various institutions. Further updates and planned features are indicated by "heartbeat" emojis in the README, suggesting active development.
Licensing & Compatibility
The repository does not explicitly state a license in the provided README. Users should verify licensing for commercial use or integration into closed-source projects.
Limitations & Caveats
The full functionality requires substantial GPU resources (100GB). While a smaller version exists, its editing capabilities are limited. The project is actively being developed, with plans for 4D scene support and local LLM integration, indicating potential for ongoing changes and API evolution.
1 month ago
1 day