image-sculpting by vision-x-nyu

Image editing framework using 3D geometry

Created 2 years ago

298 stars

Top 89.4% on SourcePulse

Project Summary

Image Sculpting offers a novel framework for precise 2D image editing by leveraging 3D geometry. It targets researchers and artists seeking granular control over object manipulation, moving beyond ambiguous text-based edits to enable direct interaction with 3D models derived from single images.

How It Works

The core approach converts 2D objects into editable 3D representations. This allows for direct manipulation of pose, rotation, translation, and composition. After editing, the 3D models are re-rendered into the 2D image, with a coarse-to-fine enhancement process ensuring high-fidelity integration. This hybrid method combines generative model flexibility with the precision of traditional graphics pipelines.

Quick Start & Requirements

Install: Clone the repository, create a virtual environment, and install dependencies using pip install -r requirements.txt. PyTorch with CUDA 11.8 is required.
Prerequisites: NVIDIA RTX 4090 with CUDA 12.0 recommended. Background removal (e.g., Clipdrop) and Zero-1-to-3 XL model weights are needed for custom data.
Resources: Download provided reconstructed meshes from Google Drive. Setup for custom data involves 3D reconstruction using Zero-1-to-3 and potentially DreamBooth fine-tuning.
Links: Project Page, Video, Paper.

Highlighted Details

Enables precise editing operations like pose, rotation, translation, carving, and serial addition.
Integrates 3D geometry control with generative models for high-fidelity results.
Supports re-rendering and texture enhancement via DreamBooth fine-tuning.
Leverages Zero-1-to-3 for single-image 3D reconstruction.

Maintenance & Community

The project is associated with New York University and Intel Labs. Further community or maintenance details are not explicitly provided in the README.

Licensing & Compatibility

License: MIT License.
Compatibility: Permissive license suitable for commercial use and integration into closed-source projects.

Limitations & Caveats

The README notes that while other deformation methods are possible, using bones is recommended for intuitive, physics-aware editing. Successful 3D reconstruction from single images may require careful preprocessing, including recentering and scaling.

image-sculpting by vision-x-nyu

Explore Similar Projects

awesome-3DGS by qqqqqqy0227

MVEdit by Lakonik

LLaVA-3D by ZCMax

WonderWorld by KovenYu

lyra by nv-tlabs

GaussianDreamer by hustvl

GaussianObject by chensjtu

stable-diffusion-webui-depthmap-script by thygate

Hunyuan3D-2.1 by Tencent-Hunyuan

stable-dreamfusion by ashawkey

shap-e by openai

Hunyuan3D-2 by Tencent-Hunyuan