Lift 2D photos to 3D objects
Top 86.3% on sourcepulse
NeuralLift-360 addresses the challenge of reconstructing a 3D object with 360° views from a single 2D image. It is designed for researchers and developers in computer vision and graphics interested in novel view synthesis and 3D reconstruction from limited input. The project enables the creation of multi-view representations from single images, facilitating applications in virtual reality, augmented reality, and content creation.
How It Works
The project leverages a diffusion model, building upon the Stable DreamFusion codebase, to generate novel views. It incorporates depth estimation from external tools and foreground masking to guide the 3D reconstruction process. Text inversion is optionally used to improve the text embeddings for better object representation during the diffusion process. This approach allows for the generation of a 3D object from a single 2D input by iteratively refining the object's representation across multiple views.
Quick Start & Requirements
pip install -r requirements.txt
pip install gradio
https://github.com/Ir1d/image-background-remove-tool
).accelerate
and a pre-trained Stable Diffusion model (e.g., runwayml/stable-diffusion-v1-5
).[Website]
[Colab Notebook Link]
Highlighted Details
lift_ep0100_rgb.mp4
) for testing.Maintenance & Community
https://github.com/ashawkey/stable-dreamfusion
.Licensing & Compatibility
Limitations & Caveats
The Gradio App is noted to be slower than direct script execution due to on-the-fly rendering. Configuration loading is currently from pre-defined YAML files, with plans for updates. Imagic finetuning functionality is listed as "Coming soon."
1 year ago
1 week