MVEdit by Lakonik

PyTorch code for multi-view diffusion-based 3D generation research

Created 1 year ago

342 stars

Top 80.8% on SourcePulse

Project Summary

This repository provides the official PyTorch implementation for "3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation" and "Generic 3D Diffusion Adapter Using Controlled Multi-View Editing." It enables high-quality 3D asset generation and editing through controlled multi-view diffusion, targeting researchers and developers in computer vision and graphics.

How It Works

The project uses multi-view diffusion models to achieve geometry-consistent 3D generation. It acts as an adapter that plugs into existing diffusion pipelines and guides generation across multiple views, so the resulting 3D assets stay spatially coherent and high in fidelity. The optimization-based adapter variant relies on off-the-shelf models and requires no additional training.
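The idea of guiding several views toward a shared geometry can be illustrated with a toy sketch. This is not the repository's implementation: `fuse_views`, `adapter_step`, `run`, and `spread` are hypothetical names, each "view" is just a short list of numbers, and element-wise averaging stands in for 3D reconstruction and re-rendering. A real pipeline would also run a per-view diffusion denoising step between fusions, which is omitted here.

```python
import statistics


def fuse_views(views):
    """Toy stand-in for 3D reconstruction: element-wise mean over all views."""
    return [statistics.fmean(column) for column in zip(*views)]


def adapter_step(views, weight):
    """Blend every view toward the shared estimate (toy stand-in for re-rendering).

    weight=0 leaves the views untouched; weight=1 replaces them with the shared estimate.
    """
    shared = fuse_views(views)
    return [
        [(1.0 - weight) * v + weight * s for v, s in zip(view, shared)]
        for view in views
    ]


def run(views, steps=3, weight=0.5):
    """Apply the blending step repeatedly, as a denoising loop would at every timestep."""
    for _ in range(steps):
        views = adapter_step(views, weight)
    return views


def spread(views):
    """Largest per-element standard deviation across views (0 means fully consistent)."""
    return max(statistics.pstdev(column) for column in zip(*views))


if __name__ == "__main__":
    initial = [[0.0, 4.0], [2.0, 0.0], [4.0, 2.0]]  # three inconsistent toy "views"
    final = run(initial)
    print("spread before:", spread(initial))
    print("spread after: ", spread(final))
```

In this toy setting each step shrinks the disagreement between views by a factor of `1 - weight` and leaves the shared estimate unchanged. The sketch conveys only the general idea of feeding a shared estimate back into each view, not the behaviour of the actual models.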

Quick Start & Requirements

Installation: Clone the repository and install dependencies with pip install -r requirements.txt. A conda environment with Python 3.10, PyTorch 2.1.2, and CUDA 12.1 is recommended.
Prerequisites: Linux (Ubuntu 20+), CUDA Toolkit 11.8+, PyTorch 2.1+. FFmpeg and x264 are optional and only needed for video export. Windows is also supported, though some packages such as tiny-cuda-nn may need adjusted installation steps.
Inference: Run python app.py --unload-models to start the Gradio Web UI. A GPU with at least 24 GB of VRAM is required.
Resources: The first run downloads pretrained model weights, which can be large.
Links: Project page, Demo, Paper.
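The requirements above can be checked before launching with a small preflight script. This is a minimal sketch and not part of the repository: `check_environment`, `build_launch_command`, and `MIN_VRAM_GB` are hypothetical names, and the script assumes it is run from the repository root, where app.py lives. It only uses standard-library calls plus `torch.cuda.is_available` and `torch.cuda.get_device_properties` when PyTorch is installed.

```python
import shutil
import sys

MIN_VRAM_GB = 24  # requirement stated in the summary above


def check_environment():
    """Collect findings about the local setup without raising if PyTorch is missing."""
    report = {
        "python": f"{sys.version_info.major}.{sys.version_info.minor}",
        # Python 3.10 is the recommended version, not a hard requirement.
        "python_recommended": sys.version_info[:2] == (3, 10),
        # FFmpeg is optional and only used for video export.
        "ffmpeg_found": shutil.which("ffmpeg") is not None,
        "torch": None,
        "cuda_available": False,
        "vram_gb": None,
    }
    try:
        import torch
    except Exception:  # any import failure is treated as "PyTorch not usable"
        torch = None
    if torch is not None:
        report["torch"] = torch.__version__
        if torch.cuda.is_available():
            report["cuda_available"] = True
            total_bytes = torch.cuda.get_device_properties(0).total_memory
            report["vram_gb"] = total_bytes / 1024**3
    # Reported memory is usually slightly below the nominal size, so round first.
    vram = report["vram_gb"]
    report["vram_ok"] = vram is not None and round(vram) >= MIN_VRAM_GB
    return report


def build_launch_command(unload_models=True):
    """Build the Web UI launch command from the summary as an argument list."""
    command = [sys.executable, "app.py"]
    if unload_models:
        command.append("--unload-models")
    return command


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
    print("launch with:", " ".join(build_launch_command()))
```

The command list can be passed to `subprocess.run` once the checks look reasonable. The script only reports findings and does not install or modify anything.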

Highlighted Details

Implements geometry-consistent multi-view diffusion for high-quality 3D generation.
Offers a Gradio Web UI for interactive inference, along with API access.
Builds on several existing codebases, including SSDNeRF, Stable-DreamFusion, and Gaussian Splatting.
Integrates with Zero123++, IP-Adapter, TRACER, LoFTR, and Omnidata for enhanced capabilities.

Maintenance & Community

The project is associated with Stanford University, Apparate Labs, and UCSD. GRM-based 3D-Adapter models are pending release alongside GRM.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

GRM-based 3D-Adapters are not yet released. Certain packages may require specific configuration for Windows installation. API documentation may contain inaccuracies in data types and default values.
