TriplaneGaussian by VAST-AI-Research

Research paper for single-view 3D reconstruction using hybrid representation

Created 2 years ago

913 stars

Top 39.8% on SourcePulse

1 Expert Loves This Project

jiamings

Chief Scientist at Luma AI

Project Summary

This project provides a fast and generalizable single-view 3D reconstruction system, targeting researchers and developers in computer vision and graphics. It enables high-quality 3D reconstruction from a single image in seconds, leveraging a novel hybrid Triplane-Gaussian representation.

How It Works

The system employs a hybrid 3D representation combining Triplane and Gaussian Splatting. Triplanes offer an efficient implicit representation, while Gaussian Splatting provides explicit, high-fidelity rendering. This fusion allows for fast inference and high-quality results by capturing both global structure and fine details. Transformers are utilized to process the input image and guide the reconstruction process.

Quick Start & Requirements

Installation: pip install -r requirements.txt (after installing PyTorch, pointnet2_ops, pytorch_scatter, and diff-gaussian-rasterization).
Prerequisites: Python >= 3.8, PyTorch >= 1.12 (tested with cu113), CUDA 11.3, pointnet2_ops, pytorch_scatter, diff-gaussian-rasterization, PyTorch3D.
Pretrained Model: Download from Hugging Face (VAST-AI/TriplaneGaussian).
Demo: Online Gradio demo available on Hugging Face Spaces. Colab notebook provided.
Links: Hugging Face Demo, Colab Demo

Highlighted Details

Achieves high-quality 3D reconstruction from single-view images in under a second.
Utilizes a novel hybrid Triplane-Gaussian 3D representation.
Compatible with graphdeco-inria/gaussian-splatting PLY format.
Supports background removal via SAM checkpoint integration.

Maintenance & Community

Official implementation of the paper "Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers".
Supported by Tsinghua University and VAST.
Code modified from SnowflakeNet for point cloud upsampling.

Licensing & Compatibility

The repository does not explicitly state a license in the README.

Limitations & Caveats

The provided pretrained model is trained only on the Objaverse-LVIS dataset.
Performance may improve with models trained on larger datasets or with more parameters.
Results can be sensitive to the cam_dist parameter, requiring tuning for optimal output.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

4 stars in the last 30 days

Explore Similar Projects

awesome-3DGS by qqqqqqy0227

Real-time 3D scene rendering and reconstruction

Created 1 year ago

Updated 1 year ago

image-sculpting by vision-x-nyu

Image editing framework using 3D geometry

Created 2 years ago

Updated 1 year ago

MVEdit by Lakonik

PyTorch code for multi-view diffusion-based 3D generation research

Created 1 year ago

Updated 1 year ago

Starred by

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind).

GaussianCube by GaussianCube

Research paper for 3D generative modeling using Gaussian splatting

Created 1 year ago

Updated 1 year ago

SCube by nv-tlabs

Scene reconstruction research paper using voxels and splats

Created 1 year ago

Updated 2 months ago

lyra by nv-tlabs

Generative 3D scene reconstruction from single inputs

Created 4 months ago

Updated 3 months ago

GaussianDreamer by hustvl

Framework for fast text-to-3D Gaussian generation

Created 2 years ago

Updated 1 year ago

GaussianObject by chensjtu

3D object reconstruction research paper using Gaussian splatting

Created 1 year ago

Updated 1 year ago

gaussian-splatting-lightning by yzslab

PyTorch Lightning framework for 3D Gaussian Splatting

Created 2 years ago

Updated 2 weeks ago

Starred by

Alberto Taiuti

Alberto Taiuti(Cofounder of Luma AI) and

Saining Xie

Saining Xie(Professor at NYU).

zero123 by cvlab-columbia

Research paper for zero-shot one image to 3D object generation

Created 2 years ago

Updated 2 years ago

Starred by

Robin Huang

Robin Huang(Cofounder of Comfy Org) and

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind).

ComfyUI-3D-Pack by MrForExample

ComfyUI node suite for 3D asset processing via cutting-edge algorithms

Created 2 years ago

Updated 1 week ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect),

Wei-Lin Chiang

Wei-Lin Chiang(Cofounder of LMArena), and

4 more.

shap-e by openai

3D object generator conditioned on text or images

Created 2 years ago

Updated 1 year ago

Feedback? Help us improve.