GaussianCube  by GaussianCube

Research paper for 3D generative modeling using Gaussian splatting

Created 1 year ago
417 stars

Top 70.4% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

GaussianCube introduces a novel, structured, and explicit radiance representation for 3D generative modeling, addressing limitations of existing implicit or unstructured methods. It targets researchers and practitioners in 3D computer vision and graphics, enabling state-of-the-art results in unconditional, class-conditioned, and text-to-3D synthesis with significantly reduced parameter counts.

How It Works

GaussianCube first employs a densification-constrained Gaussian fitting algorithm for high-accuracy fitting with a fixed number of Gaussians. These Gaussians are then rearranged into a predefined voxel grid using Optimal Transport. This structured grid representation allows the use of standard 3D U-Net architectures in diffusion models without complex modifications, achieving high-quality representations with orders of magnitude fewer parameters than prior structured methods.

Quick Start & Requirements

  • Install: Clone the repository, create and activate a conda environment using environment.yml.
  • Prerequisites: Linux recommended, conda.
  • Models: Download checkpoints and statistics from Hugging Face (links provided for Objaverse, OmniObject3D, ShapeNet).
  • Inference: Run inference.py with specified model name and configuration.
  • Docs: Paper, Project Page, Code

Highlighted Details

  • Achieves state-of-the-art results in 3D generative modeling tasks.
  • Offers 1-2 orders of magnitude parameter reduction compared to previous structured representations for similar quality.
  • Supports text-conditioned generation, class-conditioned generation, unconditional generation, and digital avatar creation.
  • Includes scripts for mesh conversion from generated results.

Maintenance & Community

  • Codebase built upon improved-diffusion.
  • Pretrained models and inference code released.
  • Data construction and diffusion training code available.
  • Project is associated with NeurIPS 2024.

Licensing & Compatibility

  • The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

  • Mesh conversion requires installing several additional dependencies, including custom builds like nvdiffrast and diff-gaussian-rasterization.
  • Training data preparation requires specific formatting and pre-computation of statistics.
Health Check
Last Commit

9 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
13 more.

pytorch3d by facebookresearch

0.2%
10k
PyTorch3D is a PyTorch library for 3D deep learning research
Created 5 years ago
Updated 3 days ago
Feedback? Help us improve.