GaussianCube  by GaussianCube

Research paper for 3D generative modeling using Gaussian splatting

created 1 year ago
415 stars

Top 71.7% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

GaussianCube introduces a novel, structured, and explicit radiance representation for 3D generative modeling, addressing limitations of existing implicit or unstructured methods. It targets researchers and practitioners in 3D computer vision and graphics, enabling state-of-the-art results in unconditional, class-conditioned, and text-to-3D synthesis with significantly reduced parameter counts.

How It Works

GaussianCube first employs a densification-constrained Gaussian fitting algorithm for high-accuracy fitting with a fixed number of Gaussians. These Gaussians are then rearranged into a predefined voxel grid using Optimal Transport. This structured grid representation allows the use of standard 3D U-Net architectures in diffusion models without complex modifications, achieving high-quality representations with orders of magnitude fewer parameters than prior structured methods.

Quick Start & Requirements

  • Install: Clone the repository, create and activate a conda environment using environment.yml.
  • Prerequisites: Linux recommended, conda.
  • Models: Download checkpoints and statistics from Hugging Face (links provided for Objaverse, OmniObject3D, ShapeNet).
  • Inference: Run inference.py with specified model name and configuration.
  • Docs: Paper, Project Page, Code

Highlighted Details

  • Achieves state-of-the-art results in 3D generative modeling tasks.
  • Offers 1-2 orders of magnitude parameter reduction compared to previous structured representations for similar quality.
  • Supports text-conditioned generation, class-conditioned generation, unconditional generation, and digital avatar creation.
  • Includes scripts for mesh conversion from generated results.

Maintenance & Community

  • Codebase built upon improved-diffusion.
  • Pretrained models and inference code released.
  • Data construction and diffusion training code available.
  • Project is associated with NeurIPS 2024.

Licensing & Compatibility

  • The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

  • Mesh conversion requires installing several additional dependencies, including custom builds like nvdiffrast and diff-gaussian-rasterization.
  • Training data preparation requires specific formatting and pre-computation of statistics.
Health Check
Last commit

7 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
12 stars in the last 90 days

Explore Similar Projects

Starred by Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), Travis Fischer Travis Fischer(Founder of Agentic), and
3 more.

consistency_models by openai

0.0%
6k
PyTorch code for consistency models research paper
created 2 years ago
updated 1 year ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), and
3 more.

guided-diffusion by openai

0.2%
7k
Image synthesis codebase for diffusion models
created 4 years ago
updated 1 year ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Jiayi Pan Jiayi Pan(Author of SWE-Gym; AI Researcher at UC Berkeley), and
4 more.

taming-transformers by CompVis

0.1%
6k
Image synthesis research paper using transformers
created 4 years ago
updated 1 year ago
Feedback? Help us improve.