GaussianDreamer  by hustvl

Framework for fast text-to-3D Gaussian generation

Created 1 year ago
788 stars

Top 44.5% on SourcePulse

GitHubView on GitHub
Project Summary

GaussianDreamer is a framework for fast text-to-3D Gaussian splatting generation, targeting researchers and developers in 3D content creation. It bridges the strengths of 2D and 3D diffusion models to produce high-quality, real-time renderable 3D assets from text prompts, significantly reducing generation time compared to prior methods.

How It Works

GaussianDreamer leverages a hybrid approach, using a 3D diffusion model for initial priors and a 2D diffusion model for refining geometry and appearance. It introduces novel operations like noisy point growing and color perturbation to enhance the initialized Gaussian representations, achieving a balance between 3D consistency and high-fidelity generation.

Quick Start & Requirements

  • Installation: Requires PyTorch with CUDA 11.7, Ninja, and cloning the repository. Dependencies include shap-e and specific submodules for Gaussian rasterization and kNN.
  • Prerequisites: Python 3.x, GPU with CUDA 11.7. A finetuned Shap-E model is required.
  • Demo: Huggingface and Colab demos are available.
  • Docs: Project page and arXiv paper provide detailed information.

Highlighted Details

  • Generates high-quality 3D instances or avatars within 15 minutes on a single GPU.
  • Achieves competitive results on ViT similarity and T3 Bench benchmarks.
  • Supports integration with Unity game engine via UnityGaussianSplatting.
  • Offers extensions for threestudio.

Maintenance & Community

The project is associated with CVPR 2024 and has seen recent updates, including the release of GaussianDreamerPro. It acknowledges contributions from other open-source projects.

Licensing & Compatibility

The repository's licensing is not explicitly stated in the README, but it mentions borrowing code from other projects, implying potential licensing considerations for commercial use.

Limitations & Caveats

The initial code release may contain issues, as noted by the authors. The specific licensing for commercial use is not detailed, which could be a factor for adoption.

Health Check
Last Commit

8 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
7 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Luis Capelo Luis Capelo(Cofounder of Lightning AI), and
6 more.

threestudio by threestudio-project

0.2%
7k
Framework for 3D content generation from text/images using 2D diffusion
Created 2 years ago
Updated 9 months ago
Starred by Yaowei Zheng Yaowei Zheng(Author of LLaMA-Factory), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
13 more.

stable-dreamfusion by ashawkey

0.1%
9k
Text-to-3D model using NeRF and diffusion
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.