LION by nv-tlabs

Code for the research paper on latent point diffusion models for 3D shape generation

created 2 years ago
806 stars

Top 44.7% on sourcepulse

Project Summary

LION addresses the generation of 3D shapes using latent point diffusion models, targeting researchers and practitioners in computer graphics and AI. It offers a novel approach to 3D shape synthesis by leveraging diffusion models in a latent space, enabling high-quality and diverse point cloud generation.

How It Works

LION employs a two-stage process: first, a Variational Autoencoder (VAE) learns a compressed latent representation of 3D point clouds. Second, a diffusion model is trained in this latent space to generate new latent codes, which are then decoded by the VAE to produce 3D point clouds. This latent diffusion approach allows for efficient and high-fidelity generation compared to direct diffusion in point cloud space.
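The snippet below is a minimal, self-contained PyTorch sketch of that two-stage idea, not the actual LION code: a toy VAE maps point clouds to a compact latent vector, and a DDPM-style sampler denoises Gaussian noise in that latent space before decoding back to points. All module names (PointVAE, LatentDenoiser, sample_shapes), shapes, and hyperparameters are illustrative assumptions, and the training loops for both stages are omitted.

```python
# Illustrative sketch of the two-stage latent diffusion pipeline, NOT LION's
# actual architecture. Names, shapes, and the simple DDPM sampler are assumptions.
import torch
import torch.nn as nn

LATENT_DIM, NUM_POINTS, T = 128, 2048, 1000

class PointVAE(nn.Module):
    """Toy stage-1 VAE: point cloud -> latent vector -> point cloud."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(NUM_POINTS * 3, 512), nn.ReLU(),
                                     nn.Linear(512, 2 * LATENT_DIM))
        self.decoder = nn.Sequential(nn.Linear(LATENT_DIM, 512), nn.ReLU(),
                                     nn.Linear(512, NUM_POINTS * 3))

    def encode(self, x):
        # x: (batch, NUM_POINTS, 3) -> mean and log-variance of the latent
        mu, logvar = self.encoder(x.flatten(1)).chunk(2, dim=-1)
        return mu, logvar

    def decode(self, z):
        return self.decoder(z).view(-1, NUM_POINTS, 3)

class LatentDenoiser(nn.Module):
    """Toy stage-2 noise-prediction network operating on VAE latents."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(LATENT_DIM + 1, 256), nn.ReLU(),
                                 nn.Linear(256, LATENT_DIM))

    def forward(self, z_t, t):
        t_embed = t.float().view(-1, 1) / T
        return self.net(torch.cat([z_t, t_embed], dim=-1))

@torch.no_grad()
def sample_shapes(vae, denoiser, n=4):
    """Denoise Gaussian noise in latent space, then decode with the VAE."""
    betas = torch.linspace(1e-4, 0.02, T)
    alphas = 1.0 - betas
    alpha_bars = torch.cumprod(alphas, dim=0)
    z = torch.randn(n, LATENT_DIM)
    for t in reversed(range(T)):
        eps = denoiser(z, torch.full((n,), t))
        coef = betas[t] / torch.sqrt(1.0 - alpha_bars[t])
        z = (z - coef * eps) / torch.sqrt(alphas[t])
        if t > 0:
            z = z + torch.sqrt(betas[t]) * torch.randn_like(z)
    return vae.decode(z)  # (n, NUM_POINTS, 3) point clouds

points = sample_shapes(PointVAE(), LatentDenoiser())
print(points.shape)  # torch.Size([4, 2048, 3])
```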

Quick Start & Requirements

  • Install via conda: conda env create --name lion_env --file=env.yaml followed by conda activate lion_env.
  • Additional dependencies: pip install git+https://github.com/openai/CLIP.git.
  • Requires CUDA 11.6.
  • Setup involves downloading ShapeNet data and released checkpoints.
  • Demo: python demo.py (requires checkpoint download).
  • Official Docs: Not explicitly linked, but the README provides detailed setup and training instructions.

Highlighted Details

  • Latent Point Diffusion Models for 3D Shape Generation.
  • Supports text-to-shape generation via CLIP embeddings (see the sketch after this list).
  • Includes code for rendering point clouds using Mitsuba.
  • Provides scripts for training VAE, diffusion prior, and evaluation.
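As context for the CLIP-based text-to-shape highlight above, here is a hedged sketch of how a text prompt could be embedded with the openai/CLIP package listed in the requirements. Only the CLIP calls are real; the conditioning step on the diffusion prior is a hypothetical placeholder, not LION's actual API.

```python
# Compute a CLIP text embedding that a conditional diffusion prior could consume.
# The `prior.sample(...)` call below is a hypothetical interface, not LION's API.
import clip
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
model, _ = clip.load("ViT-B/32", device=device)

tokens = clip.tokenize(["a round wooden chair"]).to(device)
with torch.no_grad():
    text_embedding = model.encode_text(tokens)            # (1, 512)
    text_embedding /= text_embedding.norm(dim=-1, keepdim=True)

# Hypothetical conditioning step (illustrative only):
# shape_latents = prior.sample(condition=text_embedding)
# point_cloud = vae.decode(shape_latents)
```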

Maintenance & Community

  • Project initiated by researchers from NVIDIA and the University of Toronto.
  • Primary contact for issues: @ZENGXH.
  • Experiment logging supported via comet-ml, wandb, and TensorBoard.

Licensing & Compatibility

  • The README does not explicitly state a license.
  • Code is provided for research purposes, implying potential restrictions on commercial use.

Limitations & Caveats

  • Released checkpoints and demo data were not yet available at the time of the README's last update.
  • Training requires significant computational resources (e.g., multiple A100 or V100 GPUs).
  • Data paths for ShapeNet and rendered images may require customization.
Health Check

  • Last commit: 10 months ago
  • Responsiveness: 1 week
  • Pull Requests (30d): 0
  • Issues (30d): 1
  • Star History: 18 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Patrick von Platen (Core Contributor to Hugging Face Transformers and Diffusers), and 7 more.

stable-dreamfusion by ashawkey

Text-to-3D model using NeRF and diffusion
Top 0.1% · 9k stars · created 2 years ago · updated 1 year ago