DreamCraft3D  by deepseek-ai

3D generator for high-fidelity object creation from a 2D reference image

created 1 year ago
2,960 stars

Top 16.5% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

DreamCraft3D is a hierarchical 3D content generation method that produces high-fidelity, coherent 3D objects from a single 2D reference image. It targets researchers and developers in computer graphics and AI, offering a novel approach to overcome consistency issues in 3D generation by leveraging bootstrapped diffusion priors.

How It Works

DreamCraft3D employs a multi-stage process. It first uses score distillation sampling with a view-dependent diffusion model for geometry sculpting, prioritizing consistency over texture fidelity. Subsequently, it introduces "Bootstrapped Score Distillation" by training a personalized, 3D-aware diffusion model (e.g., Dreambooth) on augmented renderings. This personalized prior provides view-consistent guidance, and an alternating optimization between the diffusion prior and the 3D scene representation leads to mutually reinforcing improvements, significantly boosting texture quality.

Quick Start & Requirements

  • Install: pip install -r requirements.txt (after setting up a virtual environment and installing PyTorch).
  • Prerequisites: NVIDIA GPU with >= 20GB VRAM, CUDA, Python >= 3.8, PyTorch >= 1.12. Ninja is recommended for faster CUDA extension compilation.
  • Models: Requires downloading pre-trained Zero123 and Omnidata models.
  • Setup: Detailed installation and setup instructions are available in installation.md. Docker installation is also supported.
  • Links: DreamCraft3D Paper, Project Page, Youtube video, Replicate demo.

Highlighted Details

  • Hierarchical generation process for improved coherence and fidelity.
  • Bootstrapped Score Distillation using personalized diffusion models for enhanced texture.
  • Alternating optimization loop for mutual reinforcement between 3D scene and diffusion prior.
  • Supports exporting textured meshes in OBJ format.
  • DreamCraft3D++ released with enhanced quality and efficiency.

Maintenance & Community

The project is associated with DeepSeek AI and builds upon threestudio-project and stable-dreamfusion. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The project is released under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The default configurations require significant GPU memory (tested on 40GB A100). Memory usage can be reduced by lowering rendering resolutions. The "Janus problem" may arise in Stage 1, potentially requiring custom diffusion model training.

Health Check
Last commit

3 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
59 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), and
7 more.

stable-dreamfusion by ashawkey

0.1%
9k
Text-to-3D model using NeRF and diffusion
created 2 years ago
updated 1 year ago
Feedback? Help us improve.