3D-aware portrait synthesis for interactive disentangled editing
Top 64.3% on sourcepulse
IDE-3D addresses the trade-off between quality and editability in 3D-aware facial generation, enabling high-resolution, view-consistent, and disentangled portrait synthesis with interactive editing capabilities. It targets researchers and practitioners in computer graphics and generative AI who require fine-grained control over 3D face generation.
How It Works
The system employs a three-component architecture: a 3D-semantics-aware generative model for disentangled outputs, a hybrid GAN inversion for faithful reconstruction, and a canonical editor for semantic mask manipulation. This approach combines the strengths of low-resolution editability and high-resolution photorealism by leveraging semantic masks and a hybrid inversion technique for efficient, high-quality editing.
Quick Start & Requirements
conda env create -f environment.yml
.ide3d-ffhq-64-512.pkl
, encoder-base-hybrid.pkl
).pip install -r ./Painter/requirements.txt
.Highlighted Details
Maintenance & Community
The project is associated with ACM Transactions on Graphics (SIGGRAPH Asia 2022). Code is borrowed from StyleGAN3, PTI, EG3D, and StyleGAN-nada. Training scripts are noted as "will be released soon."
Licensing & Compatibility
The repository does not explicitly state a license. However, its academic publication and reliance on other projects (some with permissive licenses) suggest it is intended for research purposes. Commercial use would require careful review of any underlying component licenses.
Limitations & Caveats
Training scripts are not yet released, limiting the ability to train custom models. The project relies on pre-trained models and specific dataset formats, which may require significant effort to adapt for custom use cases.
2 years ago
Inactive