SelfishGene/SFHQ-dataset
High-resolution synthetic face dataset for generative AI research
Summary
The SFHQ dataset offers approximately 425,000 high-quality synthetic face images at 1024x1024 resolution. It addresses the need for large-scale facial data that is free of privacy concerns, for training machine learning models or augmenting existing datasets, and it provides significant variability in identity, ethnicity, age, pose, expression, and lighting.
How It Works
Inspiration images (paintings, 3D models, text-to-image outputs) are encoded into StyleGAN2 latent space via the e4e encoder. Manipulating the resulting latent codes then generates photorealistic faces. A semi-automatic curation process using a "visual taste approximator" and CLIP features filters for quality and removes near-duplicates (the stated criterion is CLIP similarity < 0.92, which reads as the similarity allowed between retained images), yielding a large, diverse dataset.
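The near-duplicate step can be illustrated with a minimal sketch. It assumes each image already has a CLIP feature vector stored as a row of a NumPy array, and it uses a simple greedy pass with cosine similarity. The function name filter_near_duplicates and the greedy strategy are illustrative assumptions, not the author's actual curation code. Only the 0.92 threshold comes from the description above.

```python
from typing import List

import numpy as np


def filter_near_duplicates(features: np.ndarray, threshold: float = 0.92) -> List[int]:
    """Greedily keep rows whose cosine similarity to every kept row is below threshold.

    `features` is assumed to be an (n_images, dim) array of precomputed
    image embeddings (e.g. CLIP features). Hypothetical helper for illustration.
    """
    # Normalize rows so that dot products equal cosine similarities.
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.clip(norms, 1e-12, None)

    kept: List[int] = []
    for i in range(len(unit)):
        if kept:
            # Similarity of candidate i to all images kept so far.
            sims = unit[kept] @ unit[i]
            if sims.max() >= threshold:
                continue  # too close to an image we already kept
        kept.append(i)
    return kept


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    base = rng.normal(size=(3, 512))
    # Append a slightly perturbed copy of the first vector as a near-duplicate.
    near_dup = base[0] + 0.01 * rng.normal(size=512)
    print(filter_near_duplicates(np.vstack([base, near_dup])))
```

A greedy pass like this is quadratic in the worst case. At the scale of hundreds of thousands of images, an approximate nearest-neighbor index would likely be a more practical choice.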
Quick Start & Requirements
Download is available via Kaggle. Implied dependencies include StyleGAN2, e4e encoder, CLIP, Face Parsing BiSeNet, and Dlib. An example script (explore_dataset.py) and a live Kaggle notebook demonstrate accessing features like landmarks, segmentation maps, and performing textual searches.
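A textual search over the dataset can be sketched as ranking precomputed image features by cosine similarity to a text embedding. The sketch below assumes both embeddings live in the same CLIP feature space and are already available as NumPy arrays. The function name top_k_matches and the array layout are hypothetical, and explore_dataset.py may do this differently.

```python
import numpy as np


def top_k_matches(text_feature: np.ndarray, image_features: np.ndarray, k: int = 5):
    """Return indices and scores of the k images most similar to the text embedding.

    `text_feature` is assumed to be a (dim,) vector and `image_features`
    an (n_images, dim) array in the same embedding space. Hypothetical helper.
    """
    # Normalize so that the dot product is the cosine similarity.
    text_unit = text_feature / np.linalg.norm(text_feature)
    image_unit = image_features / np.linalg.norm(image_features, axis=1, keepdims=True)

    scores = image_unit @ text_unit
    k = min(k, len(scores))
    # Sort descending by score and keep the first k indices.
    order = np.argsort(-scores)[:k]
    return order, scores[order]
```

In practice the text embedding would come from a CLIP text encoder applied to a query such as a short description of the desired face. That step is omitted here because it depends on which CLIP implementation is installed.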
Maintenance & Community
Created by David Beniaguev, with the GitHub repository (SelfishGene/SFHQ-dataset) as the primary resource. No specific community channels or maintenance details are provided in the README.
Licensing & Compatibility
The README describes the dataset as having "no privacy issues or license issues" because the images are synthetically generated. However, no specific open-source license is stated, so the terms for commercial use need clarification.
Limitations & Caveats
Limited variability in accessories (hats, earphones) and jewelry; minimal occlusions beyond hair self-occlusion. Inherits biases from source datasets (FFHQ, AAHQ) and generative models (StyleGAN2, Stable Diffusion). A newer dataset, SFHQ-T2I, is also mentioned.
1 year ago
Inactive