Text-to-image prompt gallery dataset for generative AI research
Top 31.5% on sourcepulse
DiffusionDB is a comprehensive dataset for text-to-image generation research, offering 14 million Stable Diffusion-generated images with associated prompts and hyperparameters. It targets researchers and developers working on generative models, deepfake detection, and human-AI interaction, providing a large-scale, human-actuated resource for understanding prompt engineering and model behavior.
How It Works
The dataset is structured into two subsets: DiffusionDB 2M (2 million images, 1.6TB) and DiffusionDB Large (14 million images, 6.5TB). Images are organized into modular folders, with each folder containing images and a JSON file mapping image filenames to their generation parameters (prompt, seed, CFG scale, steps, sampler). Metadata is also provided in Parquet format for efficient querying without downloading all images.
Quick Start & Requirements
pip install datasets Pillow
) or a provided Python downloader script (download.py
).urlretrieve
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The dataset may contain some NSFW images despite an NSFW filter, and users are advised to apply their own filtering based on provided NSFW scores. Timestamps may not be accurate for duplicate images.
1 year ago
Inactive