daam by castorini

Research paper implementation for interpreting Stable Diffusion models

Created 3 years ago

785 stars

Top 44.8% on SourcePulse

Project Summary

DAAM (Diffusion Attentive Attribution Maps) provides a method for interpreting Stable Diffusion models by visualizing cross-attention mechanisms. It helps users understand which parts of the input prompt influence specific regions of the generated image, targeting researchers and users of diffusion models who need to debug or explain model behavior.

How It Works

DAAM leverages cross-attention maps generated during the diffusion process. By analyzing these maps, it attributes image regions to specific words in the prompt. The approach allows for granular heatmaps per word and global heatmaps, offering a detailed view of the model's internal reasoning. This method is advantageous for its direct link to the attention mechanism, providing a more interpretable explanation than generic saliency methods.

Quick Start & Requirements

Install with pip install daam. For editable installs, clone the repo and run pip install -e daam.
Requires PyTorch and huggingface-cli login for model access.
Supports Stable Diffusion XL (SDXL) and Diffusers 0.21.1.
A demo is available at https://huggingface.co/spaces/tetrisd/Diffusion-Attentive-Attribution-Maps.
A Colab notebook is provided for guided usage.

Highlighted Details

Generates per-word and global heatmaps for Stable Diffusion.
Supports CLI utility for quick generation and library integration for custom workflows.
Enables serialization and deserialization of generation experiments and heatmaps.
Offers an extension, DAAM-i2i, for image-to-image attribution.

Maintenance & Community

The codebase is regularly updated. Questions can be submitted via issues. Links to community resources are not explicitly provided beyond the GitHub repository.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README text. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The README mentions that documentation is still being added. While it supports SDXL and recent Diffusers versions, users should verify compatibility with specific model checkpoints or older library versions.

daam by castorini

Explore Similar Projects

attention-map-diffusers by wooyeolbaek

segmoe by segmind

CCSR by csslc

erasing by rohitgandikota

Attend-and-Excite by yuval-alaluf

DiffusionFastForward by mikonvergence

Radiata by ddPn08

Diffusion-Models by yangqy1110

custom-diffusion by adobe-research

pytorch-stable-diffusion by hkproj

T2I-Adapter by TencentARC

k-diffusion by crowsonkb