Interpretability toolkit for sequence generation models
Top 69.9% on sourcepulse
Inseq is a Python toolkit for post-hoc interpretability analysis of sequence generation models, targeting researchers and practitioners in NLP. It democratizes access to various attribution methods, enabling deeper understanding of model behavior and facilitating reproducible research.
How It Works
Inseq integrates with Hugging Face Transformers, supporting both encoder-decoder and decoder-only architectures. It implements a wide range of attribution methods, including gradient-based (e.g., Integrated Gradients, DeepLIFT), attention-based, and perturbation-based techniques. The library allows for flexible post-processing of attribution maps via Aggregator
classes and supports custom attribution targets using "step functions" to extract scores like logits, probabilities, or entropy at each generation step.
Quick Start & Requirements
pip install inseq
pip install inseq[notebook,datasets]
tokenizers
installation requires a Rust compiler.sentencepiece
installation may require cmake
, build-essential
, pkg-config
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
tokenizers
, sentencepiece
) may require additional system-level setup.3 months ago
1 day