DNN analyzer for predicting model accuracy without training/test data
Top 26.3% on sourcepulse
WeightWatcher is a diagnostic tool for analyzing Deep Neural Networks (DNNs) without requiring access to training or test data. It helps users predict model accuracy, identify over-training or over-parameterization, and detect potential issues during compression or fine-tuning, targeting researchers and practitioners in deep learning.
How It Works
The tool is based on theoretical research into Heavy-Tailed Self-Regularization (HT-SR) and employs concepts from Random Matrix Theory (RMT) and Statistical Mechanics. It analyzes the Empirical Spectral Density (ESD) of layer weight matrices, fitting the tail of the distribution to a power law to derive generalization metrics. This approach aims to quantify how "on-random" or "heavy-tailed" a layer's weight distribution is, correlating these properties with model generalization performance.
Quick Start & Requirements
pip install weightwatcher
Highlighted Details
alpha
(power law exponent) and rand_distance
for generalization assessment.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The power law fits are most effective for well-trained, heavy-tailed ESDs; results may be spurious if alpha > 8.0
or if ESDs are multimodal or not well-described by a single power law. The PEFT/LoRA analysis is experimental and currently ignores biases.
1 month ago
1 day