Sparsification toolkit for optimized neural networks
Top 21.4% on sourcepulse
SparseML is an open-source toolkit for optimizing neural networks through pruning, quantization, and distillation, targeting researchers and engineers who want to create faster, smaller models for efficient inference. It enables applying state-of-the-art sparsification techniques with minimal code changes, exporting models to ONNX for deployment with DeepSparse on CPU hardware.
How It Works
SparseML utilizes a declarative recipe system (YAML files) to define sparsification algorithms and hyperparameters. These recipes are parsed and applied to PyTorch or Hugging Face models via a Python API or a CLI. The library abstracts the complexity of these algorithms, allowing users to integrate sparsification into existing training pipelines with minimal code modification.
Quick Start & Requirements
pip install sparseml
pip install -e "sparseml[transformers]"
Highlighted Details
SparseGPTModifier
.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The library supports specific versions of PyTorch and TensorFlow; compatibility with newer versions may require updates. TensorFlow support is limited to versions < 2.0.0.
2 months ago
Inactive