mdistiller by megvii-research

PyTorch library for knowledge distillation research

Created 3 years ago

884 stars

Top 40.8% on SourcePulse

View on GitHub

1 Expert Loves This Project

Wing Lian

Founder of Axolotl AI

Project Summary

This library provides a PyTorch framework for implementing various knowledge distillation techniques, targeting researchers and practitioners in computer vision. It offers official implementations of "Decoupled Knowledge Distillation" (CVPR2022) and "DOT: A Distillation-Oriented Trainer" (ICCV2023), enabling significant performance improvements on standard benchmarks like CIFAR-100 and ImageNet.

How It Works

MDistiller supports a wide array of distillation methods, including KD, FitNet, AT, NST, PKT, KDSVD, OFD, RKD, VID, SP, CRD, ReviewKD, and DKD. The framework is designed to be extensible, allowing users to easily add custom distillation algorithms by defining new distiller classes and registering them within the library. It leverages PyTorch for model implementation and training, with optional integration of Weights & Biases for logging.

Quick Start & Requirements

Install: sudo pip3 install -r requirements.txt followed by sudo python3 setup.py develop.
Prerequisites: Python 3.6, PyTorch 1.9.0, torchvision 0.10.0. Wandb is used by default for logging.
Datasets: Requires downloading datasets for CIFAR-100, ImageNet, and MS-COCO. Checkpoints for teacher models are also available for download.
Links: Official Docs, CVPR2022 Paper, ICCV2023 Paper.

Highlighted Details

Implements 13 distinct knowledge distillation methods.
Achieves notable accuracy gains, e.g., +1.79% on CIFAR-100 (ResNet32x4 student) with DKD.
Supports training and evaluation on CIFAR-100, ImageNet, and MS-COCO.
Includes visualization tools for t-SNE and correlation matrices.

Maintenance & Community

The project is associated with Megvii Research. It builds upon existing codebases from CRD and ReviewKD.

Licensing & Compatibility

Released under the MIT license, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The specified environment requires older versions of PyTorch (1.9.0) and torchvision (0.10.0), which may pose compatibility challenges with newer projects or hardware.

mdistiller by megvii-research

Explore Similar Projects

BK-SDM by Nota-NetsPresso

distill-sd by segmind

Awesome-Knowledge-Distillation-of-LLMs by Tebmer

knowledge-distillation-papers by lhyfst

distilling-step-by-step by google-research

DistillKit by arcee-ai

Awesome-LLM-Compression by HuangOwen

Awesome-Dataset-Distillation by Guang000

tinker-cookbook by thinking-machines-lab

Awesome-Knowledge-Distillation by FLHonker

awesome-knowledge-distillation by dkozlov

torchtune by meta-pytorch