Curated list for model quantization research
This repository is a curated, comprehensive list of papers, documentation, and code on model quantization, aimed at machine learning researchers and practitioners. It consolidates work on reducing the numerical precision of neural network weights and activations to improve efficiency, particularly for large models.
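For context, the core idea the listed papers build on is mapping floating-point tensors to low-bit integers and back. The sketch below is a minimal illustration of uniform affine (asymmetric) quantization, assuming nothing about any specific paper or library in the list; the function names and the 8-bit setting are illustrative only.

```python
import numpy as np

def quantize_uniform(x, num_bits=8):
    """Map a float array to num_bits unsigned integers (asymmetric uniform scheme)."""
    qmin, qmax = 0, 2 ** num_bits - 1
    scale = max((x.max() - x.min()) / (qmax - qmin), 1e-8)  # avoid divide-by-zero
    zero_point = int(round(qmin - x.min() / scale))
    q = np.clip(np.round(x / scale + zero_point), qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover an approximate float array from the quantized representation."""
    return scale * (q.astype(np.float32) - zero_point)

# Toy "layer weights": the round-trip error is bounded by roughly scale / 2 per element.
weights = np.random.randn(4, 4).astype(np.float32)
q, scale, zp = quantize_uniform(weights)
print("max abs error:", np.abs(weights - dequantize(q, scale, zp)).max())
```

Techniques catalogued in the list (binarization, post-training quantization, quantization-aware training, LLM-specific methods) refine this basic float-to-integer mapping in various ways.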
How It Works
The repository is structured as an "awesome list," categorizing resources by year, topic (e.g., binarization, LLM quantization), and specific benchmarks. It links seminal papers, recent research, and associated code repositories, supporting both a broad overview of the field and deep dives into individual subareas of model quantization.
Quick Start & Requirements
This is a curated list, not a runnable software project. No installation or execution commands are applicable.
Maintenance & Community
The project is community-driven, with an open invitation for pull requests that add missing works. It is maintained by the "Efficient-ML" organization.
Licensing & Compatibility
The repository itself is licensed under the MIT License, allowing for broad reuse. Individual papers and code repositories linked within the list will have their own respective licenses.
Limitations & Caveats
As a curated list, the repository provides no executable code or direct functionality. Its value depends entirely on the completeness and accuracy of community contributions.