Paper list for LLM knowledge distillation
This repository is a curated collection of research papers on Knowledge Distillation (KD) for Large Language Models (LLMs), aimed at researchers and practitioners seeking to transfer capabilities from large proprietary models to smaller ones or enable self-improvement. It provides a structured overview of KD techniques, categorized by algorithms, skill transfer, and domain-specific applications, serving as a comprehensive resource for understanding and implementing LLM distillation.
How It Works
The collection is organized around a taxonomy that breaks down KD into "Knowledge Elicitation" (extracting knowledge from teacher LLMs) and "Distillation Algorithms" (transferring knowledge to student models). It further explores "Skill Distillation" for enhancing specific cognitive abilities (e.g., reasoning, alignment) and "Verticalization Distillation" for domain-specific applications (e.g., law, medicine). This structured approach allows users to navigate the diverse landscape of KD research efficiently.
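To make the "Distillation Algorithms" category concrete, the sketch below shows the classic white-box setup that much of the surveyed work builds on: a student model trained to match a teacher's softened next-token distribution with a KL-divergence loss. This is a minimal illustration, not a method from any particular paper in the list; the model names, temperature, and training-loop details are illustrative assumptions.

```python
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical teacher/student pair; any causal LMs that share a tokenizer work.
teacher_name, student_name = "gpt2-large", "gpt2"

tokenizer = AutoTokenizer.from_pretrained(teacher_name)
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
teacher = AutoModelForCausalLM.from_pretrained(teacher_name).eval()
student = AutoModelForCausalLM.from_pretrained(student_name)

optimizer = torch.optim.AdamW(student.parameters(), lr=1e-5)
temperature = 2.0  # softens both distributions before matching them

def distill_step(batch_texts):
    """One white-box KD step: the student mimics the teacher's token distribution."""
    enc = tokenizer(batch_texts, return_tensors="pt", padding=True, truncation=True)
    with torch.no_grad():  # teacher is frozen
        teacher_logits = teacher(**enc).logits
    student_logits = student(**enc).logits

    # Forward KL(teacher || student) over the temperature-softened vocabularies.
    t_probs = F.softmax(teacher_logits / temperature, dim=-1)
    s_log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    loss = F.kl_div(s_log_probs, t_probs, reduction="batchmean") * temperature**2

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# Toy usage with a placeholder batch of text.
print(distill_step(["Knowledge distillation compresses large language models."]))
```

Black-box variants covered in the list cannot access teacher logits, so they instead fine-tune the student on teacher-generated outputs elicited through prompting, which is where the "Knowledge Elicitation" branch of the taxonomy comes in.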
Quick Start & Requirements
This repository is a curated list of papers and does not involve direct code execution or installation. It serves as a reference guide.
Maintenance & Community
The repository is maintained by Xiaohan Xu and collaborators, who provide contact information for contributions and feedback. Users are encouraged to open issues or pull requests, or to email the maintainers, to suggest missing papers or changes to the taxonomy.
Licensing & Compatibility
The repository itself does not declare a software license. The linked papers carry their own respective licenses and terms of use, which readers must adhere to.
Limitations & Caveats
The collection primarily focuses on generative LLMs and explicitly notes that encoder-based KD is not included, though it is being tracked. Some entries may lack direct code links.