TextPruner by airaria

PyTorch toolkit for pruning pre-trained language models

created 4 years ago
387 stars

Top 75.2% on sourcepulse

View on GitHub
Project Summary

TextPruner is a PyTorch-based toolkit for efficiently reducing the size and inference time of pre-trained language models. It offers training-free, structured pruning methods for researchers and practitioners looking to deploy large NLP models in resource-constrained environments.

How It Works

TextPruner implements two pruning strategies: vocabulary pruning and transformer pruning. Vocabulary pruning removes underutilized tokens from the model's embedding matrix and tokenizer, shrinking the model and potentially speeding up masked language modeling, whose output layer spans the full vocabulary. Transformer pruning removes the least important attention heads and feed-forward network (FFN) neurons within each layer, aiming to preserve task performance while significantly reducing model size; it supports both iterative and mask-based pruning.
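
As a concrete illustration of the transformer-pruning path, the sketch below uses the TransformerPruner and TransformerPruningConfig names from the project's documentation. The model choice, target sizes, and toy dataloader are assumptions for illustration; the batches are only used to estimate head and neuron importance, and the expected batch format may need adapting to your task.

```python
# Sketch of transformer pruning with TextPruner; the model, target sizes, and
# toy dataloader are illustrative assumptions, not a definitive recipe.
import torch
from torch.utils.data import DataLoader
from transformers import XLMRobertaForSequenceClassification, XLMRobertaTokenizer
from textpruner import TransformerPruner, TransformerPruningConfig

model = XLMRobertaForSequenceClassification.from_pretrained("xlm-roberta-base")
tokenizer = XLMRobertaTokenizer.from_pretrained("xlm-roberta-base")

# Tiny stand-in for a downstream dev set (e.g. XNLI); the pruner runs these
# batches through the model to score attention heads and FFN neurons.
texts = ["A short example sentence.", "Another example sentence."]
encoded = tokenizer(texts, padding=True, return_tensors="pt")
examples = [
    {"input_ids": encoded["input_ids"][i],
     "attention_mask": encoded["attention_mask"][i],
     "labels": torch.tensor(0)}
    for i in range(len(texts))
]
dataloader = DataLoader(examples, batch_size=2)

# Shrink each FFN to 2048 neurons and keep 8 heads per layer, pruning
# iteratively over 4 rounds instead of in a single shot.
config = TransformerPruningConfig(
    target_ffn_size=2048,
    target_num_of_heads=8,
    pruning_method="iterative",
    n_iters=4,
)
pruner = TransformerPruner(model, transformer_pruning_config=config)
pruner.prune(dataloader=dataloader, save_model=True)
```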

Quick Start & Requirements

  • Install via pip: pip install textpruner
  • Requirements: Python >= 3.7, torch >= 1.7, transformers >= 4.0, sentencepiece, protobuf.
  • Official documentation: https://textpruner.readthedocs.io
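
For a first run, vocabulary pruning is the simplest entry point. The sketch below assumes the VocabularyPruner interface from the documentation linked above, where dataiter is an iterable of raw texts whose tokens should be kept; the model and corpus here are placeholders.

```python
# Minimal vocabulary-pruning sketch; swap in your own model and the texts of
# your downstream task. Tokens absent from the texts are dropped from the
# embedding matrix and the tokenizer.
from transformers import XLMRobertaForMaskedLM, XLMRobertaTokenizer
from textpruner import VocabularyPruner

model = XLMRobertaForMaskedLM.from_pretrained("xlm-roberta-base")
tokenizer = XLMRobertaTokenizer.from_pretrained("xlm-roberta-base")

texts = [
    "TextPruner removes tokens that never occur in the given corpus.",
    "The embedding matrix and the tokenizer shrink accordingly.",
]

pruner = VocabularyPruner(model, tokenizer)
pruner.prune(dataiter=texts, save_model=True)  # saves the pruned model and tokenizer
```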

Highlighted Details

  • Supports vocabulary and transformer pruning, with a combined pipeline pruning option (see the sketch after this list).
  • Compatible with Hugging Face Transformers models like BERT, ALBERT, RoBERTa, ELECTRA, and XLM-RoBERTa.
  • Offers both a Python package API and a CLI tool for ease of use.
  • Demonstrates significant speedups (up to 2x) and size reductions (e.g., 62.5% reduction in vocab size) with minimal accuracy loss on tasks like XNLI.
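
The combined pipeline option chains both steps in a single call. This sketch assumes a PipelinePruner class that accepts the transformer pruning configuration and reuses the dataloader and texts from the earlier sketches; argument names may differ between versions, so check the documentation before relying on them.

```python
# Sketch of pipeline pruning: transformer pruning followed by vocabulary
# pruning in one pass. `dataloader` and `texts` are placeholders standing in
# for the importance-estimation batches and raw corpus shown earlier.
from transformers import XLMRobertaForSequenceClassification, XLMRobertaTokenizer
from textpruner import PipelinePruner, TransformerPruningConfig

model = XLMRobertaForSequenceClassification.from_pretrained("xlm-roberta-base")
tokenizer = XLMRobertaTokenizer.from_pretrained("xlm-roberta-base")

dataloader = ...  # labeled batches from the downstream task (see the sketch above)
texts = [...]     # raw texts defining the vocabulary to keep (see the sketch above)

transformer_pruning_config = TransformerPruningConfig(
    target_ffn_size=2048,
    target_num_of_heads=8,
    pruning_method="iterative",
    n_iters=4,
)
pruner = PipelinePruner(model, tokenizer,
                        transformer_pruning_config=transformer_pruning_config)
pruner.prune(dataloader=dataloader, dataiter=texts, save_model=True)
```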

Maintenance & Community

  • The accompanying paper was accepted to the ACL 2022 System Demonstrations track.
  • Associated with the HFL research group (the Joint Laboratory of HIT and iFLYTEK Research).

Licensing & Compatibility

  • The repository does not explicitly state a license in the README. Users should verify licensing terms before commercial use.

Limitations & Caveats

  • Does not support TensorFlow 2.
  • Transformer pruning is not supported for XLM, BART, T5, and mT5 models, though vocabulary pruning is.
  • Achieving optimal performance with transformer pruning may require careful tuning of parameters like n_iters and potentially using uneven head configurations.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 2 stars in the last 90 days

Explore Similar Projects

Starred by Jared Palmer (Ex-VP of AI at Vercel; Founder of Turborepo; Author of Formik, TSDX):

  • wanda by locuslab: LLM pruning research paper implementation. 782 stars; created 2 years ago, updated 11 months ago.