FastBERT provides a self-distilling approach to BERT models, enabling adaptive inference times for improved efficiency without significant accuracy loss. It targets researchers and practitioners working with large language models who need to optimize performance for deployment.
How It Works
FastBERT attaches a lightweight student classifier to every Transformer layer of a fine-tuned BERT backbone. During the self-distillation phase, these per-layer students learn to mimic the output distribution of the final (teacher) classifier, without requiring extra labeled data. At inference time, a sample exits at the first layer whose prediction is sufficiently certain, so easy inputs use only a few layers while harder inputs traverse the full stack. This per-sample adaptive computation is what yields large reductions in FLOPs (floating-point operations) on benchmark datasets while maintaining high accuracy.
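A minimal conceptual sketch of this early-exit loop, assuming per-layer branch classifiers and a normalized-entropy uncertainty measure as described in the paper; the names `layers`, `branch_classifiers`, `final_classifier`, and `speed` are illustrative, not the repository's actual code:

```python
import torch
import torch.nn.functional as F

def adaptive_forward(hidden, layers, branch_classifiers, final_classifier, speed=0.5):
    """Sketch of FastBERT-style early exit for a single example (batch size 1).

    `layers` are the Transformer blocks; every block except the last has a
    student branch classifier, and the last block feeds the teacher
    `final_classifier`. A sample exits at the first branch whose normalized
    entropy falls below the `speed` threshold.
    """
    for layer, branch in zip(layers[:-1], branch_classifiers):
        hidden = layer(hidden)
        probs = F.softmax(branch(hidden), dim=-1)
        num_classes = probs.size(-1)
        # Normalized entropy in [0, 1]; lower means the branch is more certain.
        uncertainty = -(probs * probs.clamp_min(1e-12).log()).sum(-1) \
            / torch.log(torch.tensor(float(num_classes)))
        if uncertainty.item() < speed:  # confident enough -> exit early
            return probs
    hidden = layers[-1](hidden)
    return F.softmax(final_classifier(hidden), dim=-1)  # full-depth fallback
```

Raising `speed` loosens the certainty requirement, so more samples exit early and average FLOPs drop; setting it to 0 forces every sample through the full model.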
Quick Start & Requirements
- Install via pip: `pip install fastbert` (see the usage sketch after this list).
- Requires Python >= 3.4.0.
- Additional requirements can be installed with `pip install -r requirements.txt`.
- Pre-trained model weights and vocabulary files are necessary for both Chinese and English datasets.
- GPU is recommended for training and inference.
- Official quick-start examples are provided for Chinese Book review and English Ag.news datasets.
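As a rough illustration of driving the pip-installed package, the sketch below creates, trains, and queries a model with an adjustable speed. The class and argument names (`FastBERT`, `fit`, `kernel_name`, `speed`) follow the project's published examples but should be verified against the current README; the training data here is a toy placeholder.

```python
from fastbert import FastBERT  # assumes the PyPI package's high-level API

# Toy data; real use would load the Chinese Book review or English Ag.news sets.
sents_train = ["the book was wonderful", "a dull and predictable plot"]
labels_train = ["positive", "negative"]

# kernel_name selects the pre-trained backbone (weights/vocab downloaded separately).
model = FastBERT(
    kernel_name="google_bert_base_en",  # e.g. "google_bert_base_zh" for Chinese
    labels=["positive", "negative"],
    device="cuda:0",  # GPU recommended; CPU works but is slower
)

# fit() is expected to cover both the fine-tuning and self-distillation phases.
model.fit(sents_train, labels_train, model_saving_path="./fastbert_demo.bin")

# Inference with adaptive speed: higher speed => earlier exits, fewer FLOPs.
label, exec_layers = model("an engaging and well written story", speed=0.7)
print(label, exec_layers)
```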
Highlighted Details
- Reports FLOP reductions with minimal accuracy drop (e.g., ~75% on Chinese Book review and ~3% on English Ag.news at the reported settings).
- Supports adaptive inference speed control via the `--speed` parameter (see the sweep example after this list).
- Offers both fine-tuning and self-distillation phases for model optimization.
- Codebase is available on PyPI and GitHub.
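Because the speed threshold directly trades accuracy for computation, a small sweep makes the trade-off concrete. This reuses the illustrative `model` object from the quick-start sketch above; the returned `exec_layers` value is assumed to report how deep the sample travelled before exiting.

```python
# Sweep the speed threshold to see how many layers an input actually uses.
for speed in (0.0, 0.3, 0.5, 0.8):
    label, exec_layers = model("an engaging and well written story", speed=speed)
    print(f"speed={speed}: predicted {label}, exited after layer {exec_layers}")
```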
Maintenance & Community
- The project is associated with ACL 2020 and IEEE TNNLS publications.
- Mentions funding from the 2019 Tencent Rhino-Bird Elite Training Program.
- An alternative PyTorch implementation exists at BitVoyage/FastBERT.
Licensing & Compatibility
- The README does not explicitly state a license. The code is distributed publicly on PyPI and GitHub, but the repository should be checked for a LICENSE file before assuming any particular terms.
Limitations & Caveats
- The README does not specify the exact license, which could impact commercial use.
- Requires downloading separate pre-trained model weights and vocabulary files.
- Performance gains may vary depending on the specific dataset and task.