albert_pytorch by lonePatient

PyTorch implementation for ALBERT research paper

Created 6 years ago

716 stars

Top 48.0% on SourcePulse

Project Summary

This repository provides a PyTorch implementation of the ALBERT model, a lighter version of BERT designed for efficient self-supervised learning of language representations. It is targeted at researchers and practitioners in Natural Language Processing (NLP) who need a performant yet resource-conscious language model for tasks like fine-tuning on benchmarks such as GLUE.

How It Works

The implementation focuses on ALBERT's core architectural innovations, including parameter-reduction techniques like factorized embedding parameterization and cross-layer parameter sharing. These methods significantly reduce the number of parameters compared to BERT, leading to faster training and lower memory requirements while aiming to maintain competitive performance on downstream NLP tasks.

Quick Start & Requirements

Install: PyTorch 1.10, CUDA 9.0, cuDNN 7.5, scikit-learn, sentencepiece.
Pre-trained Models: Download links for v1 and v2 models (base, large, xlarge, xxlarge) are provided.
Fine-tuning: Place downloaded models and config.json in prev_trained_model/albert_base_v2.
Conversion: A script convert_albert_tf_checkpoint_to_pytorch.py is available to convert TensorFlow checkpoints.
GLUE Tasks: Download GLUE data and run provided shell scripts (e.g., scripts/run_classifier_sst2.sh) for fine-tuning.

Highlighted Details

Offers implementations for ALBERT v1 and v2 models.
Includes performance metrics on the GLUE benchmark for various model sizes.
Provides a utility script for converting TensorFlow checkpoints to PyTorch.

Maintenance & Community

No specific information on maintainers, community channels, or roadmap is present in the README.

Licensing & Compatibility

The README does not explicitly state the license type. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project requires specific older versions of PyTorch (1.10) and CUDA (9.0), which may pose compatibility challenges with modern hardware and software stacks. The README lacks details on community support or ongoing maintenance.

Health Check

Last Commit

5 years ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

0 stars in the last 30 days