PyTorch library for low-rank adaptation (LoRA) of LLMs
This repository provides loralib, a Python package implementing Low-Rank Adaptation (LoRA) for large language models. It enables efficient fine-tuning by injecting trainable low-rank matrices into pre-trained models, significantly reducing the number of trainable parameters and storage requirements without introducing inference latency. The target audience includes researchers and engineers who need to adapt large NLP models to specific tasks efficiently.
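For a rough sense of the savings: full fine-tuning of one weight matrix trains d x k parameters, while LoRA trains only r x (d + k). The numbers below are illustrative, not taken from the repository.

```python
# Back-of-the-envelope count for one 4096 x 4096 projection (illustrative values).
d, k, r = 4096, 4096, 8      # hypothetical layer shape and LoRA rank
full_ft = d * k              # 16,777,216 params trained by full fine-tuning
lora_ft = r * (d + k)        # 65,536 params trained by LoRA (A: r x k, B: d x r)
print(full_ft // lora_ft)    # -> 256, i.e. 256x fewer trainable parameters
```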
How It Works
LoRA reduces trainable parameters by decomposing weight updates into two smaller, low-rank matrices. This approach drastically cuts the memory footprint of fine-tuning and allows rapid task switching, since each task only needs its own small set of LoRA weights. The library integrates seamlessly with PyTorch models, offering drop-in replacements for nn.Linear, nn.Embedding, and nn.Conv2d layers.
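To make the decomposition concrete, here is a minimal from-scratch sketch of a LoRA-style linear layer. This is not loralib's actual implementation; the class name, initialization choices, and hyperparameters are illustrative.

```python
import torch
import torch.nn as nn

class LoRALinearSketch(nn.Module):
    """Frozen weight W plus a trainable low-rank update (alpha/r) * B @ A."""
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.weight = nn.Parameter(torch.empty(out_features, in_features))
        nn.init.kaiming_uniform_(self.weight)
        self.weight.requires_grad = False           # pre-trained weight stays frozen
        self.lora_A = nn.Parameter(torch.zeros(r, in_features))
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))
        nn.init.normal_(self.lora_A, std=0.02)      # small random init for A
        self.scaling = alpha / r                    # B starts at zero, so the update is zero initially

    def forward(self, x):
        frozen = x @ self.weight.T                        # original path
        update = (x @ self.lora_A.T) @ self.lora_B.T      # low-rank path
        return frozen + self.scaling * update
```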
Quick Start & Requirements
- Install: pip install loralib or pip install git+https://github.com/microsoft/LoRA
- Drop-in LoRA replacements are provided for nn.Linear, nn.Embedding, and nn.Conv2d; a minimal workflow is sketched below.
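A condensed sketch of the typical workflow, assuming loralib's documented API (lora.Linear, lora.mark_only_lora_as_trainable, lora.lora_state_dict); the layer sizes and checkpoint path are placeholders.

```python
import torch
import torch.nn as nn
import loralib as lora

# Swap supported layers for their LoRA counterparts; r is the adaptation rank.
model = nn.Sequential(
    lora.Linear(768, 768, r=16),
    nn.ReLU(),
    lora.Linear(768, 10, r=16),
)

# Freeze everything except the LoRA matrices before training.
lora.mark_only_lora_as_trainable(model)

# ... train as usual ...

# Checkpoint only the LoRA parameters (kilobytes to megabytes, not the full model).
torch.save(lora.lora_state_dict(model), 'ckpt_lora.pt')

# To restore: load the pre-trained weights first, then the LoRA weights non-strictly.
model.load_state_dict(torch.load('ckpt_lora.pt'), strict=False)
```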
Highlighted Details
- lora.MergedLinear handles fused layers (e.g., QKV projections) and supports optional bias training; see the sketch below.
- LoRA weights can be merged back into the frozen weights on model.eval(), eliminating inference latency.
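A sketch of both features together, assuming loralib's documented MergedLinear signature; d_model and the enable_lora pattern are illustrative.

```python
import torch
import loralib as lora

d_model = 768  # hypothetical model width

# One fused projection producing Q, K, and V in a single matrix;
# enable_lora adapts only the Q and V slices, leaving K frozen.
qkv = lora.MergedLinear(d_model, 3 * d_model, r=8,
                        enable_lora=[True, False, True])

# With merge_weights=True (the default), switching to eval mode folds the
# low-rank update B @ A into the frozen weight, so inference runs a single
# dense matmul with no added latency.
qkv.eval()
y = qkv(torch.randn(2, d_model))
```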
Maintenance & Community
Latest activity was 7 months ago; the repository is inactive.

Licensing & Compatibility
Released under the MIT license.
Limitations & Caveats
- The code used for the paper's experiments is preserved on the snapshot-9-15-2021 branch.
- Models with fused projection matrices (e.g., a single QKV weight) require lora.MergedLinear or manually splitting them before LoRA can be applied.