MeLo: LoRA SDK for Vision Transformer models
MeLo provides a low-rank adaptation (LoRA) implementation specifically for Vision Transformers (ViT), offering a parameter-efficient alternative to full fine-tuning for tasks such as medical image diagnosis. It targets researchers and practitioners who need to adapt ViT models to new datasets or tasks at reduced computational cost and memory footprint.
How It Works
MeLo injects low-rank matrices into the attention layers of ViT models. This approach decomposes the weight updates into small, trainable matrices, significantly reducing the number of parameters that must be updated during fine-tuning. The method maintains performance comparable to full fine-tuning while drastically cutting memory usage and training time.
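To illustrate the idea, here is a minimal LoRA sketch (not MeLo's actual implementation): a frozen linear projection, such as a q/k/v projection inside an attention block, is wrapped with trainable low-rank factors A and B.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Minimal LoRA wrapper: y = Wx + (alpha/r) * B(Ax), with W frozen."""
    def __init__(self, base: nn.Linear, r: int = 4, alpha: float = 4.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)  # freeze the pretrained projection
        # A starts small and B starts at zero, so the wrapped layer is
        # initially identical to the pretrained one.
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.lora_A.T @ self.lora_B.T)
```

Only `lora_A` and `lora_B` receive gradients, so a rank-4 adapter on a projection trains a few thousand parameters instead of the projection's full weight matrix.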
Quick Start & Requirements
Install via pip (assuming the lora package is available or the repo is cloned). A usage walkthrough is provided in examples.ipynb.
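A hypothetical usage sketch follows; the entry-point name and arguments are assumptions based on the fragments above, so consult examples.ipynb for the authoritative walkthrough.

```python
import timm
from lora import LoRA_ViT_timm  # assumed entry point from the cloned repo

# Load a pretrained ViT via timm, then wrap it with rank-4 LoRA adapters.
vit = timm.create_model("vit_base_patch16_224", pretrained=True)
model = LoRA_ViT_timm(vit_model=vit, r=4, num_classes=2)
```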
Highlighted Details

- Uses the timm library for various ViT architectures (see the sketch after this list).
- Includes DeepLab wrappers.
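As a quick sanity check of the claimed parameter savings, the trainable fraction of a wrapped model can be counted. This continues the hypothetical `model` from the quick-start sketch above.

```python
# `model` is the LoRA-wrapped ViT from the quick-start sketch above.
total = sum(p.numel() for p in model.parameters())
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"trainable: {trainable:,} / {total:,} ({100 * trainable / total:.2f}%)")
```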
Maintenance & Community

Relies on lukemelas/PyTorch-Pretrained-ViT for ViT code and weights.

Licensing & Compatibility
Limitations & Caveats
The project is marked with a "[ ] Repo clean up" task, suggesting it may be under active development or not yet fully polished. The README also notes that compatibility with PyTorch versions newer than 1.10.0 is an assumption ("should also work, I guess").