Data selection research paper for targeted instruction tuning
Top 65.3% on sourcepulse
This repository provides code for LESS, a method to select influential data for targeted instruction tuning of large language models, aimed at researchers and practitioners seeking to optimize fine-tuning efficiency. It enables users to identify and utilize the most impactful data points for specific downstream tasks, thereby improving model performance and reducing computational costs.
How It Works
LESS operates by calculating an "influence score" for each data point in a training set, based on its impact on a target task's performance. This is achieved by first performing a "warmup" LoRA training on a small subset of data. Then, gradients are collected for the entire training dataset and for validation data specific to the target task. By comparing these gradients, LESS estimates the influence of each training data point on the target task, allowing for targeted selection of the most beneficial data.
Quick Start & Requirements
pip3 install torch==2.1.2 torchvision torchaudio
cd LESS
pip install -r requirement.txt
pip install -e .
meta-llama/Llama-2-7b-hf
as a base model, and data preparation follows the open-instruct
repository.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
9 months ago
1 week