t-few by r-three

Code for parameter-efficient fine-tuning research paper

created 3 years ago
456 stars

Top 67.3% on sourcepulse

Project Summary

This repository provides the official code for the T-Few paper, focusing on parameter-efficient fine-tuning (PEFT) for few-shot learning tasks. It aims to offer a more effective and cost-efficient alternative to in-context learning, achieving state-of-the-art results on benchmarks like RAFT. The target audience includes researchers and practitioners in NLP and machine learning who need to adapt large language models to new tasks with limited data.

How It Works

T-Few implements parameter-efficient fine-tuning techniques, specifically focusing on methods that modify only a small subset of model parameters. This approach contrasts with full fine-tuning and in-context learning, offering a balance between performance and computational cost. The method is designed to be more stable and achieve better generalization than in-context learning, particularly in low-data regimes.
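The core method in the T-Few paper is (IA)^3, which freezes the base model and learns only elementwise rescaling vectors for the attention keys, values, and feed-forward activations. A minimal NumPy sketch of the attention part (simplified single-head attention; the function and argument names here are illustrative, not the repo's API):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def ia3_attention(q, k, v, l_k, l_v):
    """Attention with (IA)^3-style rescaling.

    l_k and l_v are the only trained parameters: learned vectors that
    scale the (frozen) keys and values elementwise.
    """
    k = k * l_k                                  # rescale keys
    v = v * l_v                                  # rescale values
    scores = q @ k.T / np.sqrt(q.shape[-1])      # scaled dot-product
    return softmax(scores) @ v
```

Initializing `l_k` and `l_v` to ones recovers the frozen model's original attention exactly, which is why fine-tuning only these vectors is both cheap and stable.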

Quick Start & Requirements

  • Install: Create a conda environment with Python 3.7 (conda create -n tfew python==3.7), activate it (conda activate tfew), and install dependencies (pip install -r requirements.txt -f https://download.pytorch.org/whl/cu113/torch_stable.html).
  • Prerequisites: CUDA 11.3, Python 3.7, PyTorch. For SAID, run python src/intrinsic_said_setup.py develop.
  • Execution: Run experiments using CUDA_VISIBLE_DEVICES=<gpu_id> python -m src.pl_train -c <config_file1>.json+<config_file2>.json -k <key>=<value> exp_name=<experiment_name>.
  • Resources: Recommended GPU memory: 40GB for T0-3B, 80GB for the full-size T0 (11B).
  • Docs: Configuration and execution details are provided within the README.
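The `-c` flag above joins several JSON config files with `+`, with later files taking precedence. A minimal sketch of how such shallow merging typically works (the helper name `load_configs` is hypothetical; the repo's actual loader may handle nested keys differently):

```python
import json

def load_configs(spec: str) -> dict:
    """Merge '+'-joined JSON config files; later files override earlier keys.

    Illustrative sketch of the '-c a.json+b.json' convention,
    not the repository's actual implementation.
    """
    merged = {}
    for path in spec.split("+"):
        with open(path) as f:
            merged.update(json.load(f))  # shallow merge: last file wins
    return merged
```

Any `-k key=value` override would then be applied on top of the merged dictionary, which is why it always wins over values from the config files.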

Highlighted Details

  • Outperforms in-context learning with GPT-3.
  • Achieves state-of-the-art on the RAFT benchmark.
  • Supports combining multiple configuration files for flexible experiment setup.
  • Includes scripts for running arrays of experiments and generating results tables.

Maintenance & Community

The repository is maintained by r-three, the research group behind the paper. No community channels (Discord, Slack) or active development signals are mentioned in the README.

Licensing & Compatibility

The repository does not explicitly state a license. The included citations are for academic use; commercial use would require clarifying licensing terms with the authors.

Limitations & Caveats

The setup requires a specific older version of Python (3.7) and CUDA (11.3), which may pose compatibility challenges with newer hardware and software stacks. The README does not detail ongoing maintenance or community support.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 week
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 7 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (author of AI Engineering and Designing Machine Learning Systems), Jeff Hammerbacher (cofounder of Cloudera), and 10 more.

open-r1 by huggingface

  • SDK for reproducing DeepSeek-R1
  • Top 0.2%, 25k stars
  • created 6 months ago, updated 3 days ago