Instruction-Tuning-Papers  by SinclairCoder

Reading list for instruction tuning papers

Created 2 years ago
769 stars

Top 45.4% on SourcePulse

GitHubView on GitHub
Project Summary

This repository serves as a curated reading list for instruction tuning in large language models, tracking the evolution of techniques from early works like Natural-Instruction and FLAN to more recent advancements. It is intended for researchers and practitioners in NLP and LLM development seeking to understand and implement methods for improving model generalization and multi-task learning through natural language instructions.

How It Works

The project compiles a chronological list of research papers that explore instruction tuning. This approach allows users to trace the development of the field, understand the foundational concepts, and identify key methodologies and datasets that have emerged. The papers cover various aspects, including cross-task generalization, zero-shot learning, prompt-based pre-training, and the impact of human feedback or self-generated instructions.

Quick Start & Requirements

This repository is a collection of research papers and does not require installation or execution. All papers are linked via provided URLs.

Highlighted Details

  • Comprehensive list of seminal and recent papers on instruction tuning.
  • Covers a wide range of techniques including prompt-based learning, multi-task fine-tuning, and self-instruct methods.
  • Tracks the trend from ACL 2022 (Natural-Instruction) through ICLR 2022 (FLAN, T0) up to mid-2023.
  • Includes papers focusing on specific domains like biomedical NLP, dialogue systems, and visual instruction tuning.

Maintenance & Community

The repository is maintained by SinclairCoder. There are no explicit mentions of community channels, active development, or a roadmap.

Licensing & Compatibility

The repository itself does not have a specified license. It is a collection of links to external research papers, each with its own licensing and terms of use.

Limitations & Caveats

This repository is a static list of papers and does not provide code, datasets, or implementations. It is purely an informational resource for understanding the research landscape of instruction tuning.

Health Check
Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 30 days

Explore Similar Projects

Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research) and Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

PromptWizard by microsoft

0.4%
4k
Agent-driven framework for task-aware prompt optimization
Created 1 year ago
Updated 1 month ago
Starred by Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), Ross Taylor Ross Taylor(Cofounder of General Reasoning; Cocreator of Papers with Code), and
11 more.

open-instruct by allenai

0.7%
3k
Training codebase for instruction-following language models
Created 2 years ago
Updated 17 hours ago
Feedback? Help us improve.