weak-to-strong by openai

Weak-to-strong generalization research paper implementation

Created 2 years ago

2,550 stars

Top 18.2% on SourcePulse

9 Experts Love This Project

jaredpalmer

SVP at GitHub; Founder of Turborepo; Author of Formik, TSDX

Edward-Sun

Research Scientist at Meta Superintelligence Lab

vincentweisser

Vincent Weisser

Cofounder of Prime Intellect

Jiayi-Pan

Author of SWE-Gym; MTS at xAI

and 5 more!

Project Summary

This repository provides code for implementing the "weak-to-strong generalization" technique, enabling the training of powerful models using labels generated by weaker, less capable models. It's designed for researchers and practitioners in machine learning, particularly those working with large language models and computer vision tasks, to improve model performance and data efficiency.

How It Works

The core approach involves fine-tuning a strong model using labels derived from a weaker model, potentially with auxiliary losses like confidence weighting. The sweep.py script orchestrates this by first training ground truth models for specified sizes and then iteratively training stronger models using the labels from weaker ones. This method aims to transfer knowledge effectively, reducing the need for extensive human-labeled data for high-performance models.

Quick Start & Requirements

Install dependencies using pip: pip install .
Requires Python.
See notebooks/Plotting.ipynb for plotting results.

Highlighted Details

Implements weak-to-strong learning for binary classification tasks.
Supports fine-tuning pretrained language models and training against model-generated labels.
Includes code for weak-to-strong generalization in vision models (AlexNet -> DINO on ImageNet).
Offers various loss functions described in the paper, including confidence auxiliary loss.

Maintenance & Community

Authors include Adrien Ecoffet, Manas Joglekar, Jeffrey Wu, Jan Hendrik Kirchner, and Pavel Izmailov (vision).
Acknowledges Hugging Face for their transformer models.

Licensing & Compatibility

Licensed under the MIT License.
Compatible with commercial use and closed-source linking.

Limitations & Caveats

The codebase is noted as not well-tested and may not use the exact settings from the paper, though it aims for qualitatively similar results.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

1 stars in the last 30 days

Explore Similar Projects

ml-papers by rosinality

Collection of ML papers and reviews

Created 4 years ago

Updated 2 years ago

fancy-nlp by boat-group

NLP toolkit for rapid prototyping and deployment

Created 6 years ago

Updated 3 years ago

nlp_notes by YangBin1729

NLP notes for ML/DL principles, examples, and model deployment

Created 6 years ago

Updated 5 years ago

Starred by

Robert Stojnic

Robert Stojnic(Cocreator of Papers with Code).

finetune by IndicoDataSolutions

NLP finetuning library with scikit-learn style API

Created 7 years ago

Updated 2 months ago

Pre-trained-Models by loujie0822

NLP pre-trained model overview

Created 6 years ago

Updated 5 years ago

Starred by

Gabriel Almeida

Gabriel Almeida(Cofounder of Langflow),

Tristan Hume

Tristan Hume(MTS at Anthropic), and

2 more.

texar-pytorch by asyml

PyTorch toolkit for NLP and text generation research

Created 6 years ago

Updated 3 years ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera) and

Lysandre Debut

Lysandre Debut(Chief Open-Source Officer at Hugging Face).

dllm by ZHZisZZ

Framework for diffusion language modeling

Created 3 months ago

Updated 4 days ago

Bert-Multi-Label-Text-Classification by lonePatient

PyTorch code for multi-label text classification

Created 7 years ago

Updated 2 years ago

Starred by

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI) and

Junyang Lin

Junyang Lin(Core Maintainer at Alibaba Qwen).

EasyNLP by alibaba

NLP toolkit for easy model training, inference, and deployment

Created 3 years ago

Updated 1 year ago

Starred by

Malte Pietsch

Malte Pietsch(Cofounder of deepset) and

Shizhe Diao

Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA).

pet by timoschick

Code for pattern-exploiting training (PET) research paper

Created 5 years ago

Updated 2 years ago

Starred by

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind),

Omar Khattab

Omar Khattab(Coauthor of DSPy, ColBERT; Professor at MIT), and

15 more.

gpt-neo by EleutherAI

GPT-2/3-style model implementation using mesh-tensorflow

Created 5 years ago

Updated 3 years ago

Starred by

Lilian Weng

Lilian Weng(Cofounder of Thinking Machines Lab),

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity), and

45 more.

fairseq by facebookresearch

Sequence modeling toolkit for translation, language modeling, and text generation research

Created 8 years ago

Updated 3 months ago

Feedback? Help us improve.