TRL (Transformer Reinforcement Learning) is a Python library for post-training foundation models with techniques such as Supervised Fine-Tuning (SFT), Proximal Policy Optimization (PPO), and Direct Preference Optimization (DPO). It targets researchers and engineers working with large language models, integrates tightly with the Hugging Face ecosystem, and scales from a single GPU to multi-node clusters.
How It Works
TRL provides specialized trainer classes (SFTTrainer, GRPOTrainer, DPOTrainer, RewardTrainer) that wrap the 🤗 Transformers Trainer. Because they build on Trainer, they inherit distributed training support (DDP, DeepSpeed ZeRO, FSDP) out of the box and can fine-tune large models on modest hardware through 🤗 PEFT (LoRA/QLoRA) and Unsloth's optimized kernels.
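As an illustration, a minimal SFT run with a LoRA adapter might look like the sketch below; the model id, dataset name, and output directory are placeholders, and argument names can differ slightly between TRL versions.

from datasets import load_dataset
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

# Placeholder dataset: any text or conversational dataset on the Hub works.
dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",                   # placeholder model id; a preloaded model also works
    train_dataset=dataset,
    args=SFTConfig(output_dir="qwen-0.5b-sft"),  # extends transformers.TrainingArguments
    peft_config=LoraConfig(task_type="CAUSAL_LM", r=16, lora_alpha=32),  # train a LoRA adapter instead of full weights
)
trainer.train()

Multi-GPU and multi-node runs typically reuse the same script through accelerate launch, which is also where the DDP, DeepSpeed ZeRO, or FSDP backend is configured.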
Quick Start & Requirements
pip install trl
trl sft --model_name_or_path ...   # supervised fine-tuning
trl dpo --model_name_or_path ...   # direct preference optimization
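Each CLI entry point has a Python counterpart. The sketch below shows a rough DPO setup; the model and dataset names are placeholders, and recent TRL versions take the tokenizer as processing_class (older releases used a tokenizer argument).

from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "Qwen/Qwen2.5-0.5B-Instruct"          # placeholder model id
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Preference data with "chosen"/"rejected" pairs; this dataset name is illustrative.
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

trainer = DPOTrainer(
    model=model,                                  # the reference model is derived automatically if not given
    args=DPOConfig(output_dir="qwen-0.5b-dpo"),
    train_dataset=dataset,
    processing_class=tokenizer,
)
trainer.train()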
Highlighted Details
Maintenance & Community
TRL is developed and maintained by Hugging Face; development, releases, and issue tracking take place in the huggingface/trl repository on GitHub.
Licensing & Compatibility
TRL is released under the Apache License 2.0 and is designed to work alongside the 🤗 Transformers, Datasets, Accelerate, and PEFT libraries.
Limitations & Caveats
The library focuses on transformer-based models and requires familiarity with the Hugging Face ecosystem for advanced customization.