turbo-alignment by turbo-llm

Library for LLM industrial alignment

Created 1 year ago · 398 stars · Top 73.7% on sourcepulse

Project Summary

Turbo-Alignment is a Python library designed for industrial-scale fine-tuning and alignment of large language models. It targets ML engineers and researchers seeking efficient, end-to-end pipelines for tasks like Supervised Fine-Tuning (SFT), Reward Modeling (RM), and Direct Preference Optimization (DPO), offering streamlined deployment of new methods and comprehensive logging.
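To make the DPO objective concrete, here is the standard loss from the original DPO paper as a short PyTorch sketch. This is the textbook formula under the usual definitions (summed per-token log-probabilities for each answer), not turbo-alignment's actual trainer code.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Standard DPO objective: push the policy to prefer the chosen answer
    over the rejected one, with beta controlling how far the policy may
    drift from the frozen reference model."""
    chosen_logratio = policy_chosen_logps - ref_chosen_logps
    rejected_logratio = policy_rejected_logps - ref_rejected_logps
    return -F.logsigmoid(beta * (chosen_logratio - rejected_logratio)).mean()
```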

How It Works

The library provides an end-to-end pipeline from data preprocessing to model alignment, supporting various alignment methods including SFT, RM, Offline Preference Optimization, and Online Preference Optimization. It integrates with vLLM for fast inference and includes a wide array of metrics like Self-BLEU, KL divergence, and diversity for comprehensive evaluation.
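As an illustration of the diversity metrics named above, the following is a minimal Self-BLEU sketch written from the metric's standard definition (each generation is scored against all the others as references; lower means more diverse). It is not turbo-alignment's implementation.

```python
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

def self_bleu(generations: list[str]) -> float:
    """Average BLEU of each generation against all other generations.
    Requires at least two generations."""
    smooth = SmoothingFunction().method1
    scores = []
    for i, hypothesis in enumerate(generations):
        references = [g.split() for j, g in enumerate(generations) if j != i]
        scores.append(sentence_bleu(references, hypothesis.split(),
                                    smoothing_function=smooth))
    return sum(scores) / len(scores)

print(self_bleu(["the cat sat down", "the dog ran off", "a bird flew away"]))
```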

Quick Start & Requirements

  • Install via pip: pip install turbo-alignment
  • For latest features: pip install git+https://github.com/turbo-llm/turbo-alignment.git
  • Development setup requires poetry install.
  • Requires datasets formatted as ChatDataset or PairPreferencesDataset (see the record sketch after this list).
  • Official guides and tutorials are available in the project repository.
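The exact on-disk schema is documented in the official guides; purely as a hypothetical illustration, chat and pairwise-preference records in JSONL often look like the following. Field names here (messages, answer_w, answer_l, and so on) are assumptions based on common conventions, not the library's confirmed schema.

```python
import json

# Hypothetical chat-style record (SFT): a conversation as a list of turns.
chat_record = {
    "id": "0",
    "messages": [
        {"role": "user", "content": "What is model alignment?"},
        {"role": "bot", "content": "Training a model to follow human intent."},
    ],
}

# Hypothetical pairwise-preference record (RM/DPO): shared context plus a
# preferred ("answer_w") and rejected ("answer_l") completion.
pair_record = {
    "id": "0",
    "context": [{"role": "user", "content": "What is model alignment?"}],
    "answer_w": {"role": "bot", "content": "A clear, accurate explanation."},
    "answer_l": {"role": "bot", "content": "An off-topic reply."},
}

with open("train_chat.jsonl", "w") as f:
    f.write(json.dumps(chat_record) + "\n")
```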

Highlighted Details

  • Supports Supervised Fine-Tuning, Reward Modeling, Offline and Online Preference Optimization.
  • Implements metrics: Accuracy, Distinctness, Diversity, Self-BLEU, KL divergence, Reward, Length, Perplexity (a KL sketch follows this list).
  • Optimized for fast inference using vLLM.
  • End-to-end pipelines from data preprocessing to model alignment.
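For the KL-divergence metric listed above, a textbook per-token estimator between a fine-tuned policy and its frozen reference model looks like the sketch below; this is the standard formula, not the library's own code.

```python
import torch

def sequence_kl(policy_logits: torch.Tensor,
                ref_logits: torch.Tensor,
                mask: torch.Tensor) -> torch.Tensor:
    """Mean KL(policy || reference) over non-padding token positions.

    policy_logits, ref_logits: (batch, seq_len, vocab)
    mask: (batch, seq_len), 1 for real tokens, 0 for padding.
    """
    policy_logp = torch.log_softmax(policy_logits, dim=-1)
    ref_logp = torch.log_softmax(ref_logits, dim=-1)
    # KL(p || q) = sum_v p(v) * (log p(v) - log q(v)), per token position.
    token_kl = (policy_logp.exp() * (policy_logp - ref_logp)).sum(dim=-1)
    return (token_kl * mask).sum() / mask.sum()
```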

Maintenance & Community

The project credits implementations from Hugging Face's TRL, AllenNLP, and LinkedIn's Liger-Kernel. Contribution guidelines and development-environment setup instructions are provided.

Licensing & Compatibility

The license terms are specified in the repository's LICENSE file. Whether the project is suitable for commercial use or closed-source linking is not explicitly documented.

Limitations & Caveats

The roadmap indicates that online RL methods (PPO, REINFORCE), distributed training, and low-memory training approaches are still in progress.

Health Check

  • Last commit: 3 days ago
  • Responsiveness: Inactive
  • Pull requests (30d): 2
  • Issues (30d): 0
  • Star history: 12 stars in the last 90 days
