Library for LLM industrial alignment
Top 73.7% on sourcepulse
Turbo-Alignment is a Python library designed for industrial-scale fine-tuning and alignment of large language models. It targets ML engineers and researchers seeking efficient, end-to-end pipelines for tasks like Supervised Fine-Tuning (SFT), Reward Modeling (RM), and Direct Preference Optimization (DPO), offering streamlined deployment of new methods and comprehensive logging.
How It Works
The library provides an end-to-end pipeline from data preprocessing to model alignment, supporting various alignment methods including SFT, RM, Offline Preference Optimization, and Online Preference Optimization. It integrates with vLLM for fast inference and includes a wide array of metrics like Self-BLEU, KL divergence, and diversity for comprehensive evaluation.
Quick Start & Requirements
pip install turbo-alignment
pip install git+https://github.com/turbo-llm/turbo-alignment.git
poetry install
.ChatDataset
or PairPreferencesDataset
.Highlighted Details
Maintenance & Community
The project references implementations from Hugging Face's TRL, AllenNLP, and LinkedIn's Liger-Kernel. Contribution guidelines and a development environment setup are provided.
Licensing & Compatibility
The project is licensed under a specific license detailed in the LICENSE file. Compatibility for commercial use or closed-source linking is not explicitly detailed.
Limitations & Caveats
The roadmap indicates that Online RL methods (PPO, Reinforce), distributed training, and low-memory training approaches are still in progress.
3 days ago
Inactive