Discover and explore top open-source AI tools and projects—updated daily.
JAX-native library for efficient LLM post-training
Top 25.4% on SourcePulse
A JAX-native library for Large Language Model (LLM) post-training, Tunix streamlines supervised fine-tuning, reinforcement learning (RL), and knowledge distillation. It targets researchers and engineers seeking efficient, scalable LLM adaptation on accelerators, leveraging JAX for high-performance computation and seamless integration with Flax NNX. The library aims to simplify complex post-training workflows, offering a modular and extensible framework.
How It Works
Tunix is built upon JAX, enabling accelerated, distributed computation, particularly on TPUs. It supports various post-training methodologies, including parameter-efficient fine-tuning (PEFT) via LoRA/Q-LoRA, multiple RL algorithms like PPO, GRPO, and GSPO-token, and preference alignment through Direct Preference Optimization (DPO). For knowledge distillation, it offers strategies such as matching output probability distributions (Logit Strategy), aligning attention mechanisms, and matching intermediate feature representations. The architecture emphasizes modularity for customization and efficiency, with native support for common model sharding strategies (DP, FSDP, TP) designed for multi-host distributed training.
Quick Start & Requirements
pip install "tunix[prod]"
. Latest from GitHub: pip install git+https://github.com/google/tunix
. Editable install for development: git clone https://github.com/google/tunix.git && cd tunix && pip install -e ".[dev]"
.Highlighted Details
Maintenance & Community
Tunix is in "Early Development," with active expansion of capabilities and performance improvements planned. Contributions are welcomed, with a draft contribution process available. Users can engage via the Tunix GitHub discussion forum for feature requests, issues, and questions. A notable collaboration with GRL (Game Reinforcement Learning) integrates seamless TPU support for scalable RL experiments.
Licensing & Compatibility
The provided README does not specify a software license. This omission requires further investigation for compatibility with commercial or closed-source projects.
Limitations & Caveats
The library is in early development, indicating ongoing work on features, usability, and performance optimization. The contribution process is still being formalized. Specific limitations regarding unsupported platforms or known bugs are not detailed in the provided text.
10 hours ago
Inactive