axolotl by axolotl-ai-cloud

CLI tool for streamlined post-training of AI models

created 2 years ago
10,074 stars

Top 5.1% on sourcepulse

View on GitHub
Project Summary

Axolotl is a comprehensive toolkit for streamlining the post-training of large language models, aimed at AI researchers and engineers. It simplifies complex fine-tuning workflows such as LoRA, QLoRA, and full-parameter fine-tuning, enabling efficient customization of pre-trained models.

How It Works

Axolotl leverages a YAML configuration system to manage the entire model training pipeline, from data preprocessing to inference and evaluation. This approach offers a unified and reproducible workflow. It integrates with performance-enhancing libraries such as xformers, Flash Attention, and various multi-GPU strategies (FSDP, DeepSpeed), aiming to maximize training speed and efficiency.
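
For concreteness, a minimal LoRA-style config might look like the sketch below. Key names follow Axolotl's published example configs, but exact fields, defaults, and supported values vary by Axolotl version and base model, so treat it as illustrative rather than a working recipe:

    # Illustrative LoRA config; values are placeholders, not recommendations
    base_model: meta-llama/Llama-3.2-1B
    load_in_8bit: true
    adapter: lora
    lora_r: 16
    lora_alpha: 32
    lora_dropout: 0.05
    lora_target_linear: true

    # dataset preprocessing: source path plus prompt format
    datasets:
      - path: teknium/GPT4-LLM-Cleaned
        type: alpaca
    val_set_size: 0.05

    # packing and attention optimizations
    sequence_len: 2048
    sample_packing: true
    flash_attention: true

    # optimizer and schedule
    micro_batch_size: 2
    gradient_accumulation_steps: 4
    num_epochs: 1
    learning_rate: 2e-4
    optimizer: adamw_torch
    lr_scheduler: cosine
    bf16: auto

    output_dir: ./outputs/lora-out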

Quick Start & Requirements

  • Installation: pip install --no-build-isolation axolotl[flash-attn,deepspeed]
  • Prerequisites: NVIDIA GPU (Ampere+ recommended for bf16/Flash Attention), Python 3.11, PyTorch ≥2.4.1.
  • Examples: Fetch example configurations via axolotl fetch examples.
  • First Fine-tune: Run axolotl train examples/llama-3/lora-1b.yml (a combined session sketch follows this list).
  • Documentation: Getting Started Guide
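
Taken together, a first run is roughly the shell session below (the commands mirror the bullets above; quoting the pip extras avoids globbing in some shells):

    # install with the optional Flash Attention and DeepSpeed extras
    pip install --no-build-isolation "axolotl[flash-attn,deepspeed]"

    # download the bundled example configs (into ./examples by default)
    axolotl fetch examples

    # launch a small LoRA fine-tune from one of those configs
    axolotl train examples/llama-3/lora-1b.yml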

Highlighted Details

  • Supports a wide range of Hugging Face models including LLaMA, Mistral, Mixtral, Falcon, and Pythia.
  • Offers multiple training methods: full fine-tuning, LoRA, QLoRA, ReLoRA, and GPTQ (see the config sketch after this list).
  • Features performance optimizations like Flash Attention, xformers, and multi-packing.
  • Integrates with logging platforms like Weights & Biases, MLflow, and Comet.
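
As a sketch of how these options surface in the same YAML config (key names are taken from Axolotl's example configs; the project and run names below are placeholders):

    # switch the LoRA sketch above to QLoRA by quantizing the base model to 4-bit
    adapter: qlora
    load_in_4bit: true

    # multipacking and Flash Attention are single-flag toggles
    sample_packing: true
    flash_attention: true

    # Weights & Biases logging (MLflow and Comet are configured with analogous keys)
    wandb_project: my-axolotl-runs
    wandb_name: llama3-qlora-test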

Maintenance & Community

The project is sponsored by Modal. Community support is available via Discord.

Licensing & Compatibility

Licensed under the Apache 2.0 License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

Per the project's model-compatibility table, certain combinations of models and optimizations (e.g., Mixtral-MoE, Falcon, or Pythia with GPTQ or Flash Attention) are marked as untested or unsupported.

Health Check

  • Last commit: 21 hours ago
  • Responsiveness: 1 day
  • Pull requests (30d): 123
  • Issues (30d): 17

Star History

878 stars in the last 90 days

Explore Similar Projects

Starred by Jeff Hammerbacher (Cofounder of Cloudera) and Stas Bekman (Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

InternEvo by InternLM

Lightweight training framework for model pre-training

Top 1.0% on sourcepulse · 402 stars
created 1 year ago · updated 1 week ago