xTuring  by stochasticai

SDK for fine-tuning and customizing open-source LLMs

Created 2 years ago
2,658 stars

Top 17.8% on SourcePulse

GitHubView on GitHub
Project Summary

xTuring is an open-source library designed for the efficient and accessible fine-tuning of large language models (LLMs). It empowers users, from researchers to developers, to personalize LLMs like LLaMA, Mistral, and GPT-J with their own data, ensuring data privacy by enabling local or private cloud execution.

How It Works

xTuring employs memory-efficient fine-tuning techniques such as LoRA (Low-Rank Adaptation) and quantization (INT4, INT8) to significantly reduce hardware requirements and costs, potentially by up to 90%. This approach allows for faster training epochs and enables fine-tuning on less powerful hardware, making LLM customization more accessible. The library also supports scaling across multiple GPUs for accelerated training and includes features for data preprocessing, model evaluation with metrics like perplexity, and inference on both GPU and CPU.

Quick Start & Requirements

Highlighted Details

  • Supports LLaMA 2, Mistral, Falcon-7B, GPT-J, GPT-2, OPT, Cerebras-GPT, Galactica, and Bloom models.
  • Offers INT4, INT8, and LoRA fine-tuning for reduced memory footprint and cost.
  • Includes CPU inference capabilities leveraging Intel® Extension for Transformers.
  • Provides a UI playground for interactive fine-tuning and experimentation.

Maintenance & Community

  • Active development with recent updates including LLaMA 2 integration, evaluation features, and INT4 precision.
  • Community support via Discord: https://discord.gg/TgHXuSJEk6

Licensing & Compatibility

  • Licensed under Apache License 2.0.
  • Permissive license suitable for commercial use and integration with closed-source applications.

Limitations & Caveats

The roadmap indicates future support for INT3, INT2, and INT1 low-precision fine-tuning, suggesting these are not yet implemented. Stable Diffusion support is also listed as a future item.

Health Check
Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Ying Sheng Ying Sheng(Coauthor of SGLang) and Stas Bekman Stas Bekman(Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake).

llm-analysis by cli99

0.4%
455
CLI tool for LLM latency/memory analysis during training/inference
Created 2 years ago
Updated 5 months ago
Starred by Wing Lian Wing Lian(Founder of Axolotl AI) and Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

airllm by lyogavin

0.1%
6k
Inference optimization for LLMs on low-resource hardware
Created 2 years ago
Updated 2 weeks ago
Starred by Junyang Lin Junyang Lin(Core Maintainer at Alibaba Qwen), Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), and
25 more.

alpaca-lora by tloen

0.0%
19k
LoRA fine-tuning for LLaMA
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.