SiLLM  by armbues

Toolkit for training/running LLMs on Apple Silicon using MLX

Created 1 year ago
280 stars

Top 93.0% on SourcePulse

GitHubView on GitHub
Project Summary

SiLLM is a toolkit designed to simplify the training and inference of Large Language Models (LLMs) on Apple Silicon hardware, leveraging the MLX framework. It caters to researchers and developers looking to experiment with LLMs locally on Macs, offering features for model loading, fine-tuning (LoRA, DPO), and deployment.

How It Works

SiLLM builds upon MLX, providing a streamlined interface for common LLM operations. It supports loading models from various formats (Huggingface, Torch, GGUF, MLX) and includes implementations for LoRA and DPO fine-tuning. The toolkit also offers experimental features like speculative decoding and beam search, aiming to make advanced LLM techniques accessible on Apple Silicon.

Quick Start & Requirements

  • Install via pip: pip install sillm-mlx
  • Requires Apple Silicon hardware.
  • Clone repo for web app: git clone https://github.com/armbues/SiLLM.git
  • Web app setup: cd SiLLM/app && pip install -r requirements.txt && python -m chainlit run app.py -w
  • Official examples: SiLLM-examples

Highlighted Details

  • Supports major model architectures: Llama 2/3, Mistral, Mixtral, Gemma, Phi, Qwen.
  • Offers an OpenAI-compatible API server for deployment.
  • Includes a Chainlit-based web app for a seamless chat experience.
  • Provides tools for model conversion and quantization.

Maintenance & Community

The project is actively maintained by armbues and acknowledges contributions from the MLX community.

Licensing & Compatibility

Licensed under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The project includes experimental features, which may be subject to change or instability. Specific performance benchmarks are not detailed in the README.

Health Check
Last Commit

3 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Gabriel Almeida Gabriel Almeida(Cofounder of Langflow), and
2 more.

torchchat by pytorch

0.1%
4k
PyTorch-native SDK for local LLM inference across diverse platforms
Created 1 year ago
Updated 1 week ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Stefan van der Walt Stefan van der Walt(Core Contributor to scientific Python ecosystem), and
12 more.

litgpt by Lightning-AI

0.1%
13k
LLM SDK for pretraining, finetuning, and deploying 20+ high-performance LLMs
Created 2 years ago
Updated 5 days ago
Feedback? Help us improve.