SiLLM  by armbues

Toolkit for training/running LLMs on Apple Silicon using MLX

created 1 year ago
275 stars

Top 94.9% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

SiLLM is a toolkit designed to simplify the training and inference of Large Language Models (LLMs) on Apple Silicon hardware, leveraging the MLX framework. It caters to researchers and developers looking to experiment with LLMs locally on Macs, offering features for model loading, fine-tuning (LoRA, DPO), and deployment.

How It Works

SiLLM builds upon MLX, providing a streamlined interface for common LLM operations. It supports loading models from various formats (Huggingface, Torch, GGUF, MLX) and includes implementations for LoRA and DPO fine-tuning. The toolkit also offers experimental features like speculative decoding and beam search, aiming to make advanced LLM techniques accessible on Apple Silicon.

Quick Start & Requirements

  • Install via pip: pip install sillm-mlx
  • Requires Apple Silicon hardware.
  • Clone repo for web app: git clone https://github.com/armbues/SiLLM.git
  • Web app setup: cd SiLLM/app && pip install -r requirements.txt && python -m chainlit run app.py -w
  • Official examples: SiLLM-examples

Highlighted Details

  • Supports major model architectures: Llama 2/3, Mistral, Mixtral, Gemma, Phi, Qwen.
  • Offers an OpenAI-compatible API server for deployment.
  • Includes a Chainlit-based web app for a seamless chat experience.
  • Provides tools for model conversion and quantization.

Maintenance & Community

The project is actively maintained by armbues and acknowledges contributions from the MLX community.

Licensing & Compatibility

Licensed under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The project includes experimental features, which may be subject to change or instability. Specific performance benchmarks are not detailed in the README.

Health Check
Last commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
13 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Philipp Schmid Philipp Schmid(DevRel at Google DeepMind), and
2 more.

LightLLM by ModelTC

0.7%
3k
Python framework for LLM inference and serving
created 2 years ago
updated 23 hours ago
Feedback? Help us improve.