LLMFarm by guinmoon

iOS/macOS app for local LLM inference

created 2 years ago
1,832 stars

Top 24.1% on sourcepulse

Project Summary

LLMFarm is an iOS and macOS application enabling offline execution of various large language models (LLMs) and multimodal models. It targets developers and power users on Apple platforms seeking to test and deploy LLMs locally, leveraging the GGML library for efficient inference.

How It Works

LLMFarm uses GGML, a machine-learning tensor library written in C, to run LLMs efficiently on Apple hardware. It supports Metal for GPU acceleration on Apple Silicon Macs, enabling faster inference. The application provides a flexible interface for loading diverse model architectures and for configuring various sampling methods to fine-tune output generation.

Quick Start & Requirements

  • Clone the repository with git clone --recurse-submodules https://github.com/guinmoon/LLMFarm (the submodules are required).
  • Requires macOS 13+ or iOS 16+.
  • Metal acceleration is available on Apple Silicon Macs (Intel Macs are not supported for Metal).
  • See FAQ for more details.

Highlighted Details

  • Supports a wide range of LLM architectures including LLaMA, Gemma, Phi, Falcon, Mistral, Mixtral, Mamba, RWKV, and more.
  • Includes multimodal capabilities with support for LLaVA, BakLLaVA, Moondream, and other vision-language models.
  • Offers advanced sampling methods such as temperature, tail-free sampling (TFS), Mirostat, and grammar-based sampling.
  • Features include Apple Shortcuts integration and context state restoration.
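The samplers listed above share a common pattern: reshape the model's raw logits, then draw a token from the resulting distribution. A minimal sketch of temperature plus top-k sampling in Python (an illustration of the idea only, not LLMFarm's actual Swift/GGML implementation):

```python
import math
import random

def sample(logits, temperature=0.8, top_k=40, rng=random):
    """Temperature + top-k sampling over raw logits.

    Lower temperature sharpens the distribution (more deterministic);
    top_k keeps only the k most likely tokens before sampling.
    """
    # Keep the top_k highest-logit candidates as (token_index, logit) pairs.
    candidates = sorted(enumerate(logits), key=lambda p: p[1], reverse=True)[:top_k]
    # Scale by temperature and apply softmax (subtract max for numerical stability).
    m = max(l for _, l in candidates)
    weights = [math.exp((l - m) / temperature) for _, l in candidates]
    total = sum(weights)
    probs = [w / total for w in weights]
    # Draw a token index according to the resulting distribution.
    r = rng.random()
    acc = 0.0
    for (token, _), p in zip(candidates, probs):
        acc += p
        if r <= acc:
            return token
    return candidates[-1][0]
```

As temperature approaches zero this converges to greedy argmax decoding; raising it flattens the distribution and increases variety. Mirostat and grammar-based sampling extend the same logits-to-distribution step with feedback control and structural constraints, respectively.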

Maintenance & Community

  • The core inference engine (llmfarm_core) has been moved to a separate repository.
  • Project relies on and integrates code from rwkv.cpp, Mia, LlamaChat, swift-markdown-ui, and similarity-search-kit.

Licensing & Compatibility

  • The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

  • Metal GPU acceleration is not supported on Intel Macs.
  • The project's licensing status requires clarification for commercial adoption.

Health Check

  • Last commit: 4 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 1
  • Issues (30d): 1

Star History

  • 89 stars in the last 90 days

Explore Similar Projects

Starred by Andrej Karpathy (Founder of Eureka Labs; formerly at Tesla and OpenAI; author of CS 231n), Nat Friedman (former CEO of GitHub), and 32 more.

llama.cpp by ggml-org

  • Top 0.4% • 84k stars
  • C/C++ library for local LLM inference
  • Created 2 years ago; updated 14 hours ago