awesome-mobile-llm  by stevelaskaridis

LLMs and tools optimized for mobile and embedded hardware

Created 1 year ago
289 stars

Top 91.2% on SourcePulse

GitHubView on GitHub
Project Summary

Summary This repository is a curated "awesome list" for Large Language Models (LLMs) and related research tailored for mobile and embedded hardware. It serves researchers, engineers, and practitioners aiming to deploy LLM technology on resource-constrained devices. The list consolidates information on mobile-first LLMs, deployment infrastructure, benchmarking, optimization techniques, applications, and multimodal models, accelerating edge AI development.

How It Works The project is a structured directory linking to LLMs, papers, code, and deployment frameworks for mobile/embedded LLM inference and training. It categorizes resources into "Mobile-First LLMs," "Infrastructure / Deployment," "Benchmarking," and "Mobile-Specific Optimisations." This approach highlights essential models (e.g., sub-3B parameter LLMs) and tools (e.g., llama.cpp, MLC-LLM, PyTorch ExecuTorch, MLX), enabling quick identification of relevant resources and trends.

Quick Start & Requirements This is a curated list, not a runnable project, so no direct installation steps exist. Users navigate the list to find specific projects. Requirements vary widely, often including specific hardware (mobile, edge devices), OS (Android, iOS), software dependencies (Python, CUDA, ML frameworks), and potentially API keys or datasets. Links to official quick-start guides and documentation are provided for many projects.

Highlighted Details

  • Features a table of sub-3B parameter LLMs for on-device deployment, listing models from major tech companies and research labs (2023-20
Health Check
Last Commit

1 month ago

Responsiveness

1+ week

Pull Requests (30d)
0
Issues (30d)
0
Star History
8 stars in the last 30 days

Explore Similar Projects

Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Gabriel Almeida Gabriel Almeida(Cofounder of Langflow), and
2 more.

torchchat by pytorch

0.1%
4k
PyTorch-native SDK for local LLM inference across diverse platforms
Created 1 year ago
Updated 4 months ago
Feedback? Help us improve.