awesome-AI-system  by lambda7xx

AI system papers and code

Created 2 years ago
327 stars

Top 83.4% on SourcePulse

GitHubView on GitHub
Project Summary

This repository serves as a curated collection of papers and their associated code for AI systems, focusing on large language models (LLMs) and related technologies. It targets researchers and engineers working on optimizing LLM inference, training, evaluation, and deployment, providing a valuable resource for understanding the state-of-the-art in system design for AI.

How It Works

The repository categorizes papers and code across various domains within AI systems, including LLM serving frameworks, inference systems (both system and AI sides), evaluation platforms, RLHF, Mixture-of-Experts (MoE), LoRA, parallelism strategies, GPU cluster management, and optimization techniques. It aims to consolidate cutting-edge research, offering links to papers and code repositories for each entry.

Quick Start & Requirements

This repository is a collection of links and does not have a direct installation or execution command. Users are expected to navigate to the linked paper or code repositories for specific setup and requirements.

Highlighted Details

  • Comprehensive coverage of LLM inference systems, with numerous entries from top-tier conferences (e.g., OSDI, ASPLOS, MLSys) published in 2023-2025.
  • Extensive lists of papers and code for distributed training parallelism strategies, including pipeline, data, and tensor parallelism.
  • Detailed sections on Mixture-of-Experts (MoE) and LoRA, covering both training and serving aspects.
  • Includes research on GPU cluster management and resource scheduling for AI workloads.

Maintenance & Community

The repository is open for contributions via issues or pull requests. It lists numerous researchers from prominent universities, indicating a strong academic backing.

Licensing & Compatibility

The licensing of individual linked projects varies, as this repository itself is a collection of pointers. Users must consult the licenses of the respective code repositories for compatibility and usage restrictions.

Limitations & Caveats

This repository is a curated list and does not provide any executable code or unified framework. Users must independently locate, set up, and integrate the individual projects. The "paper and its code" claim may not always hold true for every entry, as some links might only point to papers.

Health Check
Last Commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
6 stars in the last 30 days

Explore Similar Projects

Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Gabriel Almeida Gabriel Almeida(Cofounder of Langflow), and
2 more.

torchchat by pytorch

0.1%
4k
PyTorch-native SDK for local LLM inference across diverse platforms
Created 1 year ago
Updated 1 week ago
Starred by Yineng Zhang Yineng Zhang(Inference Lead at SGLang; Research Scientist at Together AI), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
9 more.

LightLLM by ModelTC

0.5%
4k
Python framework for LLM inference and serving
Created 2 years ago
Updated 14 hours ago
Starred by Jason Knight Jason Knight(Director AI Compilers at NVIDIA; Cofounder of OctoML), Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), and
11 more.

mistral.rs by EricLBuehler

0.3%
6k
LLM inference engine for blazing fast performance
Created 1 year ago
Updated 1 day ago
Feedback? Help us improve.