Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Zhuohan Li
Zhuohan Li
Coauthor of vLLM
GitHub
Starred Projects (106)
TileGym
by
NVIDIA
4.5%
554
CUDA Tile kernel library for efficient GPU programming
Starred by
Created 1 month ago
Updated 3 days ago
SkyRL
by
NovaSky-AI
1.4%
1k
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
Starred by
+12
Created 8 months ago
Updated 1 day ago
checkpoint-engine
by
MoonshotAI
0.6%
886
Middleware for efficient LLM weight updates during inference
Starred by
+3
Created 4 months ago
Updated 2 days ago
batch_invariant_ops
by
thinking-machines-lab
0.2%
946
Enhance LLM inference determinism
Starred by
+1
Created 4 months ago
Updated 2 months ago
wechat-bot
by
wangrongding
0.9%
10k
WeChat bot integrating multiple AI services
Created 4 years ago
Updated 3 days ago
harmony
by
openai
0.3%
4k
Renderer for OpenAI's harmony response format
Starred by
+9
Created 5 months ago
Updated 3 weeks ago
gpt-oss
by
openai
0.3%
20k
Open-weight LLMs for reasoning and agents
Starred by
+15
Created 6 months ago
Updated 2 months ago
mirage
by
mirage-project
1.6%
2k
Tool for fast GPU kernel generation via superoptimization
Starred by
+1
Created 1 year ago
Updated 3 days ago
Hunyuan3D-2.1
by
Tencent-Hunyuan
2.3%
3k
Image to 3D asset generation with PBR materials
Starred by
Created 7 months ago
Updated 2 months ago
torchtitan
by
pytorch
0.6%
5k
PyTorch platform for generative AI model training research
Starred by
+12
Created 2 years ago
Updated 1 day ago
tilelang
by
tile-ai
4.2%
5k
DSL for high-performance GPU/CPU kernel development (GEMM, attention, etc.)
Starred by
+2
Created 1 year ago
Updated 1 day ago
3FS
by
deepseek-ai
0.3%
10k
Distributed file system for AI training/inference workloads
Starred by
+6
Created 10 months ago
Updated 5 days ago
DeepGEMM
by
deepseek-ai
0.4%
6k
CUDA library for efficient FP8 GEMM kernels with fine-grained scaling
Starred by
+6
Created 11 months ago
Updated 5 days ago
open-infra-index
by
deepseek-ai
0.0%
8k
AI infrastructure tools for efficient AGI development
Starred by
+15
Created 10 months ago
Updated 8 months ago
mochi
by
genmoai
0.3%
4k
Video generation model
Starred by
+6
Created 1 year ago
Updated 1 month ago
llm-compressor
by
vllm-project
1.6%
3k
Transformers-compatible library for LLM compression, optimized for vLLM deployment
Starred by
+3
Created 1 year ago
Updated 18 hours ago
Nanoflow
by
efeslab
0.7%
937
LLM serving framework for high throughput
Starred by
Created 1 year ago
Updated 2 months ago
vattention
by
microsoft
0%
454
Memory manager for LLM serving systems
Created 1 year ago
Updated 7 months ago
lm-evaluation-harness
by
EleutherAI
0.5%
11k
Framework for few-shot language model evaluation
Starred by
+18
Created 5 years ago
Updated 4 days ago
mistral.rs
by
EricLBuehler
0.4%
6k
LLM inference engine for blazing fast performance
Starred by
+9
Created 1 year ago
Updated 2 days ago
OpenHands
by
OpenHands
0.4%
66k
AI platform for software development agents
Starred by
+36
Created 1 year ago
Updated 18 hours ago
ThunderKittens
by
HazyResearch
0.5%
3k
CUDA kernel framework for fast deep learning primitives
Starred by
+14
Created 1 year ago
Updated 16 hours ago
arena-hard-auto
by
lmarena
0.3%
979
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 2 years ago
Updated 6 months ago
simple-evals
by
openai
0.7%
4k
Lightweight library for evaluating language models
Starred by
+14
Created 1 year ago
Updated 5 months ago
calm
by
zeux
0%
624
Single-GPU inference engine for rapid LLM prototyping
Starred by
Created 2 years ago
Updated 7 months ago
dspy
by
stanfordnlp
0.7%
31k
Framework for programming language models, not prompting
Starred by
+49
Created 3 years ago
Updated 3 days ago
Consistency_LLM
by
hao-ai-lab
0.2%
412
Parallel decoder for efficient LLM inference
Starred by
Created 2 years ago
Updated 1 year ago
grok-1
by
xai-org
0.0%
51k
JAX example code for loading and running Grok-1 open-weights model
Starred by
+22
Created 1 year ago
Updated 1 year ago
mlc-llm
by
mlc-ai
0.2%
22k
Universal LLM deployment engine with ML compilation
Starred by
+21
Created 2 years ago
Updated 1 week ago
kserve
by
kserve
0.5%
5k
Kubernetes CRD for scalable ML model serving
Starred by
Created 6 years ago
Updated 4 days ago
LMFlow
by
OptimalScale
0.1%
9k
Toolkit for finetuning and inference of large foundation models
Starred by
+9
Created 2 years ago
Updated 3 days ago
TransformerEngine
by
NVIDIA
0.9%
3k
Library for Transformer model acceleration on NVIDIA GPUs
Starred by
+4
Created 3 years ago
Updated 1 day ago
LWM
by
LargeWorldModel
0.1%
7k
Multimodal autoregressive model for long-context video/text
Starred by
+6
Created 1 year ago
Updated 1 year ago
search_with_lepton
by
leptonai
0.1%
8k
Conversational search engine demo
Starred by
+9
Created 1 year ago
Updated 1 month ago
llama_index
by
run-llama
0.3%
46k
Data framework for building LLM-powered agents
Starred by
+44
Created 3 years ago
Updated 3 days ago
marlin
by
IST-DASLab
0.3%
980
FP16xINT4 kernel for fast LLM inference
Starred by
Created 2 years ago
Updated 1 year ago
sglang
by
sgl-project
0.9%
22k
Fast serving framework for LLMs and vision language models
Starred by
+34
Created 2 years ago
Updated 14 hours ago
LLaVA
by
haotian-liu
0.1%
24k
Multimodal assistant with GPT-4 level capabilities
Starred by
+16
Created 2 years ago
Updated 1 year ago
megablocks
by
databricks
0.2%
2k
Lightweight library for mixture-of-experts (MoE) training
Starred by
+15
Created 3 years ago
Updated 6 months ago
flashinfer
by
flashinfer-ai
3.5%
5k
Kernel library for LLM serving
Starred by
+11
Created 2 years ago
Updated 14 hours ago
gpt-fast
by
meta-pytorch
0.1%
6k
PyTorch text generation for efficient transformer inference
Starred by
+20
Created 2 years ago
Updated 4 months ago
LookaheadDecoding
by
hao-ai-lab
0.2%
1k
Parallel decoding algorithm for faster LLM inference
Starred by
Created 2 years ago
Updated 10 months ago
axolotl
by
axolotl-ai-cloud
0.3%
11k
CLI tool for streamlined post-training of AI models
Starred by
+25
Created 2 years ago
Updated 2 days ago
TensorRT-LLM
by
NVIDIA
0.6%
13k
LLM inference optimization SDK for NVIDIA GPUs
Starred by
+18
Created 2 years ago
Updated 15 hours ago
letta
by
letta-ai
0.6%
21k
Agent framework for stateful agents with memory, reasoning, and context management
Starred by
+18
Created 2 years ago
Updated 1 week ago
streaming-llm
by
mit-han-lab
0.1%
7k
Framework for efficient LLM streaming
Starred by
+2
Created 2 years ago
Updated 1 year ago
llm-engine
by
scaleapi
0.2%
820
Open-source engine for fine-tuning and serving LLMs
Starred by
+3
Created 2 years ago
Updated 1 day ago
scalene
by
plasma-umass
0.2%
13k
Python profiler with AI-powered optimization proposals
Starred by
+14
Created 6 years ago
Updated 2 weeks ago
Medusa
by
FasterDecoding
0.1%
3k
Framework for accelerating LLM generation using multiple decoding heads
Starred by
+6
Created 2 years ago
Updated 1 year ago
outlines
by
dottxt-ai
0.2%
13k
SDK for structured LLM text generation
Starred by
+34
Created 2 years ago
Updated 2 days ago
llm-awq
by
mit-han-lab
0.1%
3k
Weight quantization research paper for LLM compression/acceleration
Starred by
+4
Created 2 years ago
Updated 5 months ago
llama-cookbook
by
meta-llama
0.1%
18k
Guide for building with Llama models
Starred by
+15
Created 2 years ago
Updated 2 months ago
openchat
by
imoneoi
0.1%
5k
Open-source LLM fine-tuned with C-RLFT, inspired by offline reinforcement learning
Starred by
+4
Created 2 years ago
Updated 1 year ago
flash-attention
by
Dao-AILab
0.6%
22k
Fast, memory-efficient attention implementation
Starred by
+31
Created 3 years ago
Updated 1 day ago
Dromedary
by
IBM
0%
1k
Self-aligned language model research paper with minimal human supervision
Starred by
Created 2 years ago
Updated 3 months ago
LLMSurvey
by
RUCAIBox
0.1%
12k
Survey paper for large language models
Starred by
+2
Created 2 years ago
Updated 10 months ago
vllm
by
vllm-project
0.7%
67k
LLM serving engine for high-throughput, memory-efficient inference
Starred by
+57
Created 2 years ago
Updated 14 hours ago
tabby
by
TabbyML
0.2%
33k
Self-hosted AI coding assistant for on-prem code completion
Starred by
+17
Created 2 years ago
Updated 5 days ago
LongChat
by
DachengLi1
0%
533
Long-context LLM chatbot training and evaluation framework
Starred by
+2
Created 2 years ago
Updated 1 year ago
gorilla
by
ShishirPatil
0.1%
13k
LLM tool-use framework for API invocation and function calling
Starred by
+15
Created 2 years ago
Updated 1 week ago
gorilla-cli
by
gorilla-llm
0.1%
1k
CLI tool using LLMs to generate commands
Starred by
Created 2 years ago
Updated 1 year ago
llama.cpp
by
ggml-org
0.5%
93k
C/C++ library for local LLM inference
Starred by
+51
Created 2 years ago
Updated 14 hours ago
ray-llm
by
ray-project
0%
1k
LLM deployment framework on Ray (now upstreamed to Ray)
Starred by
+2
Created 2 years ago
Updated 10 months ago
peft
by
huggingface
0.2%
20k
Parameter-efficient fine-tuning (PEFT) library
Starred by
+16
Created 3 years ago
Updated 2 days ago
bitsandbytes
by
bitsandbytes-foundation
0.3%
8k
PyTorch library for k-bit quantization, enabling accessible LLMs
Starred by
+26
Created 4 years ago
Updated 3 days ago
ctransformers
by
marella
0.1%
2k
Python bindings for fast Transformer model inference
Starred by
+8
Created 2 years ago
Updated 1 year ago
CTranslate2
by
OpenNMT
0.2%
4k
Fast inference engine for Transformer models
Starred by
+6
Created 6 years ago
Updated 14 hours ago
EasyLM
by
young-geng
0.1%
3k
LLM training/finetuning framework in JAX/Flax
Starred by
+9
Created 3 years ago
Updated 1 year ago
open_llama
by
openlm-research
0%
8k
Open-source reproduction of LLaMA models
Starred by
+14
Created 2 years ago
Updated 2 years ago
text-generation-inference
by
huggingface
0.1%
11k
Rust/Python/gRPC server for fast LLM text generation
Starred by
+35
Created 3 years ago
Updated 3 days ago
langchain
by
langchain-ai
0.5%
124k
Framework for building LLM-powered applications
Starred by
+83
Created 3 years ago
Updated 1 day ago
web-llm
by
mlc-ai
0.3%
17k
In-browser LLM inference engine using WebGPU for hardware acceleration
Starred by
+20
Created 2 years ago
Updated 1 month ago
FasterTransformer
by
NVIDIA
0.1%
6k
Optimized transformer library for inference
Starred by
+12
Created 4 years ago
Updated 1 year ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 2 years ago
Updated 7 months ago
llama
by
meta-llama
0.1%
59k
Inference code for Llama 2 models (deprecated)
Starred by
+38
Created 2 years ago
Updated 11 months ago
FlexLLMGen
by
FMInference
0.0%
9k
High-throughput generation engine for LLMs with limited GPU memory
Starred by
+20
Created 2 years ago
Updated 1 year ago
PiPPy
by
pytorch
0%
783
PyTorch tool for pipeline parallelism
Starred by
+3
Created 4 years ago
Updated 1 year ago
AITemplate
by
facebookincubator
0.0%
5k
Generate high-performance inference engines
Starred by
+19
Created 3 years ago
Updated 3 weeks ago
compiler-and-arch
by
KnowingNothing
0%
517
Compiler/architecture resources for emerging domains
Starred by
Created 3 years ago
Updated 1 year ago
server
by
triton-inference-server
0.3%
10k
AI model inference serving optimized for cloud and edge
Starred by
+12
Created 7 years ago
Updated 2 days ago
cutlass
by
NVIDIA
0.5%
9k
CUDA C++ and Python DSLs for high-performance linear algebra
Starred by
+20
Created 8 years ago
Updated 2 days ago
skypilot
by
skypilot-org
0.2%
9k
Framework for cloud AI/batch jobs, unifying execution across diverse infrastructure
Starred by
+24
Created 4 years ago
Updated 14 hours ago
paxml
by
google
0%
542
Jax-based ML framework for large-scale model training and experimentation
Starred by
Created 3 years ago
Updated 3 weeks ago
metaseq
by
facebookresearch
0%
7k
Codebase for large-scale transformer model development and deployment
Starred by
+11
Created 3 years ago
Updated 1 year ago
alpa
by
alpa-projects
0.0%
3k
Auto-parallelization framework for large-scale neural network training and serving
Starred by
+17
Created 4 years ago
Updated 2 years ago
DeepSpeed
by
deepspeedai
0.2%
41k
Deep learning optimization library for distributed training and inference
Starred by
+36
Created 6 years ago
Updated 15 hours ago
DPR
by
facebookresearch
0.1%
2k
Dense Passage Retriever for open-domain Q&A research
Starred by
+4
Created 5 years ago
Updated 2 years ago
flexflow-train
by
flexflow
0.1%
2k
Accelerating distributed deep learning training
Starred by
+8
Created 7 years ago
Updated 1 day ago
pytorch-lightning
by
Lightning-AI
0.1%
31k
Deep learning framework for pretraining, finetuning, and deploying AI models
Starred by
+31
Created 6 years ago
Updated 3 days ago
faiss
by
facebookresearch
0.2%
39k
Similarity search library for dense vectors
Starred by
+52
Created 9 years ago
Updated 4 days ago
tvm
by
apache
0.2%
13k
Compiler stack for deep learning systems
Starred by
+20
Created 9 years ago
Updated 1 day ago
universal-triggers
by
Eric-Wallace
0%
301
NLP attack/analysis research paper (EMNLP 2019)
Starred by
Created 6 years ago
Updated 1 year ago
gdrcopy
by
NVIDIA
0.8%
1k
GPU memory copy library using GPUDirect RDMA
Starred by
Created 11 years ago
Updated 3 weeks ago
Megatron-LM
by
NVIDIA
0.6%
15k
Framework for training transformer models at scale
Starred by
+19
Created 6 years ago
Updated 18 hours ago
DeepLearningExamples
by
NVIDIA
0.0%
15k
Deep learning examples for training and deployment
Starred by
+8
Created 7 years ago
Updated 1 year ago
gpt-2
by
openai
0.1%
25k
Code for research paper "Language Models are Unsupervised Multitask Learners"
Starred by
+27
Created 7 years ago
Updated 1 year ago
fairseq
by
facebookresearch
0.1%
32k
Sequence modeling toolkit for translation, language modeling, and text generation research
Starred by
+42
Created 8 years ago
Updated 3 months ago
rl_a3c_pytorch
by
dgriff777
0%
570
PyTorch implementation of A3C for Atari games
Starred by
+2
Created 8 years ago
Updated 2 years ago
ray
by
ray-project
0.3%
41k
AI compute engine for scaling Python and AI applications
Starred by
+52
Created 9 years ago
Updated 19 hours ago
bert
by
google-research
0.0%
40k
TensorFlow code and pre-trained models for BERT
Starred by
+26
Created 7 years ago
Updated 1 year ago
awesome-ai-residency
by
dangkhoasdc
0.3%
3k
Curated list of AI residency programs
Starred by
Created 7 years ago
Updated 9 months ago
3D-Machine-Learning
by
timzhang642
0.1%
10k
Resource list for 3D machine learning
Starred by
+3
Created 8 years ago
Updated 1 year ago
tensor2tensor
by
tensorflow
0.1%
17k
Deprecated library for deep learning models/datasets, successor to Trax
Starred by
+23
Created 8 years ago
Updated 2 years ago
kit
by
HugoBlox
3.4%
9k
AI-powered static site builder for technical content
Starred by
Created 9 years ago
Updated 22 hours ago
generating-reviews-discovering-sentiment
by
openai
0.1%
2k
Language model code for generating reviews and discovering sentiment
Starred by
+8
Created 8 years ago
Updated 2 years ago
tensorflow
by
tensorflow
0.1%
193k
Open-source ML framework
Starred by
+97
Created 10 years ago
Updated 13 hours ago
Feedback? Help us improve.