Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Woosuk Kwon
Woosuk Kwon
Coauthor of vLLM
GitHub
Starred Projects (67)
torchtitan
by
pytorch
0.2%
5k
PyTorch platform for generative AI model training research
Starred by
+12
Created 2 years ago
Updated 19 hours ago
vllm-omni
by
vllm-project
1.6%
3k
Omni-modality model inference and serving framework
Starred by
Created 5 months ago
Updated 18 hours ago
verl
by
verl-project
0.5%
19k
RL training library for LLMs
Starred by
+14
Created 1 year ago
Updated 1 day ago
SkyRL
by
NovaSky-AI
1.4%
2k
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
Starred by
+13
Created 10 months ago
Updated 19 hours ago
tinker-cookbook
by
thinking-machines-lab
0.7%
3k
Advanced LLM fine-tuning SDK and example cookbook
Starred by
+6
Created 7 months ago
Updated 1 day ago
batch_invariant_ops
by
thinking-machines-lab
0.2%
967
Enhance LLM inference determinism
Starred by
+1
Created 5 months ago
Updated 3 months ago
recipes
by
vllm-project
2.5%
453
LLM inference recipes
Starred by
Created 7 months ago
Updated 1 day ago
openevolve
by
algorithmicsuperintelligence
0.9%
5k
Coding agent for scientific/algorithmic discovery, based on AlphaEvolve paper
Starred by
+2
Created 9 months ago
Updated 3 weeks ago
nano-vllm
by
GeeeekExplorer
0.7%
12k
Lightweight vLLM implementation from scratch
Starred by
+2
Created 8 months ago
Updated 3 months ago
llm-d
by
llm-d
0.7%
3k
Kubernetes-native framework for distributed LLM inference
Starred by
Created 10 months ago
Updated 1 day ago
dynamo
by
ai-dynamo
0.4%
6k
Inference framework for distributed generative AI model serving
Starred by
+7
Created 11 months ago
Updated 17 hours ago
ArcticInference
by
snowflakedb
0%
400
vLLM plugin for high-throughput, low-latency LLM and embedding inference
Starred by
Created 11 months ago
Updated 1 day ago
MiMo
by
XiaomiMiMo
0%
2k
LLM for reasoning, pre-trained and post-trained for math/code tasks
Starred by
Created 10 months ago
Updated 8 months ago
rllm
by
rllm-org
0.8%
5k
Framework for post-training language agents via reinforcement learning
Starred by
+2
Created 1 year ago
Updated 19 hours ago
chatgpt_system_prompt
by
LouisShark
0.2%
10k
GPT system prompt collection for prompt engineering and security education
Starred by
+2
Created 2 years ago
Updated 1 month ago
fairseq2
by
facebookresearch
0.2%
1k
Sequence modeling toolkit for content generation research
Created 3 years ago
Updated 2 days ago
Mooncake
by
kvcache-ai
0.5%
5k
Research paper on a disaggregated architecture for LLM serving
Starred by
+2
Created 1 year ago
Updated 17 hours ago
xgrammar
by
mlc-ai
0.5%
2k
Library for efficient structured generation
Starred by
+4
Created 1 year ago
Updated 1 week ago
Liger-Kernel
by
linkedin
0.1%
6k
Triton kernels for efficient LLM training
Starred by
+9
Created 1 year ago
Updated 18 hours ago
Nanoflow
by
efeslab
0.2%
946
LLM serving framework for high throughput
Starred by
Created 1 year ago
Updated 3 months ago
xla
by
pytorch
0.2%
3k
PyTorch on XLA devices
Starred by
+15
Created 7 years ago
Updated 2 months ago
ao
by
pytorch
0.3%
3k
PyTorch library for quantization and sparsity in training/inference
Starred by
+11
Created 2 years ago
Updated 19 hours ago
Model-Optimizer
by
NVIDIA
2.2%
2k
Library for optimizing deep learning models for GPU inference
Starred by
Created 1 year ago
Updated 17 hours ago
llm-compressor
by
vllm-project
0.7%
3k
Transformers-compatible library for LLM compression, optimized for vLLM deployment
Starred by
+3
Created 1 year ago
Updated 19 hours ago
intel-extension-for-pytorch
by
intel
0.1%
2k
PyTorch extension for performance boost on Intel platforms
Starred by
Created 5 years ago
Updated 1 week ago
ThunderKittens
by
HazyResearch
1.3%
3k
CUDA kernel framework for fast deep learning primitives
Starred by
+15
Created 2 years ago
Updated 1 day ago
mirage
by
mirage-project
0.4%
2k
Tool for fast GPU kernel generation via superoptimization
Starred by
+1
Created 1 year ago
Updated 2 days ago
AutoAWQ
by
casper-hansen
0.0%
2k
AutoAWQ is a tool for 4-bit quantized LLM inference
Starred by
+5
Created 2 years ago
Updated 9 months ago
grok-1
by
xai-org
0.1%
52k
JAX example code for loading and running Grok-1 open-weights model
Starred by
+22
Created 1 year ago
Updated 1 year ago
lm-evaluation-harness
by
EleutherAI
0.3%
11k
Framework for few-shot language model evaluation
Starred by
+18
Created 5 years ago
Updated 1 day ago
LLMSys-PaperList
by
AmberLJC
0.3%
2k
Curated list of LLM systems papers
Starred by
Created 2 years ago
Updated 2 weeks ago
aici
by
microsoft
0%
2k
AICI constrains LLM output using (Wasm) programs
Starred by
+7
Created 2 years ago
Updated 1 year ago
mlc-llm
by
mlc-ai
0.1%
22k
Universal LLM deployment engine with ML compilation
Starred by
+21
Created 2 years ago
Updated 2 days ago
mscclpp
by
microsoft
0.8%
472
GPU-driven communication stack for scalable AI applications
Starred by
Created 3 years ago
Updated 20 hours ago
sglang
by
sgl-project
1.0%
24k
Fast serving framework for LLMs and vision language models
Starred by
+34
Created 2 years ago
Updated 17 hours ago
flashinfer
by
flashinfer-ai
0.8%
5k
Kernel library for LLM serving
Starred by
+12
Created 2 years ago
Updated 22 hours ago
punica
by
punica-ai
0.3%
1k
LoRA serving system (research paper) for multi-tenant LLM inference
Starred by
+3
Created 2 years ago
Updated 1 year ago
LLMCompiler
by
SqueezeAILab
0.1%
2k
LLM compiler for parallel function calling
Starred by
+2
Created 2 years ago
Updated 1 year ago
gpt-fast
by
meta-pytorch
0.1%
6k
PyTorch text generation for efficient transformer inference
Starred by
+20
Created 2 years ago
Updated 6 months ago
TensorRT-LLM
by
NVIDIA
0.3%
13k
LLM inference optimization SDK for NVIDIA GPUs
Starred by
+18
Created 2 years ago
Updated 17 hours ago
WizardLM
by
nlpxucan
0.0%
9k
LLMs built using Evol-Instruct for complex instruction following
Starred by
+15
Created 2 years ago
Updated 8 months ago
outlines
by
dottxt-ai
0.1%
13k
SDK for structured LLM text generation
Starred by
+34
Created 2 years ago
Updated 1 week ago
Awesome-LLM
by
Hannibal046
0.2%
26k
Curated list of Large Language Model resources
Starred by
+8
Created 3 years ago
Updated 6 months ago
gorilla
by
ShishirPatil
0.1%
13k
LLM tool-use framework for API invocation and function calling
Starred by
+15
Created 2 years ago
Updated 2 weeks ago
LLMSurvey
by
RUCAIBox
0.0%
12k
Survey paper for large language models
Starred by
+2
Created 2 years ago
Updated 11 months ago
CTranslate2
by
OpenNMT
0.3%
4k
Fast inference engine for Transformer models
Starred by
+6
Created 6 years ago
Updated 3 weeks ago
SqueezeLLM
by
SqueezeAILab
0%
713
Quantization framework for efficient LLM serving (ICML 2024 paper)
Starred by
Created 2 years ago
Updated 1 year ago
vllm
by
vllm-project
0.8%
71k
LLM serving engine for high-throughput, memory-efficient inference
Starred by
+57
Created 3 years ago
Updated 17 hours ago
Awesome-LLMOps
by
tensorchord
0.3%
6k
Curated list of LLMOps tools for developers
Starred by
+3
Created 3 years ago
Updated 3 weeks ago
FastChat
by
lm-sys
0.0%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 2 years ago
Updated 8 months ago
llama
by
meta-llama
0.1%
59k
Inference code for Llama 2 models (deprecated)
Starred by
+38
Created 3 years ago
Updated 1 year ago
Megatron-LM
by
NVIDIA
0.3%
15k
Framework for training transformer models at scale
Starred by
+20
Created 7 years ago
Updated 17 hours ago
flash-attention
by
Dao-AILab
0.3%
22k
Fast, memory-efficient attention implementation
Starred by
+31
Created 3 years ago
Updated 17 hours ago
TransformerEngine
by
NVIDIA
0.3%
3k
Library for Transformer model acceleration on NVIDIA GPUs
Starred by
+5
Created 3 years ago
Updated 21 hours ago
AITemplate
by
facebookincubator
0.1%
5k
Generate high-performance inference engines
Starred by
+19
Created 3 years ago
Updated 1 month ago
x-transformers
by
lucidrains
0%
6k
Transformer library with extensive experimental features
Starred by
+7
Created 5 years ago
Updated 1 week ago
compiler-and-arch
by
KnowingNothing
0%
521
Compiler/architecture resources for emerging domains
Starred by
Created 3 years ago
Updated 1 year ago
skypilot
by
skypilot-org
0.3%
9k
Framework for cloud AI/batch jobs, unifying execution across diverse infrastructure
Starred by
+24
Created 4 years ago
Updated 17 hours ago
metaseq
by
facebookresearch
0%
7k
Codebase for large-scale transformer model development and deployment
Starred by
+11
Created 3 years ago
Updated 1 year ago
FasterTransformer
by
NVIDIA
0.0%
6k
Optimized transformer library for inference
Starred by
+12
Created 4 years ago
Updated 1 year ago
alpa
by
alpa-projects
0.1%
3k
Auto-parallelization framework for large-scale neural network training and serving
Starred by
+17
Created 5 years ago
Updated 2 years ago
transformers
by
huggingface
0.2%
157k
ML library for pretrained model inference and training
Starred by
+96
Created 7 years ago
Updated 18 hours ago
ray
by
ray-project
0.4%
41k
AI compute engine for scaling Python and AI applications
Starred by
+53
Created 9 years ago
Updated 19 hours ago
awesome-tensor-compilers
by
merrymercy
0%
3k
Curated list of tensor compiler projects and papers
Starred by
+10
Created 5 years ago
Updated 1 year ago
tvm
by
apache
0.1%
13k
Compiler stack for deep learning systems
Starred by
+20
Created 9 years ago
Updated 20 hours ago
cutlass
by
NVIDIA
0.3%
9k
CUDA C++ and Python DSLs for high-performance linear algebra
Starred by
+21
Created 8 years ago
Updated 21 hours ago
DeepLearningExamples
by
NVIDIA
0.1%
15k
Deep learning examples for training and deployment
Starred by
+8
Created 7 years ago
Updated 1 year ago
Feedback? Help us improve.