beta
Home
Browse all repos
Newsletter
/
Popular searches
MCP
model serving
fine tuning
conversational speech model
observability
evaluation framework
Home
Browse all repos
Newsletter
Home
>
Users
>
xiezhq-hermann
Zhiqiang Xie
@xiezhq-hermann
Author of SGLang
GitHub
View on GitHub
Starred Projects (40)
Starred by
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems)
.
KernelBench
by
ScalingIntelligence
2.2%
499
Benchmark for LLMs generating GPU kernels from PyTorch ops
created 9 months ago
updated 2 days ago
Starred by
Taranjeet Singh
(Cofounder of Mem0)
,
Woosuk Kwon
(Author of vLLM),
and
1 more.
openevolve
by
codelion
2.4%
3k
Coding agent for scientific/algorithmic discovery, based on AlphaEvolve paper
created 2 months ago
updated 2 days ago
Starred by
Joe Walnes
(Head of Experimental Projects at Stripe)
,
Dongxu Huang
(Cofounder of PingCAP),
and
10 more.
letta
by
letta-ai
0.6%
18k
Agent framework for stateful agents with memory, reasoning, and context management
created 1 year ago
updated 1 day ago
Starred by
Jeff Hammerbacher
(Cofounder of Cloudera)
,
Jiayi Pan
(Author of SWE-Gym; AI Researcher at UC Berkeley),
and
1 more.
SkyRL
by
NovaSky-AI
4.4%
666
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
created 3 months ago
updated 2 days ago
Starred by
Joe Walnes
(Head of Experimental Projects at Stripe)
,
Jeffrey Morgan
(Cofounder of Ollama),
and
8 more.
mem0
by
mem0ai
0.8%
38k
AI agent memory layer for personalized interactions
created 2 years ago
updated 23 hours ago
Starred by
Shishir Patil
(Author of BFCL, Gorilla)
and
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems)
.
tokasaurus
by
ScalingIntelligence
1.3%
386
LLM inference engine for high-throughput workloads
created 1 month ago
updated 3 days ago
Starred by
Carol Willing
(Core Contributor to CPython, Jupyter)
,
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems),
and
3 more.
dynamo
by
ai-dynamo
1.1%
5k
Inference framework for distributed generative AI model serving
created 5 months ago
updated 13 hours ago
Starred by
Joe Walnes
(Head of Experimental Projects at Stripe)
,
Wes McKinney
(Author of Pandas),
and
4 more.
3FS
by
deepseek-ai
0.2%
9k
Distributed file system for AI training/inference workloads
created 5 months ago
updated 5 days ago
Starred by
Yang Song
(Professor at Caltech; Research Scientist at OpenAI)
,
Jeremy Howard
(Cofounder of fast.ai),
and
1 more.
flash-linear-attention
by
fla-org
1.2%
3k
Efficient Torch/Triton implementations for linear attention models
created 1 year ago
updated 1 day ago
Starred by
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems)
.
Sana
by
NVlabs
0.5%
4k
Image synthesis research paper using a linear diffusion transformer
created 9 months ago
updated 2 weeks ago
Starred by
Travis Fischer
(Founder of Agentic)
and
Jared Palmer
(Ex-VP of AI at Vercel; Founder of Turborepo; Author of Formik, TSDX)
.
Trace
by
microsoft
0.5%
631
AutoDiff-like tool for end-to-end AI agent training with general feedback
created 1 year ago
updated 1 month ago
Starred by
David Cournapeau
(Author of scikit-learn)
,
Stas Bekman
(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake),
and
3 more.
lectures
by
gpu-mode
0.4%
5k
Lecture series for GPU-accelerated computing
created 1 year ago
updated 1 month ago
Starred by
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems)
,
Robert Stojnic
(Creator of Papers with Code),
and
5 more.
swarm
by
openai
0.3%
20k
Multi-agent orchestration framework for lightweight agent coordination
created 1 year ago
updated 4 months ago
Starred by
Dan Guido
(Cofounder of Trail of Bits)
,
Michael Han
(Cofounder of Unsloth),
and
9 more.
ComfyUI
by
comfyanonymous
0.8%
84k
Visual AI engine for diffusion models, API, and backend
created 2 years ago
updated 18 hours ago
Starred by
Philipp Schmid
(DevRel at Google DeepMind)
and
Lianmin Zheng
(Author of SGLang)
.
distrifuser
by
mit-han-lab
0%
698
Research paper for distributed parallel inference of high-resolution diffusion models
created 1 year ago
updated 8 months ago
Starred by
Stas Bekman
(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake)
.
veScale
by
volcengine
0.1%
839
PyTorch-native framework for LLM training
created 1 year ago
updated 3 weeks ago
Starred by
Jeff Hammerbacher
(Cofounder of Cloudera)
,
Lianmin Zheng
(Author of SGLang),
and
3 more.
flashinfer
by
flashinfer-ai
1.4%
3k
Kernel library for LLM serving
created 2 years ago
updated 13 hours ago
Starred by
Elie Bursztein
(Cybersecurity Lead at Google DeepMind)
,
Philipp Schmid
(DevRel at Google DeepMind),
and
17 more.
sglang
by
sgl-project
1.1%
16k
Fast serving framework for LLMs and vision language models
created 1 year ago
updated 11 hours ago
Starred by
Deshraj Yadav
(Cofounder of Mem0)
,
Didier Lopes
(Founder of OpenBB),
and
4 more.
camel
by
camel-ai
1.4%
14k
Multi-agent framework for studying agent scaling laws
created 2 years ago
updated 13 hours ago
Starred by
Travis Fischer
(Founder of Agentic)
and
Andreas Jansson
(Cofounder of Replicate)
.
LLM-Agent-Paper-List
by
WooooDyy
0.1%
8k
Paper list for LLM-based agents
created 1 year ago
updated 1 year ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Stas Bekman
(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake),
and
20 more.
guidance
by
guidance-ai
0.1%
21k
Guidance is a programming paradigm for steering LLMs
created 2 years ago
updated 23 hours ago
Starred by
Zhuohan Li
(Author of vLLM)
,
Ying Sheng
(Author of SGLang),
and
8 more.
scalene
by
plasma-umass
0.1%
13k
Python profiler with AI-powered optimization proposals
created 5 years ago
updated 3 weeks ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems),
and
4 more.
generative_agents
by
joonspk-research
0.2%
19k
Research paper code for interactive human behavior simulation using generative agents
created 2 years ago
updated 1 year ago
Starred by
Lysandre Debut
(Chief Open-Source Officer at Hugging Face)
.
rccl
by
ROCm
1.1%
353
ROCm library for GPU collective communication routines
created 7 years ago
updated 1 day ago
Starred by
Fabian Hedin
(Cofounder of Lovable)
,
Quincy Larson
(Founder of freeCodeCamp),
and
21 more.
llama
by
meta-llama
0.1%
59k
Inference code for Llama 2 models (deprecated)
created 2 years ago
updated 6 months ago
madrona
by
shacklettbp
1.5%
411
GPU-accelerated game engine for high-throughput batch simulation
created 3 years ago
updated 1 week ago
Starred by
Travis Fischer
(Founder of Agentic)
,
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems),
and
3 more.
mistral-inference
by
mistralai
0.1%
10k
Inference library for Mistral models
created 1 year ago
updated 4 months ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Boris Cherny
(Creator of Claude Code; MTS at Anthropic),
and
8 more.
ai-town
by
a16z-infra
0.2%
9k
AI town starter kit for building a virtual world
created 2 years ago
updated 5 months ago
Starred by
John Resig
(Author of jQuery; Chief Software Architect at Khan Academy)
,
Georgios Konstantopoulos
(CTO, General Partner at Paradigm),
and
6 more.
milvus
by
milvus-io
0.4%
36k
Cloud-native vector database for scalable ANN search
created 5 years ago
updated 18 hours ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Matei Zaharia
(Cofounder of Databricks),
and
27 more.
dspy
by
stanfordnlp
0.6%
27k
Framework for programming language models, not prompting
created 2 years ago
updated 1 day ago
dynolog
by
facebookincubator
0%
326
Telemetry daemon for performance monitoring and tracing of heterogeneous CPU-GPU systems
created 3 years ago
updated 3 days ago
Starred by
Ying Sheng
(Author of SGLang)
.
ChatGDB
by
pgosar
0%
922
CLI tool for debugging with natural language via LLM
created 2 years ago
updated 7 months ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Daniel Gross
(Cofounder of Safe Superintelligence),
and
19 more.
nanoGPT
by
karpathy
0.4%
43k
Minimalist repo for training/finetuning GPT models
created 2 years ago
updated 7 months ago
Starred by
Aravind Srinivas
(Cofounder of Perplexity)
,
John Yang
(Author of SWE-bench, SWE-agent),
and
6 more.
awesome-courses
by
prakhar1989
0.2%
62k
Awesome CS courses with free online materials
created 10 years ago
updated 2 years ago
Starred by
Wei-Lin Chiang
(Cofounder of LMArena)
,
Adam Paszke
(Author of PyTorch),
and
2 more.
iree
by
iree-org
0.7%
3k
MLIR-based compiler and runtime toolkit for machine learning models
created 5 years ago
updated 18 hours ago
Starred by
Lianmin Zheng
(Author of SGLang)
.
antares
by
microsoft
0.2%
478
Compiler solution for PyTorch operator optimization on diverse accelerators
created 5 years ago
updated 3 months ago
oneflow
by
Oneflow-Inc
0.1%
9k
Deep learning framework for user-friendly, scalable, efficient model development
created 8 years ago
updated 2 days ago
Starred by
Omar Sanseviero
(DevRel at Google DeepMind)
,
Wei-Lin Chiang
(Cofounder of LMArena),
and
7 more.
dgl
by
dmlc
0.0%
14k
Python package for deep learning on graphs
created 7 years ago
updated 1 day ago
AI-Infra-from-Zero-to-Hero
by
HuaizhengZhang
0.8%
3k
Curated list of machine learning systems resources
created 6 years ago
updated 1 week ago
Starred by
Aravind Srinivas
(Cofounder of Perplexity)
,
Travis Fischer
(Founder of Agentic),
and
8 more.
tvm
by
apache
0.1%
12k
Compiler stack for deep learning systems
created 8 years ago
updated 15 hours ago
Feedback? Help us improve.