binarybana avatar

Jason Knight

@binarybana

Director AI Compilers at NVIDIA; Cofounder of OctoML

GitHubView on GitHub

Starred Projects (52)

Starred by Will Brown Will Brown(Research Lead at Prime Intellect), Carol Willing Carol Willing(Core Contributor to CPython, Jupyter), and
15 more.

llm by simonw

1.0%
9k
CLI tool and Python library for LLM interaction
created 2 years ago
updated 4 days ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Simon Willison Simon Willison(Author of Django), and
3 more.

mlx-lm by ml-explore

4.9%
2k
Python package for LLM text generation and fine-tuning on Apple silicon
created 5 months ago
updated 1 day ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Kent Dodds Kent Dodds(Cofounder of Remix), and
7 more.

awesome-mcp-servers by punkpeye

1.3%
66k
Curated list of Model Context Protocol (MCP) servers
created 8 months ago
updated 1 day ago
Starred by Jeffrey Morgan Jeffrey Morgan(Cofounder of Ollama), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
16 more.

codex by openai

6.9%
35k
Coding agent CLI tool for terminal-based chat-driven development
created 4 months ago
updated 19 hours ago
Starred by Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), Lewis Tunstall Lewis Tunstall(Research Engineer at Hugging Face), and
8 more.

verl by volcengine

2.2%
12k
RL training library for LLMs
created 9 months ago
updated 1 day ago
Starred by Addy Osmani Addy Osmani(Head of Chrome Developer Experience at Google), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
13 more.

goose by block

1.9%
19k
Open-source AI agent for automating complex engineering tasks
created 11 months ago
updated 19 hours ago
Starred by Christian Laforte Christian Laforte(Distinguished Engineer at NVIDIA; Former CTO at Stability AI) and Georgi Gerganov Georgi Gerganov(Author of llama.cpp, whisper.cpp).

shell_sage by AnswerDotAI

0.3%
356
CLI tool for terminal context analysis using LLMs
created 9 months ago
updated 1 month ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
5 more.

lorax by predibase

0.5%
3k
Multi-LoRA inference server for serving 1000s of fine-tuned LLMs
created 1 year ago
updated 2 months ago
Starred by Simon Mo Simon Mo(Core Maintainer of vLLM), Stas Bekman Stas Bekman(Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and
4 more.

lingua by facebookresearch

0.1%
5k
LLM research codebase for training and inference
created 10 months ago
updated 4 weeks ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind).

LightRAG by HKUDS

1.5%
20k
RAG framework for fast, simple retrieval-augmented generation
created 10 months ago
updated 1 day ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Georgios Konstantopoulos Georgios Konstantopoulos(CTO, General Partner at Paradigm), and
2 more.

gpu.cpp by AnswerDotAI

0.1%
4k
C++ library for portable GPU computation using WebGPU
created 1 year ago
updated 1 month ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Georgios Konstantopoulos Georgios Konstantopoulos(CTO, General Partner at Paradigm), and
10 more.

ThunderKittens by HazyResearch

0.5%
3k
CUDA kernel framework for fast deep learning primitives
created 1 year ago
updated 1 week ago
Starred by Chris Van Pelt Chris Van Pelt(Cofounder of Weights & Biases), Stas Bekman Stas Bekman(Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and
2 more.

tensorizer by coreweave

1.6%
255
Module for fast model serialization/deserialization
created 2 years ago
updated 2 days ago
Starred by George Hotz George Hotz(Author of tinygrad; Founder of the tiny corp, comma.ai), Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), and
17 more.

StableLM by Stability-AI

0.0%
16k
Language models by Stability AI
created 2 years ago
updated 1 year ago
Starred by Junyang Lin Junyang Lin(Core Maintainer of Alibaba Qwen), Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), and
15 more.

alpaca-lora by tloen

0.1%
19k
LoRA fine-tuning for LLaMA
created 2 years ago
updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Wing Lian Wing Lian(Founder of Axolotl AI), and
1 more.

sparsegpt by IST-DASLab

0.4%
824
Code for massive language model one-shot pruning (ICML 2023 paper)
created 2 years ago
updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Didier Lopes Didier Lopes(Founder of OpenBB), and
12 more.

pyo3 by PyO3

0.3%
14k
Rust bindings for Python, enabling native extension modules
created 8 years ago
updated 1 day ago
Starred by Alex Yu Alex Yu(Research Scientist at OpenAI; Former Cofounder of Luma AI), Travis Fischer Travis Fischer(Founder of Agentic), and
11 more.

tch-rs by LaurentMazare

0.2%
5k
Rust bindings for PyTorch C++ API (libtorch)
created 6 years ago
updated 1 week ago
Starred by Patrick von Platen Patrick von Platen(Research Engineer at Mistral; Author of Hugging Face Diffusers), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
15 more.

redis by redis

0.2%
70k
Redis is a versatile data structure server, cache, and query engine
created 16 years ago
updated 1 day ago
Feedback? Help us improve.