beta
Home
Browse all repos
Follow on
X
/
Popular searches
MCP
model serving
fine tuning
conversational speech model
observability
evaluation framework
Home
Browse all repos
Follow on
X
Home
>
Users
>
srush
Sasha Rush
@srush
Research Scientist at Cursor; Professor at Cornell Tech
GitHub
View on GitHub
Authored Projects (5)
Starred by
Wing Lian
(Founder of Axolotl AI)
,
Alex Cheema
(Cofounder of EXO Labs),
and
1 more.
Triton-Puzzles
by
srush
0.9%
2k
Interactive puzzles for learning Triton
created 1 year ago
updated 9 months ago
Starred by
Jason Knight
(Director AI Compilers at NVIDIA; Cofounder of OctoML)
,
Tim J. Baek
(Founder of Open WebUI),
and
5 more.
awesome-o1
by
srush
0%
1k
Bibliography for OpenAI's o1 project
created 10 months ago
updated 9 months ago
Starred by
Yaowei Zheng
(Author of LLaMA-Factory)
,
Lysandre Debut
(Chief Open-Source Officer at Hugging Face),
and
6 more.
MiniChain
by
srush
0%
1k
Tiny library for coding with large language models
created 2 years ago
updated 1 year ago
Starred by
Jiayi Pan
(Author of SWE-Gym; MTS at xAI)
,
Albert Gu
(Cofounder of Cartesia; Professor at CMU),
and
10 more.
LLM-Training-Puzzles
by
srush
0.3%
1k
Hands-on puzzles for large language model training
created 2 years ago
updated 1 year ago
Starred by
Jeff Hammerbacher
(Cofounder of Cloudera)
.
llama2.rs
by
srush
0.2%
1k
Rust library for fast Llama2 inference on CPU
created 2 years ago
updated 1 year ago
Starred Projects (35)
Starred by
Eric Zhang
(Founding Engineer at Modal)
.
scaling-book
by
jax-ml
5.0%
478
LLM scaling guide on TPUs
created 6 months ago
updated 2 days ago
Starred by
Andrej Karpathy
(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n)
,
Travis Fischer
(Founder of Agentic),
and
3 more.
picotron
by
huggingface
0.4%
2k
Minimalist distributed training framework for educational use
created 11 months ago
updated 1 month ago
awesome-discrete-diffusion-models
by
kuleshov-group
1.9%
425
Curated list of discrete diffusion model resources
created 2 years ago
updated 2 months ago
Starred by
Clément Renault
(Cofounder of Meilisearch)
.
lm.rs
by
samuel-vitorino
0.2%
1k
Minimal LLM inference in Rust
created 1 year ago
updated 9 months ago
Starred by
Solomon Hykes
(Cofounder of Docker, Dagger)
and
Eiso Kant
(Cofounder of poolside)
.
zml
by
zml
0.5%
2k
AI inference stack for production
created 11 months ago
updated 2 days ago
Starred by
Yaowei Zheng
(Author of LLaMA-Factory)
,
Zhiqiang Xie
(Author of SGLang),
and
8 more.
flash-linear-attention
by
fla-org
0.6%
3k
Efficient Torch/Triton implementations for linear attention models
created 1 year ago
updated 1 day ago
Starred by
Andrej Karpathy
(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n)
,
Georgios Konstantopoulos
(CTO, General Partner at Paradigm),
and
10 more.
ThunderKittens
by
HazyResearch
0.5%
3k
CUDA kernel framework for fast deep learning primitives
created 1 year ago
updated 1 week ago
Starred by
Andrej Karpathy
(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n)
,
Suale Hasif
(Cofounder of Cursor),
and
2 more.
attorch
by
BobMcDear
0.2%
568
PyTorch nn module subset, implemented in Python using Triton
created 2 years ago
updated 4 days ago
Starred by
Peter Norvig
(Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google)
,
Didier Lopes
(Founder of OpenBB),
and
23 more.
llm.c
by
karpathy
0.2%
27k
LLM training in pure C/CUDA, no PyTorch needed
created 1 year ago
updated 1 month ago
Starred by
Elie Bursztein
(Cybersecurity Lead at Google DeepMind)
,
Philipp Schmid
(DevRel at Google DeepMind),
and
25 more.
sglang
by
sgl-project
1.3%
17k
Fast serving framework for LLMs and vision language models
created 1 year ago
updated 1 day ago
Starred by
George Hotz
(Author of tinygrad; Founder of the tiny corp, comma.ai)
,
Zhiyuan Li
(Cofounder of Nexa AI),
and
18 more.
mamba
by
state-spaces
0.3%
16k
Mamba SSM architecture for sequence modeling
created 1 year ago
updated 4 weeks ago
Starred by
Jesse Clark
(Cofounder of Marqo)
,
Taranjeet Singh
(Cofounder of Mem0),
and
1 more.
vec2text
by
vec2text
0.2%
918
Utilities for decoding deep representations (sentence embeddings) back to text
created 2 years ago
updated 1 week ago
Starred by
Pietro Schirano
(Founder of MagicPath)
,
Jonathan Ragan-Kelley
(Professor at MIT),
and
8 more.
insanely-fast-whisper
by
Vaibhavs10
0.2%
9k
Fast Whisper transcription CLI
created 1 year ago
updated 1 year ago
epub2tts
by
aedocw
0.5%
829
CLI tool to create audiobooks from epub/text files using TTS engines
created 2 years ago
updated 2 months ago
Starred by
Eugene Yan
(AI Scientist at AWS)
,
Philipp Schmid
(DevRel at Google DeepMind),
and
11 more.
alignment-handbook
by
huggingface
0.3%
5k
Handbook for aligning language models with human/AI preferences
created 2 years ago
updated 2 weeks ago
Starred by
Jeremy Howard
(Cofounder of fast.ai)
and
Omar Sanseviero
(DevRel at Google DeepMind)
.
GPTQ-triton
by
fpgaminer
0.7%
305
Triton kernel for GPTQ inference, improving context scaling
created 2 years ago
updated 2 years ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Jason Knight
(Director AI Compilers at NVIDIA; Cofounder of OctoML),
and
19 more.
candle
by
huggingface
0.3%
18k
Minimalist ML framework for Rust, emphasizing performance and ease of use
created 2 years ago
updated 5 days ago
Starred by
Omar Sanseviero
(DevRel at Google DeepMind)
.
gradio-tools
by
freddyaboulton
0%
602
Gradio apps to LLM agent tool converter
created 2 years ago
updated 2 years ago
Starred by
John Resig
(Author of jQuery; Chief Software Architect at Khan Academy)
.
llmparser
by
kyang6
0%
425
LLM tool for structured data extraction and classification
created 2 years ago
updated 2 years ago
Starred by
Andrej Karpathy
(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n)
,
John Yang
(Author of SWE-bench, SWE-agent),
and
20 more.
stanford_alpaca
by
tatsu-lab
0.0%
30k
Instruction-following LLaMA model training and data generation
created 2 years ago
updated 1 year ago
Starred by
Nathan Lambert
(Research Scientist at AI2)
.
theseus
by
facebookresearch
0.1%
2k
Library for differentiable nonlinear optimization layers in PyTorch
created 3 years ago
updated 7 months ago
Starred by
Clement Delangue
(Cofounder of Hugging Face)
,
Chip Huyen
(Author of "AI Engineering", "Designing Machine Learning Systems"),
and
8 more.
evaluate
by
huggingface
0.3%
2k
ML model evaluation library for standardized performance reporting
created 3 years ago
updated 2 days ago
Starred by
Clement Delangue
(Cofounder of Hugging Face)
,
Chip Huyen
(Author of "AI Engineering", "Designing Machine Learning Systems"),
and
7 more.
promptsource
by
bigscience-workshop
0.2%
3k
Toolkit for creating, sharing, and using natural language prompts
created 4 years ago
updated 1 year ago
Starred by
Clement Delangue
(Cofounder of Hugging Face)
,
Phil Wang
(Prolific Research Paper Implementer),
and
4 more.
pytorch_block_sparse
by
huggingface
0%
548
PyTorch extension for block-sparse linear layers
created 5 years ago
updated 4 years ago
Starred by
Clement Delangue
(Cofounder of Hugging Face)
,
Chip Huyen
(Author of "AI Engineering", "Designing Machine Learning Systems"),
and
19 more.
datasets
by
huggingface
0.2%
21k
Access and process large AI datasets efficiently
created 5 years ago
updated 3 days ago
Starred by
Clement Delangue
(Cofounder of Hugging Face)
,
Syrus Akbary
(Founder of Wasmer),
and
19 more.
tokenizers
by
huggingface
0.2%
10k
Fast tokenizer library optimized for research and production
created 5 years ago
updated 2 weeks ago
Starred by
Clement Delangue
(Cofounder of Hugging Face)
,
Julien Chaumond
(Cofounder of Hugging Face),
and
2 more.
node-question-answering
by
huggingface
0%
466
Fast question answering for Node.js
created 5 years ago
updated 2 years ago
Starred by
Thomas Wolf
(Cofounder of Hugging Face)
.
detecting-fake-text
by
HendrikStrobelt
0.2%
482
Tool for detecting text generated by large language models
created 6 years ago
updated 1 year ago
Starred by
Eugene Yan
(AI Scientist at AWS)
,
Artidoro Pagnoni
(Author of QLoRA; Research Scientist at Meta),
and
10 more.
text
by
pytorch
0.1%
4k
PyTorch library for NLP tasks
created 8 years ago
updated 1 day ago
Starred by
Boris Cherny
(Creator of Claude Code; MTS at Anthropic)
,
Andrey Vasnetsov
(Cofounder of Qdrant),
and
17 more.
fairseq-lua
by
facebookresearch
0.0%
4k
Lua-based toolkit for sequence-to-sequence learning
created 8 years ago
updated 3 years ago
Starred by
Aravind Srinivas
(Cofounder of Perplexity)
,
Evan Hubinger
(Head of Alignment Stress-Testing at Anthropic),
and
8 more.
generating-reviews-discovering-sentiment
by
openai
0%
2k
Language model code for generating reviews and discovering sentiment
created 8 years ago
updated 2 years ago
Starred by
Omar Sanseviero
(DevRel at Google DeepMind)
,
Jeff Hammerbacher
(Cofounder of Cloudera),
and
10 more.
OpenNMT-py
by
OpenNMT
0.1%
7k
PyTorch framework for neural machine translation and LLM experimentation
created 8 years ago
updated 5 months ago
Starred by
Lilian Weng
(Cofounder of Thinking Machines Lab)
,
Jeff Hammerbacher
(Cofounder of Cloudera),
and
22 more.
examples
by
pytorch
0.1%
23k
PyTorch examples for diverse AI tasks
created 9 years ago
updated 3 days ago
im2latex-tensorflow
by
ritheshkumar95
0%
293
TensorFlow implementation of an im2latex system
created 8 years ago
updated 3 years ago
Starred by
Michael Truell
(Cofounder of Cursor)
,
Chip Huyen
(Author of "AI Engineering", "Designing Machine Learning Systems"),
and
6 more.
requests-for-research
by
openai
0%
2k
Deep learning problems collection for research
created 9 years ago
updated 1 year ago
Feedback? Help us improve.