beta
Home
Browse all repos
Newsletter
/
Popular searches
MCP
model serving
fine tuning
conversational speech model
observability
evaluation framework
Home
Browse all repos
Newsletter
Home
>
Users
>
CodeCreator
Alexander Wettig
@CodeCreator
Author of SWE-bench, SWE-agent
GitHub
View on GitHub
Starred Projects (32)
Starred by
Calvin French-Owen
(Coounder of Segment)
and
Michael Han
(Cofounder of Unsloth)
.
DeepEP
by
deepseek-ai
0.4%
8k
Expert-parallel communication library for MoE, targeting high-throughput and low-latency
created 5 months ago
updated 1 day ago
Starred by
Jiayi Pan
(Author of SWE-Gym; AI Researcher at UC Berkeley)
.
ring-flash-attention
by
zhuzilin
1.1%
827
FlashAttention extension for ring attention
created 1 year ago
updated 1 week ago
Starred by
Lysandre Debut
(Chief Open-Source Officer at Hugging Face)
,
Calvin French-Owen
(Coounder of Segment),
and
8 more.
LLaMA-Factory
by
hiyouga
0.8%
55k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
created 2 years ago
updated 3 days ago
Starred by
Andrej Karpathy
(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n)
,
Zhuohan Li
(Author of vLLM),
and
5 more.
torchtitan
by
pytorch
0.9%
4k
PyTorch platform for generative AI model training research
created 1 year ago
updated 19 hours ago
Starred by
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems)
,
Jeff Hammerbacher
(Cofounder of Cloudera),
and
9 more.
open-r1
by
huggingface
0.2%
25k
SDK for reproducing DeepSeek-R1
created 6 months ago
updated 3 days ago
Starred by
Tim J. Baek
(Founder of Open WebUI)
,
Nathan Lambert
(AI Researcher at AI2),
and
1 more.
awesome-o1
by
srush
0%
1k
Bibliography for OpenAI's o1 project
created 9 months ago
updated 8 months ago
Starred by
Elie Bursztein
(Cybersecurity Lead at Google DeepMind)
,
Philipp Schmid
(DevRel at Google DeepMind),
and
17 more.
sglang
by
sgl-project
1.1%
16k
Fast serving framework for LLMs and vision language models
created 1 year ago
updated 11 hours ago
Starred by
Ben Firshman
(Cofounder of Replicate)
,
Joe Walnes
(Head of Experimental Projects at Stripe),
and
10 more.
marker
by
datalab-to
0.6%
27k
CLI tool for converting PDFs and other documents to Markdown, JSON, and HTML
created 1 year ago
updated 21 hours ago
Starred by
Teknium
(Cofounder of Nous Research)
.
dclm
by
mlfoundations
0.4%
1k
Framework for LLM dataset creation, training, and evaluation
created 1 year ago
updated 4 months ago
Starred by
Matei Zaharia
(Cofounder of Databricks)
,
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems),
and
6 more.
megablocks
by
databricks
0.1%
1k
Lightweight library for mixture-of-experts (MoE) training
created 2 years ago
updated 1 month ago
Starred by
Michael Truell
(Cofounder of Cursor)
,
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems),
and
13 more.
SWE-agent
by
SWE-agent
0.5%
17k
Agent for automated software engineering (NeurIPS 2024)
created 1 year ago
updated 2 days ago
Starred by
Lewis Tunstall
(Researcher at Hugging Face)
and
Nathan Lambert
(AI Researcher at AI2)
.
SPIN
by
uclaml
0.3%
1k
Self-Play Fine-Tuning (SPIN) research paper implementation
created 1 year ago
updated 1 year ago
Starred by
John Yang
(Author of SWE-bench, SWE-agent)
.
LLM-Shearing
by
princeton-nlp
0.2%
626
Code for LLM pre-training acceleration via structured pruning (ICLR 2024)
created 1 year ago
updated 1 year ago
Starred by
Nat Friedman
(Former CEO of GitHub)
,
Alexey Milovidov
(Cofounder of Clickhouse),
and
5 more.
RedPajama-Data
by
togethercomputer
0.1%
5k
Dataset pipeline for training large language models
created 2 years ago
updated 7 months ago
Starred by
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems)
,
Jeremy Howard
(Cofounder of fast.ai),
and
1 more.
data-juicer
by
modelscope
0.7%
5k
Data-Juicer: Data processing system for foundation models
created 2 years ago
updated 1 day ago
Starred by
Travis Fischer
(Founder of Agentic)
,
Ying Sheng
(Author of SGLang),
and
4 more.
SWE-bench
by
SWE-bench
0.8%
3k
Benchmark for evaluating LLMs on real-world GitHub issues
created 1 year ago
updated 3 days ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Clément Renault
(Cofounder of Meilisearch),
and
21 more.
outlines
by
dottxt-ai
0.3%
12k
SDK for structured LLM text generation
created 2 years ago
updated 12 hours ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Matei Zaharia
(Cofounder of Databricks),
and
27 more.
dspy
by
stanfordnlp
0.6%
27k
Framework for programming language models, not prompting
created 2 years ago
updated 1 day ago
Starred by
Lewis Tunstall
(Researcher at Hugging Face)
,
Ying Sheng
(Author of SGLang),
and
1 more.
llm-reasoners
by
maitrix-org
0.2%
2k
Library for advanced LLM reasoning with search algorithms
created 2 years ago
updated 1 month ago
Starred by
Andrej Karpathy
(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n)
,
Jiayi Pan
(Author of SWE-Gym; AI Researcher at UC Berkeley),
and
15 more.
flash-attention
by
Dao-AILab
0.7%
19k
Fast, memory-efficient attention implementation
created 3 years ago
updated 15 hours ago
Starred by
Nat Friedman
(Former CEO of GitHub)
,
Jeff Hammerbacher
(Cofounder of Cloudera),
and
11 more.
tiktoken
by
openai
0.4%
15k
Fast BPE tokenizer for OpenAI models
created 2 years ago
updated 4 months ago
Starred by
Jeff Hammerbacher
(Cofounder of Cloudera)
.
ALCE
by
princeton-nlp
0%
490
Benchmark for evaluating LLMs' citation abilities
created 2 years ago
updated 9 months ago
AutoCompressors
by
princeton-nlp
0%
309
Research paper adapting LMs for long context compression
created 2 years ago
updated 10 months ago
Starred by
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems)
and
Jeremy Howard
(Cofounder of fast.ai)
.
Sophia
by
Liuhong99
0%
965
Optimizer for language model pre-training (research paper)
created 2 years ago
updated 1 year ago
Starred by
Jiayi Pan
(Author of SWE-Gym; AI Researcher at UC Berkeley)
,
Chip Huyen
(Author of AI Engineering, Designing Machine Learning Systems),
and
5 more.
EasyLM
by
young-geng
0.2%
2k
LLM training/finetuning framework in JAX/Flax
created 2 years ago
updated 11 months ago
Starred by
Andrej Karpathy
(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n)
,
John Yang
(Author of SWE-bench, SWE-agent),
and
12 more.
stanford_alpaca
by
tatsu-lab
0.1%
30k
Instruction-following LLaMA model training and data generation
created 2 years ago
updated 1 year ago
LM-BFF
by
princeton-nlp
0%
728
Research paper on few-shot fine-tuning of language models
created 4 years ago
updated 2 years ago
Starred by
Luca Antiga
(CTO of Lightning AI)
.
lightning-transformers
by
Lightning-Universe
0%
609
Archived library for training Transformers with PyTorch Lightning
created 4 years ago
updated 2 years ago
Starred by
Tobi Lutke
(Cofounder of Shopify)
,
Anton Troynikov
(Cofounder of Chroma),
and
12 more.
haystack
by
deepset-ai
0.4%
22k
AI orchestration framework for LLM application development
created 5 years ago
updated 2 days ago
Starred by
Chenlin Meng
(Cofounder of Pika)
.
normalizing_flows
by
kamenbliznashki
0%
627
PyTorch for density estimation research
created 6 years ago
updated 4 years ago
Starred by
Aravind Srinivas
(Cofounder of Perplexity)
and
Chenlin Meng
(Cofounder of Pika)
.
vdvae
by
openai
0%
446
Research paper implementation for very deep VAE models
created 4 years ago
updated 2 years ago
pytorch-generative
by
EugenHotaj
0%
440
PyTorch library for generative modeling
created 5 years ago
updated 1 year ago
Feedback? Help us improve.