Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Nathan Lambert
Nathan Lambert
Research Scientist at AI2
GitHub
X
Authored Projects (1)
Starred
by
Chip Huyen
(Author of "AI Engineering", "Designing Machine Learning Systems")
and
Vincent Weisser
(Cofounder of Prime Intellect)
.
rlhf-book
by
natolambert
1.2%
1k
Pandoc template for generating technical books
Compiles markdown content into PDF, EPUB, HTML, and DOCX outputs.
Features Pandoc-crossref for advanced cross-referencing of elements.
Automates build processes using Makefiles and allows content filtering.
Tailored for creating educational materials, like RLHF textbooks.
Created 1 year ago
Updated 2 days ago
Starred Projects (62)
marin
by
marin-community
1.0%
707
Framework for reproducible foundation model research and development
Starred by
+2
Created 1 year ago
Updated 9 hours ago
DeepGEMM
by
deepseek-ai
0.4%
6k
CUDA library for efficient FP8 GEMM kernels with fine-grained scaling
Starred by
+6
Created 11 months ago
Updated 5 days ago
OpenEnv
by
meta-pytorch
1.5%
983
Framework for agentic RL training environments
Starred by
+5
Created 3 months ago
Updated 2 days ago
nanochat
by
karpathy
1.0%
40k
A minimal, full-stack LLM implementation for accessible AI development
Starred by
+23
Created 3 months ago
Updated 2 days ago
PRarena
by
aavetis
0.7%
298
Monitoring AI coding agent pull request performance
Starred by
Created 7 months ago
Updated 11 hours ago
chat_templates
by
chujiezheng
0.3%
712
Chat templates for HuggingFace LLMs
Starred by
Created 2 years ago
Updated 1 year ago
OLMoE.swift
by
allenai
0.3%
307
Swift app for local, offline AI experience
Starred by
Created 1 year ago
Updated 9 months ago
verdict
by
haizelabs
1.6%
322
Framework for LLM-as-a-judge systems, scaling evaluation
Starred by
Created 1 year ago
Updated 2 months ago
CodeIO
by
hkust-nlp
0%
566
Research paper enhancing LLMs' reasoning via code I/O prediction
Created 11 months ago
Updated 8 months ago
awesome-open-source-lms
by
allenai
0.3%
356
Curated list of open-source language models and resources
Starred by
Created 1 year ago
Updated 3 months ago
awesome-o1
by
srush
0%
1k
Bibliography for OpenAI's o1 project
Starred by
+3
Created 1 year ago
Updated 1 year ago
RLAIF-V
by
RLHF-V
0.5%
438
Framework for aligning MLLMs using open-source AI feedback
Created 1 year ago
Updated 8 months ago
OLMoE
by
allenai
0.7%
948
Open MoE language model research paper
Starred by
Created 1 year ago
Updated 3 months ago
nomic
by
nomic-ai
0%
2k
Python client for massive unstructured data interaction
Starred by
Created 3 years ago
Updated 2 months ago
MAP-NEO
by
multimodal-art-projection
0.1%
973
Open-source LLM with pretraining data, pipeline, scripts, and alignment code
Starred by
Created 1 year ago
Updated 11 months ago
rlhf-book
by
natolambert
1.2%
1k
Pandoc template for generating technical books
Starred by
Created 1 year ago
Updated 2 days ago
RLHF-Reward-Modeling
by
RLHFlow
0.3%
1k
Recipes to train reward models for RLHF
Starred by
Created 1 year ago
Updated 8 months ago
OpenRLHF
by
OpenRLHF
0.7%
9k
RLHF framework for scalable training of large language models
Starred by
+9
Created 2 years ago
Updated 3 days ago
arena-hard-auto
by
lmarena
0.3%
979
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 2 years ago
Updated 6 months ago
smol-podcaster
by
FanaHOVA
0%
406
Podcast production agent
Starred by
+2
Created 2 years ago
Updated 2 months ago
yet-another-applied-llm-benchmark
by
carlini
0.1%
1k
LLM benchmark for evaluating models on previously asked programming questions
Starred by
+2
Created 2 years ago
Updated 8 months ago
DataDreamer
by
datadreamer-dev
0.1%
1k
Python library for synthetic data generation and training workflows
Starred by
+1
Created 2 years ago
Updated 11 months ago
alpaca_eval
by
tatsu-lab
0.1%
2k
Automatic evaluator for instruction-following language models
Starred by
+3
Created 2 years ago
Updated 5 months ago
SPIN
by
uclaml
0%
1k
Self-Play Fine-Tuning (SPIN) research paper implementation
Starred by
Created 1 year ago
Updated 1 year ago
cutlass
by
NVIDIA
0.5%
9k
CUDA C++ and Python DSLs for high-performance linear algebra
Starred by
+20
Created 8 years ago
Updated 2 days ago
llm-swarm
by
huggingface
0%
278
CLI tool to manage scalable open LLM inference endpoints in Slurm clusters
Starred by
Created 2 years ago
Updated 1 year ago
mergekit
by
arcee-ai
0.2%
7k
CLI tool for merging pretrained language models, combining strengths without retraining
Starred by
+15
Created 2 years ago
Updated 1 week ago
do-not-answer
by
Libr-AI
0.3%
303
Dataset for evaluating LLM safety mechanisms
Starred by
Created 2 years ago
Updated 1 year ago
reward-bench
by
allenai
0.4%
676
Reward model evaluation tool
Starred by
Created 2 years ago
Updated 7 months ago
OLMo
by
allenai
0.2%
6k
Open language model code for training, evaluation, and inference
Starred by
+4
Created 2 years ago
Updated 1 month ago
open-instruct
by
allenai
0.5%
4k
Training codebase for instruction-following language models
Starred by
+10
Created 2 years ago
Updated 20 hours ago
distilabel
by
argilla-io
0.8%
3k
Framework for synthetic data and AI feedback pipelines
Starred by
+12
Created 2 years ago
Updated 2 weeks ago
unified-io-2
by
allenai
0.8%
641
Unified-IO 2 code for training, inference, and demo
Starred by
Created 2 years ago
Updated 1 year ago
mamba
by
state-spaces
0.3%
17k
Mamba SSM architecture for sequence modeling
Starred by
+22
Created 2 years ago
Updated 2 days ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 2 years ago
Updated 7 months ago
alignment-handbook
by
huggingface
0.1%
5k
Handbook for aligning language models with human/AI preferences
Starred by
+11
Created 2 years ago
Updated 4 months ago
evaluate
by
huggingface
0.4%
2k
ML model evaluation library for standardized performance reporting
Starred by
+9
Created 3 years ago
Updated 1 month ago
chatarena
by
Farama-Foundation
0%
2k
Multi-agent environment for LLM research
Starred by
+1
Created 2 years ago
Updated 5 months ago
evals
by
openai
0.1%
18k
Framework for evaluating LLMs and LLM systems, plus benchmark registry
Starred by
+32
Created 3 years ago
Updated 2 months ago
OpenChatKit
by
togethercomputer
0.0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 2 years ago
Updated 1 year ago
large_language_model_training_playbook
by
huggingface
0.2%
491
Tips for training large language models
Starred by
+1
Created 2 years ago
Updated 2 years ago
MiniChain
by
srush
0.1%
1k
Tiny library for coding with large language models
Starred by
+7
Created 2 years ago
Updated 1 year ago
PaLM-rlhf-pytorch
by
lucidrains
0.1%
8k
RLHF implementation on PaLM
Starred by
+5
Created 3 years ago
Updated 3 months ago
trl
by
huggingface
0.4%
17k
Library for transformer RL
Starred by
+28
Created 5 years ago
Updated 2 days ago
theseus
by
facebookresearch
0.3%
2k
Library for differentiable nonlinear optimization layers in PyTorch
Starred by
Created 4 years ago
Updated 1 year ago
rl
by
pytorch
0.3%
3k
PyTorch library for reinforcement learning research
Starred by
Created 4 years ago
Updated 9 hours ago
diffusers
by
huggingface
0.3%
32k
PyTorch/Flax library for diffusion model research and applications
Starred by
+34
Created 3 years ago
Updated 22 hours ago
trlx
by
CarperAI
0.0%
5k
Distributed RLHF for LLMs
Starred by
+16
Created 3 years ago
Updated 2 years ago
nn-zero-to-hero
by
karpathy
1.0%
20k
Educational resource for neural network development, from basics to advanced models
Starred by
+5
Created 3 years ago
Updated 1 year ago
makemore
by
karpathy
0.6%
4k
Character-level language model for generating text
Starred by
Created 3 years ago
Updated 1 year ago
Awesome-LLM-Robotics
by
GT-RIPL
0.3%
4k
Curated list of papers using LLMs/multimodal models for robotics/RL
Starred by
Created 3 years ago
Updated 1 month ago
cleanrl
by
vwxyzjn
0.6%
9k
RL algorithms implementation with research-friendly features
Starred by
+3
Created 6 years ago
Updated 6 months ago
aqueduct
by
RunLLM
0%
520
MLOps framework for cloud deployment of LLM/ML workloads
Starred by
+3
Created 3 years ago
Updated 2 years ago
transformers
by
huggingface
0.2%
155k
ML library for pretrained model inference and training
Starred by
+96
Created 7 years ago
Updated 1 day ago
tianshou
by
thu-ml
0.1%
9k
PyTorch RL library for algorithm development and application
Starred by
+3
Created 7 years ago
Updated 1 month ago
TD3
by
sfujim
0.2%
2k
PyTorch implementation of TD3 for OpenAI gym tasks
Starred by
Created 7 years ago
Updated 2 years ago
PettingZoo
by
Farama-Foundation
0.4%
3k
Python library for multi-agent reinforcement learning environments
Starred by
Created 6 years ago
Updated 1 month ago
BIG-bench
by
google
0.1%
3k
Collaborative benchmark for probing and extrapolating LLM capabilities
Starred by
+11
Created 5 years ago
Updated 1 year ago
rlpyt
by
astooke
0%
2k
PyTorch library for deep reinforcement learning research
Starred by
+7
Created 6 years ago
Updated 5 years ago
rlkit
by
rail-berkeley
0.1%
3k
RL algorithm collection implemented in PyTorch
Starred by
+2
Created 8 years ago
Updated 1 year ago
roboschool
by
openai
0%
2k
Deprecated robot simulation software integrated with OpenAI Gym
Starred by
+5
Created 8 years ago
Updated 2 years ago
baselines
by
openai
0.0%
17k
RL algorithm implementations for research
Starred by
+25
Created 8 years ago
Updated 1 year ago
Feedback? Help us improve.