Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Nathan Lambert
Nathan Lambert
Research Scientist at AI2
GitHub
X
Authored Projects (1)
Starred
by
Chip Huyen
(Author of "AI Engineering", "Designing Machine Learning Systems")
and
Vincent Weisser
(Cofounder of Prime Intellect)
.
rlhf-book
by
natolambert
0.4%
2k
Pandoc template for generating technical books
Compiles markdown content into PDF, EPUB, HTML, and DOCX outputs.
Features Pandoc-crossref for advanced cross-referencing of elements.
Automates build processes using Makefiles and allows content filtering.
Tailored for creating educational materials, like RLHF textbooks.
Created 2 years ago
Updated 2 days ago
Starred Projects (70)
Math-To-Manim
by
HarleyCoops
5.4%
2k
AI tool for generating mathematical animations from text using AI
Starred by
Created 1 year ago
Updated 2 days ago
autoresearch
by
karpathy
1.6%
84k
Autonomous LLM research agent for single-GPU training
Starred by
+23
Created 2 months ago
Updated 2 months ago
OLMo-core
by
allenai
0.7%
1k
PyTorch building blocks for large language model training and inference
Starred by
Created 2 years ago
Updated 23 hours ago
reasoning-gym
by
open-thought
0.1%
1k
Procedural dataset generator for reasoning models
Starred by
+6
Created 1 year ago
Updated 1 month ago
marin
by
marin-community
3.1%
1k
Framework for reproducible foundation model research and development
Starred by
+2
Created 2 years ago
Updated 23 hours ago
DeepGEMM
by
deepseek-ai
0.5%
7k
CUDA library for efficient FP8 GEMM kernels with fine-grained scaling
Starred by
+6
Created 1 year ago
Updated 2 weeks ago
OpenEnv
by
meta-pytorch
0.4%
2k
Framework for agentic RL training environments
Starred by
+5
Created 7 months ago
Updated 6 days ago
nanochat
by
karpathy
0.7%
54k
A minimal, full-stack LLM implementation for accessible AI development
Starred by
+26
Created 7 months ago
Updated 3 weeks ago
PRarena
by
aavetis
0%
299
Monitoring AI coding agent pull request performance
Starred by
Created 1 year ago
Updated 1 day ago
chat_templates
by
chujiezheng
0%
718
Chat templates for HuggingFace LLMs
Starred by
Created 2 years ago
Updated 1 year ago
OLMoE.swift
by
allenai
0%
311
Swift app for local, offline AI experience
Starred by
Created 1 year ago
Updated 1 year ago
verdict
by
haizelabs
0.9%
341
Framework for LLM-as-a-judge systems, scaling evaluation
Starred by
Created 1 year ago
Updated 6 months ago
CodeIO
by
hkust-nlp
0%
569
Research paper enhancing LLMs' reasoning via code I/O prediction
Created 1 year ago
Updated 1 year ago
awesome-open-source-lms
by
allenai
0%
364
Curated list of open-source language models and resources
Starred by
Created 1 year ago
Updated 8 months ago
awesome-o1
by
srush
0.1%
1k
Bibliography for OpenAI's o1 project
Starred by
+3
Created 1 year ago
Updated 1 year ago
RLAIF-V
by
RLHF-V
0.4%
454
Framework for aligning MLLMs using open-source AI feedback
Created 2 years ago
Updated 1 year ago
OLMoE
by
allenai
0.1%
1k
Open MoE language model research paper
Starred by
Created 1 year ago
Updated 8 months ago
nomic
by
nomic-ai
0%
2k
Python client for massive unstructured data interaction
Starred by
Created 3 years ago
Updated 6 months ago
MAP-NEO
by
multimodal-art-projection
0.1%
986
Open-source LLM with pretraining data, pipeline, scripts, and alignment code
Starred by
Created 2 years ago
Updated 1 year ago
rlhf-book
by
natolambert
0.4%
2k
Pandoc template for generating technical books
Starred by
Created 2 years ago
Updated 2 days ago
RLHF-Reward-Modeling
by
RLHFlow
0%
2k
Recipes to train reward models for RLHF
Starred by
Created 2 years ago
Updated 1 year ago
OpenRLHF
by
OpenRLHF
0.3%
10k
RLHF framework for scalable training of large language models
Starred by
+9
Created 2 years ago
Updated 1 day ago
arena-hard-auto
by
lmarena
0.2%
1k
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 2 years ago
Updated 11 months ago
elevenlabs-python
by
elevenlabs
0.4%
3k
Python SDK for lifelike text-to-speech and voice AI
Starred by
Created 3 years ago
Updated 1 day ago
WildBench
by
allenai
0%
253
Benchmarking LLMs with challenging real-user tasks
Starred by
Created 2 years ago
Updated 1 year ago
smol-podcaster
by
FanaHOVA
0%
414
Podcast production agent
Starred by
+2
Created 2 years ago
Updated 6 months ago
yet-another-applied-llm-benchmark
by
carlini
0.3%
1k
LLM benchmark for evaluating models on previously asked programming questions
Starred by
+2
Created 2 years ago
Updated 1 year ago
DataDreamer
by
datadreamer-dev
0%
1k
Python library for synthetic data generation and training workflows
Starred by
+1
Created 3 years ago
Updated 1 year ago
alpaca_eval
by
tatsu-lab
0.1%
2k
Automatic evaluator for instruction-following language models
Starred by
+3
Created 3 years ago
Updated 9 months ago
SPIN
by
uclaml
0%
1k
Self-Play Fine-Tuning (SPIN) research paper implementation
Starred by
Created 2 years ago
Updated 2 years ago
cutlass
by
NVIDIA
0.5%
10k
CUDA C++ and Python DSLs for high-performance linear algebra
Starred by
+22
Created 8 years ago
Updated 1 day ago
llm-swarm
by
huggingface
0%
287
CLI tool to manage scalable open LLM inference endpoints in Slurm clusters
Starred by
Created 2 years ago
Updated 1 year ago
mergekit
by
arcee-ai
0.2%
7k
CLI tool for merging pretrained language models, combining strengths without retraining
Starred by
+15
Created 2 years ago
Updated 3 weeks ago
do-not-answer
by
Libr-AI
0.3%
327
Dataset for evaluating LLM safety mechanisms
Starred by
Created 2 years ago
Updated 1 year ago
reward-bench
by
allenai
0%
715
Reward model evaluation tool
Starred by
Created 2 years ago
Updated 3 months ago
OLMo
by
allenai
0.1%
7k
Open language model code for training, evaluation, and inference
Starred by
+4
Created 3 years ago
Updated 6 months ago
open-instruct
by
allenai
0.1%
4k
Training codebase for instruction-following language models
Starred by
+10
Created 3 years ago
Updated 1 day ago
distilabel
by
argilla-io
0.1%
3k
Framework for synthetic data and AI feedback pipelines
Starred by
+12
Created 2 years ago
Updated 2 days ago
unified-io-2
by
allenai
0%
648
Unified-IO 2 code for training, inference, and demo
Starred by
Created 2 years ago
Updated 2 years ago
mamba
by
state-spaces
0.2%
18k
Mamba SSM architecture for sequence modeling
Starred by
+22
Created 2 years ago
Updated 2 weeks ago
FastChat
by
lm-sys
0.0%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 3 years ago
Updated 3 weeks ago
alignment-handbook
by
huggingface
0.1%
6k
Handbook for aligning language models with human/AI preferences
Starred by
+11
Created 2 years ago
Updated 1 day ago
evaluate
by
huggingface
0.1%
2k
ML model evaluation library for standardized performance reporting
Starred by
+9
Created 4 years ago
Updated 1 day ago
chatarena
by
Farama-Foundation
0.1%
2k
Multi-agent environment for LLM research
Starred by
+1
Created 3 years ago
Updated 9 months ago
evals
by
openai
0.2%
19k
Framework for evaluating LLMs and LLM systems, plus benchmark registry
Starred by
+32
Created 3 years ago
Updated 1 month ago
OpenChatKit
by
togethercomputer
0.0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 3 years ago
Updated 2 years ago
large_language_model_training_playbook
by
huggingface
0%
501
Tips for training large language models
Starred by
+1
Created 3 years ago
Updated 3 years ago
MiniChain
by
srush
0%
1k
Tiny library for coding with large language models
Starred by
+7
Created 3 years ago
Updated 1 year ago
PaLM-rlhf-pytorch
by
lucidrains
0%
8k
RLHF implementation on PaLM
Starred by
+5
Created 3 years ago
Updated 7 months ago
trl
by
huggingface
0.3%
18k
Library for transformer RL
Starred by
+28
Created 6 years ago
Updated 23 hours ago
openrlbenchmark
by
openrlbenchmark
0.4%
264
RL experiment benchmarking and comparison
Starred by
Created 4 years ago
Updated 2 months ago
theseus
by
facebookresearch
0.1%
2k
Library for differentiable nonlinear optimization layers in PyTorch
Starred by
Created 4 years ago
Updated 1 year ago
rl
by
pytorch
0.3%
3k
PyTorch library for reinforcement learning research
Starred by
Created 4 years ago
Updated 1 day ago
diffusers
by
huggingface
0.1%
34k
PyTorch/Flax library for diffusion model research and applications
Starred by
+34
Created 4 years ago
Updated 1 day ago
trlx
by
CarperAI
0.0%
5k
Distributed RLHF for LLMs
Starred by
+16
Created 3 years ago
Updated 2 years ago
nn-zero-to-hero
by
karpathy
3.8%
23k
Educational resource for neural network development, from basics to advanced models
Starred by
+5
Created 3 years ago
Updated 1 year ago
makemore
by
karpathy
0.8%
4k
Character-level language model for generating text
Starred by
Created 4 years ago
Updated 2 years ago
Awesome-LLM-Robotics
by
GT-RIPL
0.1%
4k
Curated list of papers using LLMs/multimodal models for robotics/RL
Starred by
Created 3 years ago
Updated 1 month ago
cleanrl
by
vwxyzjn
0.6%
10k
RL algorithms implementation with research-friendly features
Starred by
+3
Created 7 years ago
Updated 1 month ago
aqueduct
by
RunLLM
0%
519
MLOps framework for cloud deployment of LLM/ML workloads
Starred by
+3
Created 4 years ago
Updated 3 years ago
transformers
by
huggingface
0.1%
161k
ML library for pretrained model inference and training
Starred by
+96
Created 7 years ago
Updated 23 hours ago
tianshou
by
thu-ml
0.2%
11k
PyTorch RL library for algorithm development and application
Starred by
+3
Created 8 years ago
Updated 1 month ago
TD3
by
sfujim
0.1%
2k
PyTorch implementation of TD3 for OpenAI gym tasks
Starred by
Created 8 years ago
Updated 2 years ago
PettingZoo
by
Farama-Foundation
0.1%
3k
Python library for multi-agent reinforcement learning environments
Starred by
Created 6 years ago
Updated 2 days ago
BIG-bench
by
google
0.1%
3k
Collaborative benchmark for probing and extrapolating LLM capabilities
Starred by
+11
Created 5 years ago
Updated 1 year ago
AirLearning
by
harvard-edge
0%
253
Reinforcement learning infrastructure for autonomous aerial robots
Created 7 years ago
Updated 4 years ago
rlpyt
by
astooke
0%
2k
PyTorch library for deep reinforcement learning research
Starred by
+7
Created 7 years ago
Updated 5 years ago
rlkit
by
rail-berkeley
0.1%
3k
RL algorithm collection implemented in PyTorch
Starred by
+2
Created 8 years ago
Updated 1 year ago
roboschool
by
openai
0%
2k
Deprecated robot simulation software integrated with OpenAI Gym
Starred by
+5
Created 9 years ago
Updated 3 years ago
baselines
by
openai
0.0%
17k
RL algorithm implementations for research
Starred by
+25
Created 9 years ago
Updated 1 year ago
Feedback? Help us improve.