Nathan Lambert

Research Scientist at AI2

Authored Projects (1)

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Vincent Weisser (Cofounder of Prime Intellect).

rlhf-book by natolambert

Pandoc template for generating technical books

Compiles markdown content into PDF, EPUB, HTML, and DOCX outputs.
Features Pandoc-crossref for advanced cross-referencing of elements.
Automates build processes using Makefiles and allows content filtering.
Tailored for creating educational materials, like RLHF textbooks.

Created 1 year ago

Updated 2 days ago

Starred Projects (62)

marin by marin-community

Framework for reproducible foundation model research and development

lysandrejik:

john-b-yang:

percyliang:

froystig:

Created 1 year ago

Updated 9 hours ago

DeepGEMM by deepseek-ai

CUDA library for efficient FP8 GEMM kernels with fine-grained scaling

chiphuyen:

ekzhang:

patrickvonplaten:

parano:

Created 11 months ago

Updated 5 days ago

OpenEnv by meta-pytorch

Framework for agentic RL training environments

clmnt:

shizhediao:

mlejva:

hammer:

Created 3 months ago

Updated 2 days ago

nanochat by karpathy

A minimal, full-stack LLM implementation for accessible AI development

geohot:

ekzhang:

parano:

eugeneyan:

Created 3 months ago

Updated 2 days ago

PRarena by aavetis

Monitoring AI coding agent pull request performance

chiphuyen:

wsxiaoys:

Created 7 months ago

Updated 11 hours ago

chat_templates by chujiezheng

Chat templates for HuggingFace LLMs

shizhediao:

soldni:

winglian:

osanseviero:

Created 2 years ago

Updated 1 year ago

OLMoE.swift by allenai

Swift app for local, offline AI experience

ggerganov:

Created 1 year ago

Updated 9 months ago

verdict by haizelabs

Framework for LLM-as-a-judge systems, scaling evaluation

didierrlopes:

thomwolf:

willccbb:

hammer:

Created 1 year ago

Updated 2 months ago

CodeIO by hkust-nlp

Research paper enhancing LLMs' reasoning via code I/O prediction

Created 11 months ago

Updated 8 months ago

awesome-open-source-lms by allenai

Curated list of open-source language models and resources

hammer:

Created 1 year ago

Updated 3 months ago

awesome-o1 by srush

Bibliography for OpenAI's o1 project

binarybana:

tjbck:

JohannesHa:

shizhediao:

Created 1 year ago

Updated 1 year ago

RLAIF-V by RLHF-V

Framework for aligning MLLMs using open-source AI feedback

Created 1 year ago

Updated 8 months ago

OLMoE by allenai

Open MoE language model research paper

chiphuyen:

peakji:

Created 1 year ago

Updated 3 months ago

nomic by nomic-ai

Python client for massive unstructured data interaction

willingc:

transitive-bullshit:

gakonst:

chitalian:

Created 3 years ago

Updated 2 months ago

MAP-NEO by multimodal-art-projection

Open-source LLM with pretraining data, pipeline, scripts, and alignment code

hiyouga:

soldni:

Created 1 year ago

Updated 11 months ago

rlhf-book by natolambert

Pandoc template for generating technical books

chiphuyen:

vincentweisser:

Created 1 year ago

Updated 2 days ago

RLHF-Reward-Modeling by RLHFlow

Recipes to train reward models for RLHF

hiyouga:

osanseviero:

Created 1 year ago

Updated 8 months ago

OpenRLHF by OpenRLHF

RLHF framework for scalable training of large language models

beyang:

parano:

vincentweisser:

binarybana:

Created 2 years ago

Updated 3 days ago

arena-hard-auto by lmarena

Automatic LLM benchmark for instruction-tuned models, correlating with human preference

hiyouga:

pgarbacki:

mlabonne:

zhuohan123:

Created 2 years ago

Updated 6 months ago

smol-podcaster by FanaHOVA

Podcast production agent

kiwicopple:

schickling:

ishaan-jaff:

transitive-bullshit:

Created 2 years ago

Updated 2 months ago

yet-another-applied-llm-benchmark by carlini

LLM benchmark for evaluating models on previously asked programming questions

karpathy:

ezyang:

patrickvonplaten:

simonw:

Created 2 years ago

Updated 8 months ago

DataDreamer by datadreamer-dev

Python library for synthetic data generation and training workflows

soldni:

hammer:

omarsar:

JustinLin610:

Created 2 years ago

Updated 11 months ago

alpaca_eval by tatsu-lab

Automatic evaluator for instruction-following language models

philschmid:

shizhediao:

bhancock8:

lewtun:

Created 2 years ago

Updated 5 months ago

SPIN by uclaml

Self-Play Fine-Tuning (SPIN) research paper implementation

CodeCreator:

lewtun:

pgarbacki:

hiyouga:

Created 1 year ago

Updated 1 year ago

cutlass by NVIDIA

CUDA C++ and Python DSLs for high-performance linear algebra

tridao:

chiphuyen:

joker-eph:

mattjj:

Created 8 years ago

Updated 2 days ago

llm-swarm by huggingface

CLI tool to manage scalable open LLM inference endpoints in Slurm clusters

mlabonne:

thomwolf:

osanseviero:

lewtun:

Created 2 years ago

Updated 1 year ago

mergekit by arcee-ai

CLI tool for merging pretrained language models, combining strengths without retraining

shizhediao:

Ying1123:

transitive-bullshit:

thomwolf:

Created 2 years ago

Updated 1 week ago

do-not-answer by Libr-AI

Dataset for evaluating LLM safety mechanisms

winglian:

Created 2 years ago

Updated 1 year ago

reward-bench by allenai

Reward model evaluation tool

lewtun:

shizhediao:

soldni:

Created 2 years ago

Updated 7 months ago

OLMo by allenai

Open language model code for training, evaluation, and inference

tjbck:

winglian:

john-b-yang:

transitive-bullshit:

Created 2 years ago

Updated 1 month ago

open-instruct by allenai

Training codebase for instruction-following language models

hammer:

zhyncs:

vincentweisser:

RJT1990:

Created 2 years ago

Updated 20 hours ago

distilabel by argilla-io

Framework for synthetic data and AI feedback pipelines

lvwerra:

jn2clark:

hiyouga:

pgarbacki:

Created 2 years ago

Updated 2 weeks ago

unified-io-2 by allenai

Unified-IO 2 code for training, inference, and demo

Jiayi-Pan:

shizhediao:

jwyang:

teknium1:

Created 2 years ago

Updated 1 year ago

mamba by state-spaces

Mamba SSM architecture for sequence modeling

geohot:

alexchen4ai:

luiscape:

zhiyuan8:

Created 2 years ago

Updated 2 days ago

FastChat by lm-sys

Open platform for training, serving, and evaluating LLM-based chatbots

zjasper666:

aangelopoulos:

osanseviero:

lewtun:

Created 2 years ago

Updated 7 months ago

alignment-handbook by huggingface

Handbook for aligning language models with human/AI preferences

eugeneyan:

drishanarora:

philschmid:

vincentweisser:

Created 2 years ago

Updated 4 months ago

evaluate by huggingface

ML model evaluation library for standardized performance reporting

clmnt:

chiphuyen:

lvwerra:

hammer:

Created 3 years ago

Updated 1 month ago

chatarena by Farama-Foundation

Multi-agent environment for LLM research

hammer:

chiphuyen:

omarsar:

abidlabs:

Created 2 years ago

Updated 5 months ago

evals by openai

Framework for evaluating LLMs and LLM systems, plus benchmark registry

aangelopoulos:

chiphuyen:

agola11:

taranjeet:

Created 3 years ago

Updated 2 months ago

OpenChatKit by togethercomputer

Open-source toolkit for building specialized/general-purpose chat models

AntonOsika:

pgarbacki:

winglian:

casper-hansen:

Created 2 years ago

Updated 1 year ago

large_language_model_training_playbook by huggingface

Tips for training large language models

stas00:

osanseviero:

lysandrejik:

abhishekkrthakur:

Created 2 years ago

Updated 2 years ago

MiniChain by srush

Tiny library for coding with large language models

soldni:

hiyouga:

lysandrejik:

jph00:

Created 2 years ago

Updated 1 year ago

PaLM-rlhf-pytorch by lucidrains

RLHF implementation on PaLM

pgarbacki:

winglian:

osanseviero:

jquesnelle:

Created 3 years ago

Updated 3 months ago

trl by huggingface

Library for transformer RL

jeffchuber:

vincentweisser:

tjbck:

alexchen4ai:

Created 5 years ago

Updated 2 days ago

theseus by facebookresearch

Library for differentiable nonlinear optimization layers in PyTorch

srush:

Created 4 years ago

Updated 1 year ago

rl by pytorch

PyTorch library for reinforcement learning research

evhub:

Jiayi-Pan:

Created 4 years ago

Updated 9 hours ago

diffusers by huggingface

PyTorch/Flax library for diffusion model research and applications

karpathy:

clmnt:

vincentweisser:

hammer:

Created 3 years ago

Updated 22 hours ago

trlx by CarperAI

Distributed RLHF for LLMs

nat:

chiphuyen:

eugeneyan:

huybery:

Created 3 years ago

Updated 2 years ago

nn-zero-to-hero by karpathy

Educational resource for neural network development, from basics to advanced models

zhiyuan8:

didierrlopes:

omarsar:

ogabrielluiz:

Created 3 years ago

Updated 1 year ago

makemore by karpathy

Character-level language model for generating text

osanseviero:

deshraj:

Created 3 years ago

Updated 1 year ago

Awesome-LLM-Robotics by GT-RIPL

Curated list of papers using LLMs/multimodal models for robotics/RL

Jiayi-Pan:

vnivargi:

huybery:

Created 3 years ago

Updated 1 month ago

cleanrl by vwxyzjn

RL algorithms implementation with research-friendly features

sxyu:

john-b-yang:

lysandrejik:

infwinston:

Created 6 years ago

Updated 6 months ago

aqueduct by RunLLM

MLOps framework for cloud deployment of LLM/ML workloads

ShishirPatil:

hammer:

jheer:

spencerkimball:

Created 3 years ago

Updated 2 years ago

transformers by huggingface

ML library for pretrained model inference and training

clmnt:

lilianweng:

karpathy:

tjbck:

Created 7 years ago

Updated 1 day ago

tianshou by thu-ml

PyTorch RL library for algorithm development and application

chiphuyen:

pgarbacki:

shizhediao:

youkaichao:

Created 7 years ago

Updated 1 month ago

TD3 by sfujim

PyTorch implementation of TD3 for OpenAI gym tasks

lucidrains:

jachiam:

Created 7 years ago

Updated 2 years ago

PettingZoo by Farama-Foundation

Python library for multi-agent reinforcement learning environments

chiphuyen:

Created 6 years ago

Updated 1 month ago

BIG-bench by google

Collaborative benchmark for probing and extrapolating LLM capabilities

ShengjiaZhao:

chiphuyen:

hiyouga:

nirga:

Created 5 years ago

Updated 1 year ago

rlpyt by astooke

PyTorch library for deep reinforcement learning research

chenlin9:

suquark:

hammer:

jspahrsummers:

Created 6 years ago

Updated 5 years ago

rlkit by rail-berkeley

RL algorithm collection implemented in PyTorch

aravindsrinivas:

MishaLaskin:

millionintegrals:

jiamings:

Created 8 years ago

Updated 1 year ago

roboschool by openai

Deprecated robot simulation software integrated with OpenAI Gym

aravindsrinivas:

truell20:

bcherny:

vincentweisser:

Created 8 years ago

Updated 2 years ago

baselines by openai

RL algorithm implementations for research

aravindsrinivas:

lilianweng:

JohannesHa:

jspahrsummers:

Created 8 years ago

Updated 1 year ago
