Yaowei Zheng

Author of LLaMA-Factory

Authored Projects (4)

Starred by

Patrick von Platen (Author of Hugging Face Diffusers; Research Engineer at Mistral), Alex Chen (Cofounder of Nexa AI), Tony Lee (Author of HELM; Research Engineer at Meta), Lysandre Debut (Chief Open-Source Officer at Hugging Face), and 25 more.

LLaMA-Factory by hiyouga

Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)

Fine-tune LLaMA, Mistral, Qwen, Gemma, etc., via CLI/Web UI.
Supports pre-training, SFT, reward modeling, PPO, DPO, KTO, ORPO.
Offers LoRA, QLoRA, GaLore, BAdam, FlashAttention-2, Unsloth, and more.
Enables multi-turn dialogue, tool use, image/video/audio understanding.

Created 2 years ago

Updated 1 day ago

Starred by

Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA) and Alex Chen (Cofounder of Nexa AI).

EasyR1 by hiyouga

RL training framework for multi-modality models

Supports Llama3, Qwen, DeepSeek-R1 language models; Qwen2-VL vision language models.
Implements GRPO, Reinforce++, ReMax, RLOO algorithms.
Enables padding-free training, checkpoint resuming, and Wandb/SwanLab/MLflow tracking.
Uses vLLM's SPMD mode for efficient, scalable training.

Created 9 months ago

Updated 3 days ago

ChatGLM-Efficient-Tuning by hiyouga

Fine-tuning tool for ChatGLM-6B

PEFT (LoRA, P-Tuning V2, Freeze) for efficient adaptation.
Supports full parameter fine-tuning, quantization (4/8-bit).
Includes RLHF training, reward modeling, and evaluation scripts.
Web UI, API, and CLI demos for interaction.

Created 2 years ago

Updated 2 years ago

Starred by

Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA).

FastEdit by hiyouga

Tool for fast edits to large language models

Injects fresh knowledge into LLMs using Rank-One Model Editing (ROME).
Supports models like LLaMA, Falcon, Baichuan, InternLM, and GPT-J.
Edits models in FP16, with reported edit times of around 10 seconds.

Created 2 years ago

Updated 2 years ago

Starred Projects (413)

GPTCache by zilliztech

Semantic cache for LLM queries, integrated with LangChain and LlamaIndex

chiphuyen:

hammer:

Ying1123:

ogabrielluiz:

Created 2 years ago

Updated 4 months ago

Acontext by memodb-io

Context data platform for self-learning AI agents

Created 4 months ago

Updated 1 day ago

memobase by memodb-io

Memory system for GenAI apps, enabling long-term user understanding

Created 1 year ago

Updated 1 week ago

toon by toon-format

Compact data format for LLMs

anurag:

transitive-bullshit:

hammer:

abhiaiyer91:

Created 1 month ago

Updated 1 day ago

flame by fla-org

Minimal, efficient framework for LLM training

winglian:

Created 10 months ago

Updated 2 weeks ago

PatentWriterAgent by ninehills

AI agent for automated patent drafting

Created 1 month ago

Updated 1 month ago

PokeeResearchOSS by Pokee-AI

Deep research agent for complex queries

Created 1 month ago

Updated 1 month ago

DeepResearch by Alibaba-NLP

Benchmark for LLMs in web traversal

hammer:

yiranwu0:

omarsar:

winglian:

Created 10 months ago

Updated 1 week ago

SkyRL by NovaSky-AI

RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks

lewtun:

WoosukKwon:

zhuohan123:

JohannesHa:

Created 7 months ago

Updated 4 days ago

gem by axon-rl

Agentic LLM training environment for interactive reinforcement learning

lewtun:

vincentweisser:

willccbb:

winglian:

Created 6 months ago

Updated 3 weeks ago

tunix by google

JAX-native library for efficient LLM post-training

ekzhang:

zhyncs:

ebursztein:

hammer:

Created 8 months ago

Updated 3 days ago

tilelang by tile-ai

DSL for high-performance GPU/CPU kernel development (GEMM, attention, etc.)

JustinLin610:

luiscape:

zhuohan123:

yang-song:

Created 1 year ago

Updated 22 hours ago

codex by openai

Coding agent CLI tool for terminal-based chat-driven development

ValentaTomas:

gakonst:

victortaelin:

gregpr07:

Created 7 months ago

Updated 1 day ago

atropos by NousResearch

RL environment framework for LLM trajectory collection/evaluation

gakonst:

thomwolf:

pgarbacki:

teknium1:

Created 7 months ago

Updated 4 days ago

checkpoint-engine by MoonshotAI

Middleware for efficient LLM weight updates during inference

pgarbacki:

zhuohan123:

luiscape:

zhyncs:

Created 2 months ago

Updated 6 days ago

LESS by princeton-nlp

Data selection research paper for targeted instruction tuning

winglian:

transitive-bullshit:

hammer:

Created 1 year ago

Updated 1 year ago

DataFlow by OpenDCAI

Data preparation and LLM training system

Created 1 year ago

Updated 3 days ago

trae-agent by bytedance

LLM-powered CLI for software engineering tasks

yiranwu0:

syrusakbary:

zhyncs:

transitive-bullshit:

Created 5 months ago

Updated 2 months ago

VeOmni by ByteDance-Seed

Framework for scaling multimodal model training across accelerators

zhyncs:

JustinLin610:

Created 8 months ago

Updated 1 day ago

llama.cpp by ggml-org

C/C++ library for local LLM inference

karpathy:

nat:

tobi:

julien-c:

Created 2 years ago

Updated 1 day ago

DFT by yongliang-wu

Improving SFT generalization with reward rectification

winglian:

Created 4 months ago

Updated 3 weeks ago

harmony by openai

Renderer for OpenAI's harmony response format

chiphuyen:

shyamal-anadkat:

t3dotgg:

ankane:

Created 4 months ago

Updated 3 weeks ago

gpt-oss-recipes by huggingface

OpenAI GPT-OSS model optimization and fine-tuning

eugeneyan:

willccbb:

lysandrejik:

lewtun:

Created 4 months ago

Updated 3 months ago

gpt-oss by openai

Open-weight LLMs for reasoning and agents

karpathy:

danielhanchen:

borzunov:

chiphuyen:

Created 5 months ago

Updated 1 month ago

ARPO by RUC-NLPIR

Agentic RL for LLM tool use

Created 4 months ago

Updated 2 weeks ago

gemini-cli by google-gemini

AI agent for terminal workflows

MagMueller:

lucidrains:

khou22:

koraykv:

Created 7 months ago

Updated 1 day ago

qwen-code by QwenLM

AI coding agent for complex codebases

JustinLin610:

infwinston:

chiphuyen:

victortaelin:

Created 5 months ago

Updated 1 day ago

higgs-audio by boson-ai

Expressive text-to-audio generation model

jiamings:

alexchen4ai:

Created 4 months ago

Updated 2 months ago

Show-o by showlab

Unified transformer research paper for multimodal tasks

Created 1 year ago

Updated 1 month ago

Kimi-K2 by MoonshotAI

State-of-the-art MoE language model

krisr:

lucidrains:

merrymercy:

shizhediao:

Created 5 months ago

Updated 3 weeks ago

Skywork-R1V by SkyworkAI

Multimodal model for advanced visual/text reasoning, using chain-of-thought

hammer:

Created 8 months ago

Updated 1 week ago

12-factor-agents by humanlayer

Principles for reliable LLM application development

didierrlopes:

omarsar:

shyamal-anadkat:

pgarbacki:

Created 8 months ago

Updated 2 months ago

GraphGen by open-sciencelab

Framework for LLM fine-tuning with knowledge-driven synthetic data

Created 10 months ago

Updated 4 days ago

GLM-V by zai-org

Multimodal reasoning model with a "thinking" paradigm

Created 5 months ago

Updated 1 month ago

POLARIS by ChenxinAn-fdu

Scaling RL for advanced reasoning models

Created 5 months ago

Updated 1 month ago

slime by THUDM

LLM post-training framework for RL scaling

jiamings:

hammer:

pgarbacki:

pcmoritz:

Created 5 months ago

Updated 1 day ago

python-sdk by modelcontextprotocol

Python SDK for Model Context Protocol (MCP) servers/clients

ValentaTomas:

simonw:

eugeneyan:

alexchen4ai:

Created 1 year ago

Updated 2 days ago

flash-linear-attention by fla-org

Efficient Torch/Triton implementations for linear attention models

xiezhq-hermann:

zhyncs:

yang-song:

winglian:

Created 1 year ago

Updated 1 day ago

reverse-engineering-gemma-3n by antimatter15

Reverse engineering Google's edge-optimized language model for local inference

Created 6 months ago

Updated 6 months ago

gemini-fullstack-langgraph-quickstart by google-gemini

Full-stack agent quickstart

logankilpatrick:

amin3141:

omarsar:

casper-hansen:

Created 6 months ago

Updated 2 weeks ago

DeepEyes by Visual-Agent

Agentic RL training framework

Created 8 months ago

Updated 1 week ago

fastmcp by jlowin

Pythonic SDK for building Model Context Protocol (MCP) servers/clients

rasbt:

didierrlopes:

shyamal-anadkat:

chiphuyen:

Created 1 year ago

Updated 4 days ago

langfun by google

Library for object-oriented LLM prompting

ogabrielluiz:

Created 2 years ago

Updated 1 week ago

GPTQModel by ModelCloud

LLM compression toolkit for accelerated CPU/GPU inference

Ying1123:

Created 1 year ago

Updated 2 days ago

WeClone by xming521

Digital twin one-stop solution

chiphuyen:

Created 1 year ago

Updated 3 weeks ago

arena-hard-auto by lmarena

Automatic LLM benchmark for instruction-tuned models, correlating with human preference

pgarbacki:

mlabonne:

zhuohan123:

merrymercy:

Created 2 years ago

Updated 5 months ago

vllm-ascend by vllm-project

Hardware plugin for vLLM on Ascend NPU

Created 10 months ago

Updated 1 day ago

deer-flow by bytedance

Deep research framework combining language models with specialized tools

apsdehal:

yiranwu0:

ekzhu:

omarsar:

Created 6 months ago

Updated 1 day ago

aci by aipotheosis-labs

Open-source infra for AI-agent tool use

robinjhuang:

transitive-bullshit:

torantulino:

Created 1 year ago

Updated 2 months ago

FramePack by lllyasviel

Desktop software for video generation via next-frame prediction

dror-weiss:

JustinLin610:

Created 7 months ago

Updated 1 month ago

Seed-Thinking-v1.5 by ByteDance-Seed

Reasoning model for STEM, coding, and general tasks

shizhediao:

Created 7 months ago

Updated 5 months ago

PipelineRL by ServiceNow

Scalable RL for training LLM agents

lewtun:

Created 7 months ago

Updated 1 day ago

AWorld by inclusionAI

Multi-agent runtime for self-improvement

Created 8 months ago

Updated 2 days ago

Tina by shangshang-wang

LoRA reasoning models

Created 7 months ago

Updated 2 months ago

markitdown by microsoft

Python tool for converting files to Markdown for LLM text analysis

chiphuyen:

omarsar:

handotdev:

gregpr07:

Created 1 year ago

Updated 6 days ago

MagiAttention by SandAI-org

Distributed attention mechanism research paper for ultra-long context, heterogeneous data training

pgarbacki:

Created 7 months ago

Updated 2 days ago

cooragent by LeapLabTHU

AI agent collaboration community for building agents and workflows

Created 7 months ago

Updated 2 months ago

llm-foundry by mosaicml

LLM training code for Databricks foundation models

vincentweisser:

hammer:

Sanger2000:

john-b-yang:

Created 2 years ago

Updated 1 month ago

Kimi-VL by MoonshotAI

Vision-language model for multimodal reasoning and agent tasks

Created 7 months ago

Updated 4 months ago

prismatic-vlms by TRI-ML

VLM codebase for training visually-conditioned language models

Jiayi-Pan:

hammer:

Created 1 year ago

Updated 1 year ago

UI-TARS-desktop by bytedance

GUI agent app for computer control via natural language

hugs:

ekzhu:

evhub:

gregpr07:

Created 10 months ago

Updated 3 days ago

DAPO by BytedTsinghua-SIA

Open-source RL system for large-scale LLM training

pgarbacki:

Created 8 months ago

Updated 6 months ago

ml-cross-entropy by apple

PyTorch module for memory-efficient cross-entropy in LLMs

patrickvonplaten:

pgarbacki:

divchenko:

shimmyshimmer:

Created 1 year ago

Updated 2 months ago

Light-R1 by Qihoo360

Math model research paper using curriculum SFT, DPO, and RL

Created 9 months ago

Updated 2 months ago

easy-dataset by ConardLi

Dataset tool for LLM fine-tuning

Created 9 months ago

Updated 1 day ago

GamingAgent by lmgame-org

SDK for LLM/VLM gaming agents, enabling model evaluation via games

winglian:

Created 9 months ago

Updated 2 weeks ago

Gymnasium by Farama-Foundation

Python API standard for single-agent reinforcement learning environments

vincentweisser:

thomwolf:

chiphuyen:

ValentaTomas:

Created 3 years ago

Updated 1 day ago

Awesome-LLM-Post-training by mbzuai-oryx

Curated list of LLM post-training resources

Created 9 months ago

Updated 1 month ago

Search-R1 by PeterGriffinJin

RL framework for training LLMs to use search engines

jxnl:

omarsar:

vincentweisser:

pgarbacki:

Created 9 months ago

Updated 2 weeks ago

Wan2.1 by Wan-Video

Video foundation model for text-to-video, image-to-video, and video editing

luiscape:

shizhediao:

jiamings:

jn2clark:

Created 9 months ago

Updated 4 months ago

FlashMLA by deepseek-ai

Efficient CUDA kernels for MLA decoding

hammer:

jmorganca:

shizhediao:

suquark:

Created 9 months ago

Updated 2 months ago

VLM-R1 by om-ai-lab

R1-style vision-language model for visual understanding, trained with reinforcement learning

Created 9 months ago

Updated 1 month ago

Open-Reasoner-Zero by Open-Reasoner-Zero

Open-source RL training for scalable reasoning on base models

Created 9 months ago

Updated 6 months ago

open-infra-index by deepseek-ai

AI infrastructure tools for efficient AGI development

lattner:

vincentweisser:

hammer:

pankajroark:

Created 9 months ago

Updated 6 months ago

Awesome-ML-SYS-Tutorial by zhaochenyang20

ML SYS learning notes and code

lilianweng:

yiranwu0:

shizhediao:

merrymercy:

Created 1 year ago

Updated 5 days ago

demystify-long-cot by eddycmu

Research code for long chain-of-thought reasoning in LLMs

hammer:

willccbb:

Created 10 months ago

Updated 6 months ago

rllm by rllm-org

Framework for post-training language agents via reinforcement learning

yiranwu0:

WoosukKwon:

pgarbacki:

vincentweisser:

Created 10 months ago

Updated 4 days ago

Logic-RL by Unakar

LLM reasoning via rule-based reinforcement learning, research paper

Created 10 months ago

Updated 8 months ago

s1 by simplescaling

Test-time scaling recipe for strong reasoning performance

lewtun:

shizhediao:

taranjeet:

chiphuyen:

Created 10 months ago

Updated 5 months ago

open-thoughts by open-thoughts

Open dataset for training reasoning models

hammer:

vincentweisser:

shizhediao:

JohannesHa:

Created 10 months ago

Updated 2 months ago

oumi by oumi-ai

Open-source platform for end-to-end foundation model lifecycle

hammer:

transitive-bullshit:

omarsar:

thomwolf:

Created 1 year ago

Updated 1 day ago

curator by bespokelabsai

Synthetic data curation tool for post-training and structured data extraction

RJT1990:

winglian:

jmorganca:

mchiang0610:

Created 1 year ago

Updated 4 months ago

TinyZero by Jiayi-Pan

Minimal reproduction of DeepSeek R1 Zero for countdown/multiplication tasks

geohot:

karpathy:

vincentweisser:

chiphuyen:

Created 10 months ago

Updated 7 months ago

DeepSeek-R1 by deepseek-ai

Reasoning models research paper

shimmyshimmer:

rasbt:

transitive-bullshit:

omarsar:

Created 10 months ago

Updated 5 months ago

simpleRL-reason by hkust-nlp

RL recipe for reasoning ability in models

vincentweisser:

shizhediao:

lewtun:

osanseviero:

Created 10 months ago

Updated 4 months ago

open-r1 by huggingface

SDK for reproducing DeepSeek-R1

chiphuyen:

hammer:

vincentweisser:

shizhediao:

Created 10 months ago

Updated 6 days ago

Math-Verify by huggingface

Math evaluator for LLM outputs in mathematical tasks

zhyncs:

shizhediao:

thomwolf:

lewtun:

Created 10 months ago

Updated 5 months ago

SkyThought by NovaSky-AI

Training recipes for Sky-T1 family of models

luiscape:

chiphuyen:

ashtom:

sxyu:

Created 10 months ago

Updated 4 months ago

GUI-Agents-Paper-List by OSU-NLP-Group

Paper list for GUI agents

huybery:

Created 1 year ago

Updated 1 month ago

UI-TARS by bytedance

Multimodal agent for GUI interaction in virtual worlds (research paper)

jiamings:

hugs:

jmorganca:

zhyncs:

Created 10 months ago

Updated 2 weeks ago

rStar by microsoft

Research paper repo for math reasoning in small LLMs via deep thinking

winglian:

tgaddair:

Created 1 year ago

Updated 2 months ago

FastVideo by hao-ai-lab

Framework for accelerated video generation

zhyncs:

luiscape:

Created 1 year ago

Updated 2 days ago

llm.c by karpathy

LLM training in pure C/CUDA, no PyTorch needed

norvig:

alexey-milovidov:

didierrlopes:

shizhediao:

Created 1 year ago

Updated 5 months ago

modded-nanogpt by KellerJordan

Language model training speedrun on 8x H100 GPUs

geohot:

karpathy:

zjasper666:

agajews:

Created 1 year ago

Updated 1 week ago

coconut by facebookresearch

Research paper implementation for LLM reasoning in latent space

winglian:

teknium1:

alexchen4ai:

Created 10 months ago

Updated 3 months ago

UFO by microsoft

Desktop AgentOS for automating Windows workflows via natural language

transitive-bullshit:

chiphuyen:

Created 1 year ago

Updated 2 weeks ago

audiocraft by facebookresearch

PyTorch library for audio processing and generation research

Jiayi-Pan:

jn2clark:

calvinfo:

jrk:

Created 2 years ago

Updated 8 months ago

Kiln by Kiln-AI

AI prototyping and dataset collaboration tool

chiphuyen:

danielhanchen:

Created 1 year ago

Updated 2 days ago

MiniMax-01 by MiniMax-AI

Large language & vision-language models based on linear attention

sxyu:

transitive-bullshit:

omarsar:

zhyncs:

Created 10 months ago

Updated 4 months ago

ReaLHF by openpsi-project

Efficient RLHF training system for LLMs using parameter reallocation

Created 1 year ago

Updated 7 months ago

grade-school-math by openai

Dataset for grade school math word problems

infwinston:

omarsar:

Edward-Sun:

Created 4 years ago

Updated 1 year ago

browser-use by browser-use

SDK for AI agent browser control

gregpr07:

shimmyshimmer:

khou22:

ekzhu:

Created 1 year ago

Updated 1 day ago

math-evaluation-harness by ZubinGou

Benchmarking toolkit for LLM mathematical reasoning

JustinLin610:

lewtun:

Created 1 year ago

Updated 1 year ago

PRIME by PRIME-RL

Scalable RL solution for advanced reasoning of language models

vincentweisser:

lewtun:

omarsar:

philschmid:

Created 11 months ago

Updated 8 months ago

Qwen2.5-Math by QwenLM

Math LLM for solving math problems in Chinese and English

omarsar:

JustinLin610:

huybery:

Created 1 year ago

Updated 10 months ago

libai by Oneflow-Inc

Large-scale distributed parallel training toolbox

omarsar:

shizhediao:

Created 4 years ago

Updated 4 months ago

OS-Agent-Survey by OS-Agent-Survey

Survey paper on OS Agents using MLLMs for computer, phone, and browser automation

Created 11 months ago

Updated 3 months ago

DeepSeek-V3 by deepseek-ai

MoE language model research paper with 671B total parameters

tobi:

shimmyshimmer:

jiamings:

syrusakbary:

Created 11 months ago

Updated 3 months ago

prm800k by openai

Dataset of LLM solutions to math problems with step-level correctness labels

Jiayi-Pan:

jph00:

pgarbacki:

transitive-bullshit:

Created 2 years ago

Updated 2 years ago

SwanLab by SwanHubX

AI training tracking and visualization tool

Created 2 years ago

Updated 1 week ago

Hermes-Function-Calling by NousResearch

Function-calling code for LLMs, demoing financial queries

ebursztein:

vincentweisser:

doriandarko:

teknium1:

Created 1 year ago

Updated 1 year ago

ZhiLight by zhihu

LLM inference engine for Llama and variants, optimized for PCIe GPUs

zhyncs:

Created 11 months ago

Updated 4 months ago

Infini-Megrez by infinigence

AI model for edge-side intelligence, optimized for speed

Created 1 year ago

Updated 1 month ago

stable-dreamfusion by ashawkey

Text-to-3D model using NeRF and diffusion

chiphuyen:

patrickvonplaten:

forresti:

torantulino:

Created 3 years ago

Updated 2 years ago

APOLLO by zhuhanqing

Memory-efficient optimizer for LLM training

Created 11 months ago

Updated 2 days ago

one-api by songquanpeng

LLM API management/redistribution system for OpenAI, Gemini, Claude, etc.

thinkall:

chiphuyen:

geekan:

Created 2 years ago

Updated 1 week ago

smol-course by huggingface

Practical course for aligning small language models

hammer:

zhiyuan8:

didierrlopes:

osanseviero:

Created 1 year ago

Updated 2 weeks ago

smollm by huggingface

Lightweight AI models for text and vision tasks

lvwerra:

lewtun:

hammer:

osanseviero:

Created 1 year ago

Updated 1 week ago

EasyRAG by BUAADreamer

RAG framework for network automation, CCF AIOps challenge solution

Created 1 year ago

Updated 1 year ago

Qwen3-Coder by QwenLM

Code LLM for code completion, generation, and assistant use cases

hugs:

victortaelin:

luiscape:

cournape:

Created 1 year ago

Updated 4 months ago

verl by volcengine

RL training library for LLMs

WoosukKwon:

hammer:

yiranwu0:

luiscape:

Created 1 year ago

Updated 1 day ago

openr by openreasoner

Open-source framework for advanced LLM reasoning

vincentweisser:

shizhediao:

Created 1 year ago

Updated 10 months ago

inference by xorbitsai

Model serving library for language, speech, and multimodal models

JustinLin610:

transitive-bullshit:

ggerganov:

Created 2 years ago

Updated 1 day ago

CUDATutorial by PaddleJitLab

CUDA tutorial for high-performance programming

Created 3 years ago

Updated 5 months ago

megablocks by databricks

Lightweight library for mixture-of-experts (MoE) training

mateiz:

jfrankle:

CodeCreator:

JohannesHa:

Created 2 years ago

Updated 5 months ago

bc-omni by westlake-baichuan-mllm

Open-source research paper for multimodal LLM

Created 1 year ago

Updated 10 months ago

O1-Journey by GAIR-NLP

Research paper on replicating O1 via "journey learning"

omarsar:

shizhediao:

divchenko:

huybery:

Created 1 year ago

Updated 10 months ago

Open-O1 by Open-Source-O1

AI model for matching OpenAI O1 capabilities with open-source alternatives

omarsar:

JustinLin610:

Created 1 year ago

Updated 1 year ago

AutoIF by QwenLM

Research paper for improving LLM instruction-following via self-play with execution feedback

winglian:

Created 1 year ago

Updated 1 year ago

onediff by siliconflow

Acceleration library for diffusion models

luiscape:

Created 3 years ago

Updated 6 months ago

ao by pytorch

PyTorch library for quantization and sparsity in training/inference

danielhanchen:

shimmyshimmer:

parano:

willccbb:

Created 2 years ago

Updated 23 hours ago

auto-round by intel

Quantization algorithm for LLMs and VLMs

winglian:

JustinLin610:

Created 1 year ago

Updated 2 days ago

mini-omni by gpt-omni

Open-source multimodal LLM for real-time speech interaction

osanseviero:

Created 1 year ago

Updated 1 year ago

VILA by NVlabs

Open-source VLMs for efficient video/multi-image understanding

ogabrielluiz:

shizhediao:

sxyu:

pgarbacki:

Created 1 year ago

Updated 3 days ago

Qwen3-VL by QwenLM

Multimodal LLM for vision-language tasks, document parsing, and agent functionality

wangshangsam:

transitive-bullshit:

gregpr07:

MagMueller:

Created 1 year ago

Updated 3 days ago

long-context-attention by feifeibear

Unified sequence parallel attention for long context LLM training/inference

vincentweisser:

winglian:

jiamings:

Created 1 year ago

Updated 1 month ago

Liger-Kernel by linkedin

Triton kernels for efficient LLM training

karpathy:

pgarbacki:

Jiayi-Pan:

WoosukKwon:

Created 1 year ago

Updated 2 days ago

cambrian by cambrian-mllm

Multimodal LLM research paper with vision-centric design

hammer:

infwinston:

rwightman:

rstojnic:

Created 1 year ago

Updated 3 weeks ago

MAP-NEO by multimodal-art-projection

Open-source LLM with pretraining data, pipeline, scripts, and alignment code

natolambert:

soldni:

Created 1 year ago

Updated 9 months ago

MobileLLM by facebookresearch

Sub-billion parameter LLM training code for on-device use

shizhediao:

luiscape:

omarsar:

winglian:

Created 1 year ago

Updated 7 months ago

m2 by HazyResearch

Sub-quadratic architecture research paper

pgarbacki:

jn2clark:

osanseviero:

albertfgu:

Created 2 years ago

Updated 11 months ago

LLM-workshop-2024 by rasbt

Coding workshop for understanding LLM implementation and usage

Created 1 year ago

Updated 10 months ago

Mooncake by kvcache-ai

Research paper on a disaggregated architecture for LLM serving

jiamings:

luiscape:

merrymercy:

WoosukKwon:

Created 1 year ago

Updated 2 days ago

cookbook by mistralai

Cookbook with examples using Mistral models

patrickvonplaten:

Created 1 year ago

Updated 1 week ago

DoRA by NVlabs

PyTorch code for weight-decomposed low-rank adaptation (DoRA)

chiphuyen:

Ying1123:

Created 1 year ago

Updated 1 year ago

LLM101n by karpathy

Educational resource for building a Storyteller AI LLM

vincentweisser:

robinjhuang:

jaredpalmer:

dsa:

Created 1 year ago

Updated 1 year ago

magpie by magpie-align

Synthetic data pipeline for LLM alignment (ICLR 2025 paper)

mlabonne:

Created 1 year ago

Updated 8 months ago

OpenRLHF by OpenRLHF

RLHF framework for scalable training of large language models

beyang:

parano:

vincentweisser:

binarybana:

Created 2 years ago

Updated 3 weeks ago

Index-1.9B by bilibili

Multilingual LLM for chat, translation, and role-playing

Created 1 year ago

Updated 3 months ago

LanguageBind by PKU-YuanGroup

Multimodal pretraining research paper using language-based semantic alignment

jn2clark:

Created 2 years ago

Updated 1 year ago

EasyContext by jzhang38

Recipes for language model context length extrapolation to 1M tokens

jiamings:

MishaLaskin:

pgarbacki:

mlabonne:

Created 1 year ago

Updated 1 year ago

MixEval by JinjieNi

Dynamic LLM evaluation suite for accurate, cost-effective benchmarking

mlabonne:

winglian:

Created 1 year ago

Updated 1 year ago

GLM-4 by zai-org

Open multilingual multimodal chat LMs for dialogue, reasoning, and rumination

Created 1 year ago

Updated 5 months ago

ChatTTS by 2noise

Generative speech model for daily dialogue

osanseviero:

thinkall:

peakji:

hugs:

Created 1 year ago

Updated 3 days ago

LangGPT by langgptai

Structured prompting framework for LLM prompt engineering

Created 2 years ago

Updated 1 day ago

MiniCPM-V by OpenBMB

MLLM for vision, speech, and multimodal live streaming on your phone

hammer:

luiscape:

chiphuyen:

ebursztein:

Created 1 year ago

Updated 2 months ago

RLHF-Reward-Modeling by RLHFlow

Recipes to train reward models for RLHF

osanseviero:

natolambert:

Created 1 year ago

Updated 7 months ago

HALOs by ContextualAI

Library for aligning LLMs using human-aware loss functions

winglian:

stas00:

Created 2 years ago

Updated 2 months ago

Yi-1.5 by 01-ai

Yi-1.5: upgraded open-source language model series

abidlabs:

Created 1 year ago

Updated 1 year ago

distilabel by argilla-io

Framework for synthetic data and AI feedback pipelines

lvwerra:

jn2clark:

pgarbacki:

jph00:

Created 2 years ago

Updated 6 days ago

ollama by ollama

CLI tool for running LLMs locally

tobi:

jmorganca:

domoritz:

ekzhu:

Created 2 years ago

Updated 1 day ago

InternVL by OpenGVLab

Open-source MLLM alternative to GPT-4o

hammer:

Created 2 years ago

Updated 2 months ago

MetaMath by meta-math

Math question generation for LLM training and evaluation

Created 2 years ago

Updated 1 year ago

GPTS-Prompt-Collection by B3o

Prompt collection for GPTS Store

Created 1 year ago

Updated 1 month ago

torchtitan by pytorch

PyTorch platform for generative AI model training research

karpathy:

pgarbacki:

lewtun:

zhuohan123:

Created 1 year ago

Updated 1 day ago

Llama3-Chinese-Chat by Shenzhi-Wang

Chinese chat model fine-tuned from Llama3-8B-Instruct

Created 1 year ago

Updated 1 year ago

llama3-chinese by seanzhang-zhichen

Large language model for Chinese language tasks

Created 1 year ago

Updated 1 year ago

InfiniTransformer by Beomi

PyTorch implementation of Infini-attention for efficient, infinite context Transformers

Created 1 year ago

Updated 1 year ago

llama3 by meta-llama

*Deprecated* minimal example for loading and running Llama 3 models

tobi:

mckaywrigley:

osanseviero:

simonw:

Created 1 year ago

Updated 10 months ago

LLMTest_NeedleInAHaystack by gkamradt

LLM testing tool for evaluating in-context retrieval accuracy

winglian:

huybery:

omarsar:

Edward-Sun:

Created 2 years ago

Updated 1 year ago

BAdam by Ledzy

Memory-efficient optimizer for large language model finetuning

winglian:

Created 1 year ago

Updated 8 months ago

ragas by vibrantlabsai

Toolkit for LLM application evaluation

gregpr07:

alexchen4ai:

nirga:

simonw:

Created 2 years ago

Updated 2 days ago

pyreft by stanfordnlp

Python library for representation finetuning (ReFT) of language models

jph00:

hammer:

winglian:

Created 1 year ago

Updated 9 months ago

ragflow by infiniflow

Open-source RAG engine for deep document understanding

tobi:

rodrigosnader:

dguido:

transitive-bullshit:

Created 2 years ago

Updated 2 days ago

orpo by xfactlab

Preference optimization without a reference model

lewtun:

winglian:

Created 1 year ago

Updated 1 year ago

hqq by dropbox

Model quantizer for fast, accurate post-training quantization, skipping calibration

zhyncs:

danielhanchen:

winglian:

osanseviero:

Created 2 years ago

Updated 1 month ago

ray by ray-project

AI compute engine for scaling Python and AI applications

beyang:

hsbt:

gregpr07:

eddyxu:

Created 9 years ago

Updated 1 day ago

torchtune by meta-pytorch

PyTorch library for LLM post-training and experimentation

zhyncs:

lewtun:

patrickvonplaten:

JustinLin610:

Created 2 years ago

Updated 6 days ago

veScale by volcengine

PyTorch-native framework for LLM training

JustinLin610:

casper-hansen:

stas00:

xiezhq-hermann:

Created 1 year ago

Updated 4 days ago

grok-1 by xai-org

JAX example code for loading and running Grok-1 open-weights model

geohot:

yiranwu0:

omarsar:

handotdev:

Created 1 year ago

Updated 1 year ago

LLM-Training-Puzzles by srush

Hands-on puzzles for large language model training

Jiayi-Pan:

albertfgu:

willccbb:

patrickvonplaten:

Created 2 years ago

Updated 1 year ago

Awesome-Efficient-LLM by horseee

Curated list for efficient LLMs

Created 2 years ago

Updated 5 months ago

fsdp_qlora by AnswerDotAI

Training script for LLMs using QLoRA + FSDP

hammer:

calvinfo:

chiphuyen:

mlabonne:

Created 1 year ago

Updated 1 year ago

streaming by mosaicml

Data streaming library for efficient neural network training

youkaichao:

jn2clark:

johnmullan:

tachim:

Created 3 years ago

Updated 1 month ago

GaLore by jiaweizzhao

Memory-efficient training for large language models via gradient low-rank projection

vincentweisser:

danielhanchen:

Created 1 year ago

Updated 1 year ago

openvino by openvinotoolkit

Open source toolkit for optimizing and deploying AI inference

mfuntowicz:

Created 7 years ago

Updated 1 day ago

SakuraLLM by SakuraLLM

Japanese-to-Chinese translation model for light novels/Galgame

Created 2 years ago

Updated 9 months ago

AQLM by Vahe1994

PyTorch code for LLM compression via Additive Quantization (AQLM)

lysandrejik:

mlabonne:

chiphuyen:

casper-hansen:

Created 1 year ago

Updated 3 months ago

llm-awq by mit-han-lab

Weight quantization research paper for LLM compression/acceleration

chiphuyen:

jph00:

lysandrejik:

hammer:

Created 2 years ago

Updated 4 months ago

relora by Guitaricet

PEFT pretraining code for ReLoRA research paper

winglian:

codekansas:

Created 2 years ago

Updated 1 year ago

gptq by IST-DASLab

Code for GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers

chiphuyen:

gakonst:

hammer:

codekansas:

Created 3 years ago

Updated 1 year ago

Long-Context-Data-Engineering by FranxYao

Research paper implementation for long-context data engineering

hammer:

Created 1 year ago

Updated 1 year ago

minbpe by karpathy

Minimal BPE encoder/decoder for LLM tokenization

hammer:

willingc:

chiphuyen:

shizhediao:

Created 1 year ago

Updated 1 year ago

code-act by xingyaoww

Research paper on executable code actions for LLM agents

transitive-bullshit:

vincentweisser:

hammer:

chiphuyen:

Created 1 year ago

Updated 1 year ago

SPIN by uclaml

Self-Play Fine-Tuning (SPIN) research paper implementation

CodeCreator:

lewtun:

pgarbacki:

natolambert:

Created 1 year ago

Updated 1 year ago

Qwen3 by QwenLM

Large language model series by Qwen team, Alibaba Cloud

ggerganov:

sxyu:

transitive-bullshit:

omarsar:

Created 1 year ago

Updated 1 month ago

Yi by 01-ai

Open-source bilingual LLMs trained from scratch

chiphuyen:

simonw:

yiranwu0:

pgarbacki:

Created 2 years ago

Updated 1 year ago

Machine-Mindset by PKU-YuanGroup

Research paper exploring LLMs through the lens of MBTI personality types

Created 2 years ago

Updated 1 year ago

sglang by sgl-project

Fast serving framework for LLMs and vision language models

shimmyshimmer:

beyang:

samlambert:

ebursztein:

Created 1 year ago

Updated 21 hours ago

functionary by MeetKai

Chat language model for tool use and result interpretation

ogabrielluiz:

parano:

jph00:

winglian:

Created 2 years ago

Updated 2 weeks ago

datatrove by huggingface

Data processing library for large-scale text data

lewtun:

shizhediao:

apsdehal:

mlabonne:

Created 2 years ago

Updated 5 days ago

nanotron by huggingface

Minimalistic library for large language model pretraining

clmnt:

shizhediao:

vincentweisser:

luiscape:

Created 2 years ago

Updated 1 week ago

infinity by michaelfeil

REST API for high-throughput, low-latency embedding and reranking

luiscape:

ishaan-jaff:

ankane:

zhyncs:

Created 2 years ago

Updated 6 days ago

DeepSeek-MoE by deepseek-ai

MoE language model for research purposes

omarsar:

mlabonne:

huybery:

Created 1 year ago

Updated 1 year ago

RAG-Survey by Tongji-KGLLM

RAG survey and knowledge base

omarsar:

deshraj:

Created 1 year ago

Updated 1 year ago

QAnything by netease-youdao

Question-answering system for local knowledge bases, supporting diverse file formats

chiphuyen:

Created 1 year ago

Updated 8 months ago

helm by stanford-crfm

Open-source Python framework for holistic evaluation of foundation models

teetone:

chiphuyen:

pgarbacki:

lantiga:

Created 4 years ago

Updated 1 week ago

neurips_llm_efficiency_challenge by llm-efficiency-challenge

Competition toolkit for efficient LLM fine-tuning on a single GPU

jph00:

rasbt:

artidoro:

Created 2 years ago

Updated 2 years ago

TensorRT-LLM by NVIDIA

LLM inference optimization SDK for NVIDIA GPUs

beyang:

hammer:

zhyncs:

shizhediao:

Created 2 years ago

Updated 22 hours ago

deita by hkust-nlp

Data-efficient instruction tuning for LLM alignment (ICLR 2024)

shizhediao:

winglian:

Created 2 years ago

Updated 11 months ago

ATLAS by VILA-Lab

Instruction benchmark for effective LLM queries and prompts

omarsar:

osanseviero:

Created 2 years ago

Updated 1 year ago

llama-moe by pjlab-sys4nlp

MoE model from LLaMA with continual pre-training

casper-hansen:

osanseviero:

Created 2 years ago

Updated 1 year ago

long-llms-learning by Strivin0311

Literature repository for long-context LLM methodologies

winglian:

Created 2 years ago

Updated 1 year ago

quip-sharp by Cornell-RelaxML

LLM quantization for extreme compression

mlabonne:

Created 2 years ago

Updated 1 year ago

DeepSpeed-MII by deepspeedai

Python library for high-throughput, low-latency, and cost-effective model inference

Ying1123:

merrymercy:

Sanger2000:

casper-hansen:

Created 3 years ago

Updated 5 months ago

H2O by FMInference

KV cache eviction research paper for efficient LLM inference

pgarbacki:

Ying1123:

Created 2 years ago

Updated 1 year ago

chroma by chroma-core

Open-source embedding database for building LLM apps with memory

tobi:

hugs:

kiwicopple:

azayarni:

Created 3 years ago

Updated 2 days ago

Qwen-Agent by QwenLM

Agent framework for LLM application development

ekzhu:

jph00:

ogabrielluiz:

thomwolf:

Created 2 years ago

Updated 2 months ago

llm-inference-benchmark by ninehills

LLM inference benchmark for comparing frameworks

Created 1 year ago

Updated 1 year ago

tensor_parallel by BlackSamorez

PyTorch module for multi-GPU model parallelism

winglian:

apsdehal:

borzunov:

Created 3 years ago

Updated 1 year ago

Data-Copilot by zwq2018

LLM-based system for autonomous data workflows

Created 2 years ago

Updated 1 year ago

URIAL by Re-Align

In-context learning (ICL) method for LLM alignment, no tuning required

Created 2 years ago

Updated 1 year ago

mamba by state-spaces

Mamba SSM architecture for sequence modeling

geohot:

alexchen4ai:

luiscape:

zhiyuan8:

Created 2 years ago

Updated 2 weeks ago

clip-interrogator by pharmapsychotic

Image-to-prompt tool for text-to-image models

jn2clark:

chuanli11:

osanseviero:

Edward-Sun:

Created 3 years ago

Updated 1 year ago

unicom by deepglint

Visual representation model for multimodal LLMs

Created 2 years ago

Updated 2 months ago

unsloth by unslothai

Finetuning tool for LLMs, targeting speed and memory efficiency

tobi:

karpathy:

alexchen4ai:

danielhanchen:

Created 2 years ago

Updated 23 hours ago

gpt-fast by meta-pytorch

PyTorch text generation for efficient transformer inference

karpathy:

antiagainst:

jamesr66a:

merrymercy:

Created 2 years ago

Updated 3 months ago

gpt_paper_assistant by tatsu-lab

ArXiv scanner using GPT-4 for personalized paper recommendations

rodrigosnader:

Edward-Sun:

Ying1123:

soldni:

Created 2 years ago

Updated 1 year ago

DeepSeek-LLM by deepseek-ai

Large language model for research/commercial use

vincentweisser:

Created 2 years ago

Updated 1 year ago

llm-course by mlabonne

LLM course with roadmaps and notebooks

shizhediao:

shimmyshimmer:

willccbb:

zhiyuan8:

Created 2 years ago

Updated 5 months ago

Yuan-2.0 by IEIT-Yuan

Large language model for research, fine-tuning, and deployment

Created 2 years ago

Updated 1 year ago

generative-ai-for-beginners by microsoft

Course for learning generative AI application development

osanseviero:

dguido:

chiphuyen:

vincentweisser:

Created 2 years ago

Updated 1 week ago

ML-Papers-Explained by dair-ai

ML papers explained: key concepts demystified

hammer:

vnivargi:

omarsar:

Created 2 years ago

Updated 5 months ago

ML-Papers-of-the-Week by dair-ai

Weekly ML papers, top picks

gregpr07:

vincentweisser:

ValentaTomas:

osanseviero:

Created 2 years ago

Updated 4 months ago

Awesome-Chinese-LLM by HqWu-HITCS

Chinese LLM collection for smaller, privatizable models with lower training costs

Created 2 years ago

Updated 6 months ago

MergeLM by yule-BUAA

Codebase for merging language models via parameter averaging

JohannesHa:

winglian:

Ying1123:

Created 2 years ago

Updated 1 year ago

video-subtitle-remover by YaoFANGUK

AI-powered tool for video subtitle and watermark removal

Created 2 years ago

Updated 5 months ago

chat-langchain by langchain-ai

Chatbot for question answering over LangChain documentation

winglian:

eugeneyan:

transitive-bullshit:

mckaywrigley:

Created 2 years ago

Updated 6 days ago

rag-demystified by pchunduri6

LLM-powered RAG pipeline for question answering, built from scratch

tobi:

jerryjliu:

Created 2 years ago

Updated 1 year ago

generative-ai by GoogleCloudPlatform

GenAI samples and notebooks for Google Cloud Vertex AI

anantb:

omarsar:

Created 2 years ago

Updated 3 days ago

data-juicer by datajuicer

Data-Juicer: Data processing system for foundation models

chiphuyen:

CodeCreator:

JustinLin610:

jph00:

Created 2 years ago

Updated 3 days ago

axolotl by axolotl-ai-cloud

CLI tool for streamlined post-training of AI models

tobi:

beyang:

zhyncs:

patrickvonplaten:

Created 2 years ago

Updated 1 day ago

LongMem by Victorwz

Research paper implementation for augmenting language models with long-term memory

Created 2 years ago

Updated 1 year ago

LongChat by DachengLi1

Long-context LLM chatbot training and evaluation framework

casper-hansen:

huybery:

pgarbacki:

Ying1123:

Created 2 years ago

Updated 1 year ago

self-instruct by yizhongw

Self-Instruct: Research paper for aligning language models with self-generated instructions

pgarbacki:

soldni:

transitive-bullshit:

lewtun:

Created 2 years ago

Updated 2 years ago

LLMLingua by microsoft

Prompt compression for accelerated LLM inference

quincylarson:

bryanhelmig:

osanseviero:

luiscape:

Created 2 years ago

Updated 1 month ago

alignment-handbook by huggingface

Handbook for aligning language models with human/AI preferences

eugeneyan:

drishanarora:

philschmid:

vincentweisser:

Created 2 years ago

Updated 2 months ago

FireAct by anchen1011

Language agent fine-tuning research paper

winglian:

Created 2 years ago

Updated 2 years ago

streaming-llm by mit-han-lab

Framework for efficient LLM streaming

gakonst:

chiphuyen:

ValentaTomas:

omarsar:

Created 2 years ago

Updated 1 year ago

NexusRaven by nexusflowai

Evaluation framework for function-calling LLM, NexusRaven-13B

simonw:

pgarbacki:

Created 2 years ago

Updated 2 years ago

LMOps by microsoft

AI research initiative for building AI products with foundation models

ishaan-jaff:

pgarbacki:

yiranwu0:

apsdehal:

Created 3 years ago

Updated 1 week ago

LongLoRA by dvlab-research

LongLoRA: Efficient fine-tuning for long-context LLMs

chiphuyen:

pgarbacki:

shizhediao:

gakonst:

Created 2 years ago

Updated 1 year ago

lm-evaluation-harness by EleutherAI

Framework for few-shot language model evaluation

aravindsrinivas:

zjasper666:

zhuohan123:

shizhediao:

Created 5 years ago

Updated 3 days ago

DreamLLM by RunpeiDong

Multimodal LLM framework for comprehension and creation

jiamings:

Created 2 years ago

Updated 1 year ago

Awesome-Embodied-Robotics-and-Agent by zchoi

Curated list for embodied AI/robotics research using VLMs & LLMs

Created 2 years ago

Updated 1 month ago

vits_chinese by PlayVoice

TTS best practice based on BERT and VITS

Created 4 years ago

Updated 1 year ago

LLM-Agent-Paper-List by WooooDyy

Paper list for LLM-based agents

xiezhq-hermann:

transitive-bullshit:

andreasjansson:

vincentweisser:

Created 2 years ago

Updated 2 months ago

calculate-flops.pytorch by MrYxJ

PyTorch tool to calculate FLOPs, MACs, and parameters for neural networks

Created 2 years ago

Updated 1 year ago

lagent by InternLM

Framework for building LLM-based agents

Created 2 years ago

Updated 3 months ago

Baichuan2 by baichuan-inc

LLM for research/commercial use (license required for some commercial use cases)

Created 2 years ago

Updated 1 year ago

yarn by jquesnelle

Context window extension method for LLMs (research paper, models)

vincentweisser:

chiphuyen:

hammer:

simonw:

Created 2 years ago

Updated 1 year ago

LLM-Agent-Survey by Paitesanshi

Survey paper on LLM-based autonomous agents

Created 2 years ago

Updated 9 months ago

codellama by meta-llama

Inference code for CodeLlama models

chiphuyen:

vincentweisser:

shizhediao:

JustinLin610:

Created 2 years ago

Updated 1 year ago

Lemur by OpenLemur

Open language model for language agents

osanseviero:

huybery:

jph00:

dzhulgakov:

Created 2 years ago

Updated 2 years ago

LLaMA-Factory by hiyouga

Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)

patrickvonplaten:

alexchen4ai:

teetone:

lysandrejik:

Created 2 years ago

Updated 1 day ago

llm-hallucination-survey by HillZhang1999

Survey of hallucination in LLMs

shizhediao:

Created 2 years ago

Updated 2 months ago

Zhongjing by SupritYoung

Chinese medical chatbot based on LLaMA, trained with RLHF

Created 2 years ago

Updated 1 year ago

ToolBench by OpenBMB

Open platform for LLM tool learning (ICLR'24 spotlight)

pgarbacki:

Ying1123:

chiphuyen:

andreasjansson:

Created 2 years ago

Updated 6 months ago

llm-attacks by llm-attacks

Attack framework for aligned LLMs, based on a research paper

ebursztein:

chiphuyen:

jph00:

merrymercy:

Created 2 years ago

Updated 1 year ago

LLM-Agents-Papers by AGI-Edgerunners

Paper list for LLM-based agents

ogabrielluiz:

vincentweisser:

Created 2 years ago

Updated 4 months ago

AgentBench by THUDM

Benchmark for evaluating LLMs as agents across diverse environments

omarsar:

vincentweisser:

chiphuyen:

victortaelin:

Created 2 years ago

Updated 1 week ago

ChatPLUG by X-PLUG

Chinese dialogue system for open-domain conversation and digital human applications

Created 2 years ago

Updated 2 years ago

Safety-Prompts by thu-coai

Chinese safety prompts for LLM evaluation/alignment

Created 2 years ago

Updated 1 year ago

CValues by X-PLUG

Chinese LLM value alignment research

Created 2 years ago

Updated 2 years ago

lorahub by sail-sg

Framework for efficient cross-task generalization via dynamic LoRA composition

pgarbacki:

huybery:

Created 2 years ago

Updated 1 year ago

XVERSE-13B by xverse-ai

Multilingual LLM for chat, knowledge QA, and code generation

Created 2 years ago

Updated 1 year ago

LightLLM by ModelTC

Python framework for LLM inference and serving

zhyncs:

chiphuyen:

pgarbacki:

JustinLin610:

Created 2 years ago

Updated 2 days ago

Qwen by QwenLM

Chat & pretrained LLM by Alibaba Cloud

spartee:

shimmyshimmer:

omarsar:

shizhediao:

Created 2 years ago

Updated 4 days ago

langui by LangbaseInc

Open-source UI components for generative AI projects

didierrlopes:

rodrigosnader:

marcklingen:

philschmid:

Created 2 years ago

Updated 1 year ago

Chinese-LLaMA-Alpaca-2 by ymcui

Chinese LLaMA/Alpaca-2: LLMs with long context for Chinese language

lysandrejik:

Created 2 years ago

Updated 4 months ago

llama by meta-llama

Inference code for Llama 2 models (deprecated)

froystig:

xiezhq-hermann:

fabhed:

borzunov:

Created 2 years ago

Updated 10 months ago

flash-attention by Dao-AILab

Fast, memory-efficient attention implementation

karpathy:

Jiayi-Pan:

zhiyuan8:

alexchen4ai:

Created 3 years ago

Updated 5 days ago

instruct-eval by declare-lab

Evaluation code for instruction-tuned LLMs

infwinston:

Created 2 years ago

Updated 1 year ago

ToolAlpaca by tangqiaoyu

Tool-learning framework for language models, research paper

pgarbacki:

Created 2 years ago

Updated 1 year ago

h2o-llmstudio by h2oai

LLM Studio: framework for LLM fine-tuning via GUI or CLI

ebursztein:

andyk:

winglian:

JustinLin610:

Created 2 years ago

Updated 2 months ago

FastEdit by hiyouga

Tool for fast edits to large language models

chiphuyen:

shizhediao:

Created 2 years ago

Updated 2 years ago

Baichuan-13B by baichuan-inc

LLM for both pretraining and chat

Created 2 years ago

Updated 2 years ago

UER-py by dbiir

PyTorch toolkit for pre-training and fine-tuning NLP models

huybery:

shizhediao:

hammer:

apsdehal:

Created 6 years ago

Updated 1 year ago

lmdeploy by InternLM

Toolkit for LLM compression, deployment, and serving

shimmyshimmer:

wsxiaoys:

luiscape:

jn2clark:

Created 2 years ago

Updated 1 day ago

InternLM by InternLM

LLM series (InternLM, InternLM2, InternLM2.5, InternLM3) official release

hammer:

simonw:

soldni:

jph00:

Created 2 years ago

Updated 1 month ago

ChatLaw by PKU-YuanGroup

LLM for Chinese legal applications, research paper

yiranwu0:

omarsar:

Created 2 years ago

Updated 11 months ago

direct-preference-optimization by eric-mitchell

Reference implementation for Direct Preference Optimization (DPO)

rasbt:

chiphuyen:

huybery:

hammer:

Created 2 years ago

Updated 1 year ago

server by triton-inference-server

AI model inference serving optimized for cloud and edge

hammer:

tjbck:

Edward-Sun:

jn2clark:

Created 7 years ago

Updated 3 days ago

GPTQ-for-LLaMa by qwopqwop200

4-bit quantization for LLaMA models using GPTQ

chiphuyen:

gakonst:

soldni:

jph00:

Created 2 years ago

Updated 1 year ago

AutoGPTQ by AutoGPTQ

LLM quantization package using GPTQ algorithm

vincentweisser:

youkaichao:

chiphuyen:

osanseviero:

Created 2 years ago

Updated 7 months ago

openchat by imoneoi

Open-source LLM fine-tuned with C-RLFT, inspired by offline reinforcement learning

vincentweisser:

philschmid:

chiphuyen:

transitive-bullshit:

Created 2 years ago

Updated 1 year ago

FastChat by lm-sys

Open platform for training, serving, and evaluating LLM-based chatbots

zjasper666:

aangelopoulos:

osanseviero:

natolambert:

Created 2 years ago

Updated 6 months ago

ChatGLM2-6B by zai-org

Bilingual chat LLM for research/commercial use (after registration)

huybery:

Created 2 years ago

Updated 1 year ago

vllm by vllm-project

LLM serving engine for high-throughput, memory-efficient inference

karpathy:

clmnt:

tobi:

danielhanchen:

Created 2 years ago

Updated 23 hours ago

BIG-bench by google

Collaborative benchmark for probing and extrapolating LLM capabilities

ShengjiaZhao:

chiphuyen:

nirga:

JustinLin610:

Created 4 years ago

Updated 1 year ago

GAOKAO-Bench by OpenLMLab

Evaluation framework for assessing LLMs using Chinese GAOKAO (college entrance exam) questions

Created 2 years ago

Updated 10 months ago

BayLing by ictnlp

Multilingual LLM for cross-lingual alignment and instruction following

Created 2 years ago

Updated 1 year ago

HugNLP by HugAILab

NLP library based on HuggingFace Transformers

Created 2 years ago

Updated 2 years ago

NBCE by bojone

Context extension technique for LLMs (research paper)

Created 2 years ago

Updated 11 months ago

Baichuan-7B by baichuan-inc

7B-parameter LLM for commercial use

osanseviero:

Created 2 years ago

Updated 1 year ago

MiniChain by srush

Tiny library for coding with large language models

soldni:

lysandrejik:

jph00:

ValentaTomas:

Created 2 years ago

Updated 1 year ago

WizardLM by nlpxucan

LLMs built using Evol-Instruct for complex instruction following

vincentweisser:

chiphuyen:

WoosukKwon:

ishaan-jaff:

Created 2 years ago

Updated 5 months ago

rome by kmeng01

Model editing research paper for GPT-2 and GPT-J

shizhediao:

evhub:

stellaathena:

Created 3 years ago

Updated 1 year ago

CodeGen by salesforce

Open-source model family for program synthesis

nat:

hammer:

omarsar:

mckaywrigley:

Created 3 years ago

Updated 1 month ago

uniem by wangyuxinwhy

Unified embedding model for Chinese text

Created 2 years ago

Updated 2 years ago

big-AGI by enricoros

AI suite for advanced AI/AGI functions, deployable on-prem or cloud

mudler:

chitalian:

swyxio:

Created 2 years ago

Updated 1 day ago

InternLM-techreport by InternLM

Multilingual LLM research paper with 104B parameters

Ying1123:

JustinLin610:

Created 2 years ago

Updated 2 years ago

FlagAI by FlagAI-Open

Toolkit for large-scale model training, fine-tuning, and deployment

omarsar:

JustinLin610:

Created 3 years ago

Updated 2 weeks ago

awesome-pretrained-chinese-nlp-models by lonePatient

Resource list: Chinese NLP pretrained models, LLMs, multimodal models

shizhediao:

huybery:

Created 6 years ago

Updated 3 weeks ago

ceval by hkust-nlp

Chinese eval suite for foundation models (NeurIPS 2023)

huybery:

Created 2 years ago

Updated 4 months ago

YuLan-Chat by RUC-GSAI

Open-source LLM for chat, instruction-following, and general language tasks

Created 2 years ago

Updated 10 months ago

langchain by langchain-ai

Framework for building LLM-powered applications

karpathy:

MagMueller:

gregpr07:

willingc:

Created 3 years ago

Updated 2 days ago

TigerBot by TigerResearch

LLM foundation for multi-language, multi-task applications

Created 2 years ago

Updated 11 months ago

GalTransl by GalTransl

Tool for visual novel translation using LLMs

Created 2 years ago

Updated 1 month ago

Sophia by Liuhong99

Optimizer for language model pre-training (research paper)

chiphuyen:

vincentweisser:

eiso:

jph00:

Created 2 years ago

Updated 1 year ago

Chain-of-ThoughtsPapers by Timothyxxx

List of research papers on chain-of-thought prompting

omarsar:

jasonwei20:

Created 3 years ago

Updated 2 years ago

document.ai by GanymedeNil

Local knowledge base solution using vector DB and GPT-3.5

omarsar:

shizhediao:

Created 2 years ago

Updated 2 years ago

LLM-ToolMaker by ctlllll

Research paper on LLMs creating their own tools

omarsar:

osanseviero:

huybery:

Created 2 years ago

Updated 2 years ago

pyllama by henrywoo

Hacked LLaMA version for single consumer-grade GPU inference

shizhediao:

Created 2 years ago

Updated 2 years ago

lawyer-llama by AndrewZhe

Chinese legal LLaMA for law knowledge and consultation

Created 2 years ago

Updated 1 year ago

HuatuoGPT by FreedomIntelligence

Medical LLM for doctor-patient consultation

Created 2 years ago

Updated 11 months ago

ml_timeline by osanseviero

Curated timeline of recent ML model releases, code, and papers

omarsar:

osanseviero:

philschmid:

Created 2 years ago

Updated 2 years ago

MeZO by princeton-nlp

Research paper implementation for memory-efficient LM fine-tuning

pgarbacki:

winglian:

huybery:

Created 2 years ago

Updated 1 year ago

KnowledgeEditingPapers by zjunlp

Curated list of must-read research papers on knowledge editing for LLMs

shyamal-anadkat:

Created 3 years ago

Updated 4 months ago

CPM-Bee by OpenBMB

Bilingual base model for research/commercial use

Created 2 years ago

Updated 2 years ago

qlora by artidoro

Finetuning tool for quantized LLMs

tobi:

chiphuyen:

vincentweisser:

jph00:

Created 2 years ago

Updated 1 year ago

LaWGPT by pengxiao-song

Chinese LLaMA tuned for legal use

osanseviero:

Created 2 years ago

Updated 1 year ago

ColossalAI by hpcaitech

AI system for large-scale parallel training

tobi:

hammer:

thinkall:

chiphuyen:

Created 4 years ago

Updated 6 days ago

BiLLa by Neutralzz

Bilingual LLaMA with enhanced reasoning ability

Created 2 years ago

Updated 2 years ago

MiniGPT-4 by Vision-CAIR

Vision-language model for multi-task learning

pgarbacki:

forresti:

jn2clark:

JustinLin610:

Created 2 years ago

Updated 1 year ago

Fengshenbang-LM by IDEA-CCNL

Chinese foundation model ecosystem for AI infrastructure

JustinLin610:

shizhediao:

Created 4 years ago

Updated 1 year ago

ChatWaifu_Mobile by Voine

Mobile app for AI character chat

Created 2 years ago

Updated 2 years ago

awesome-llm-human-preference-datasets by glgh

Curated list of human preference datasets for LLM training

Created 2 years ago

Updated 2 years ago

LLMsPracticalGuide by Mooler0410

Curated list of LLM practical guide resources (tree, examples, papers)

eugeneyan:

swyxio:

ogabrielluiz:

vincentweisser:

Created 2 years ago

Updated 1 year ago

auto-evaluator by rlancemartin

Evaluation tool for LLM QA chains

krrishdholakia:

osanseviero:

hammer:

transitive-bullshit:

Created 2 years ago

Updated 2 years ago

trl by huggingface

Library for transformer RL

jeffchuber:

vincentweisser:

tjbck:

alexchen4ai:

Created 5 years ago

Updated 2 days ago

whisper by openai

Speech recognition model for multilingual transcription/translation

bcherny:

karpathy:

ogabrielluiz:

zhyncs:

Created 3 years ago

Updated 2 months ago

UltraChat by thunlp

Multi-round dialogue dataset and models for chat language model training

jph00:

winglian:

teknium1:

Created 2 years ago

Updated 1 year ago

awesome-RLHF by opendilab

Curated list of RLHF resources for language model alignment

vincentweisser:

omarsar:

Created 2 years ago

Updated 2 months ago

MOSS by OpenMOSS

Open-source tool-augmented conversational language model

chiphuyen:

osanseviero:

Ying1123:

hammer:

Created 2 years ago

Updated 1 year ago

TencentPretrain by Tencent

PyTorch framework for multimodal pre-training and fine-tuning

Created 3 years ago

Updated 1 year ago

LMFlow by OptimalScale

Toolkit for finetuning and inference of large foundation models

tobi:

shizhediao:

ebursztein:

zhuohan123:

Created 2 years ago

Updated 2 days ago

Alpaca-CoT by PhoebusSi

Instruction fine-tuning (IFT) platform for instruction collection, parameter-efficient methods, and LLMs

chiphuyen:

JustinLin610:

pgarbacki:

omarsar:

Created 2 years ago

Updated 1 year ago

FindTheChatGPTer by chenking2020

Collection of ChatGPT open-source alternatives

shizhediao:

Created 2 years ago

Updated 2 years ago

RRHF by GanjinZero

Ranking-based method (RRHF) for aligning LLMs to human preferences

philschmid:

winglian:

Created 2 years ago

Updated 2 years ago

Instructgpt-prompts by kevinamiri

Instruction-following prompts for ChatGPT, GPT-3.5, GPT-4

infwinston:

Created 2 years ago

Updated 2 weeks ago

ChatGLM-Efficient-Tuning by hiyouga

Fine-tuning tool for ChatGLM-6B

Created 2 years ago

Updated 2 years ago

DeepSpeed by deepspeedai

Deep learning optimization library for distributed training and inference

aravindsrinivas:

ValentaTomas:

winglian:

stas00:

Created 5 years ago

Updated 4 days ago

InstructGLM by yanqiangmiffy

LoRA tuning script for ChatGLM-6B

Created 2 years ago

Updated 2 years ago

ChatGLM-finetune-LoRA by lich99

LoRA finetuning code for ChatGLM-6B

winglian:

Created 2 years ago

Updated 2 years ago

ChatGLM-LLaMA-chinese-insturct by 27182812

Fine-tuning exploration for ChatGLM, LLaMA on Chinese instruction data

Created 2 years ago

Updated 2 years ago

chat-dataset-baseline by hikariming

Fine-tuned chat model and dataset for Chinese dialogue

Created 2 years ago

Updated 7 months ago

nlp_chinese_corpus by brightmart

Chinese NLP corpus for pre-training and language model tasks

shizhediao:

JustinLin610:

Created 6 years ago

Updated 2 months ago

stanford_alpaca by tatsu-lab

Instruction-following LLaMA model training and data generation

karpathy:

john-b-yang:

pgarbacki:

osanseviero:

Created 2 years ago

Updated 1 year ago

alpaca-lora by tloen

LoRA fine-tuning for LLaMA

JustinLin610:

vincentweisser:

nirga:

chiphuyen:

Created 2 years ago

Updated 1 year ago

Chinese-LLaMA-Alpaca by ymcui

Chinese LLaMA & Alpaca: LLMs for Chinese NLP research

ggerganov:

Created 2 years ago

Updated 4 months ago

GPT-4-LLM by Instruction-Tuning-with-GPT-4

GPT-4 data for instruction-tuning LLMs via supervised/RL

jph00:

gakonst:

marcklingen:

teknium1:

Created 2 years ago

Updated 2 years ago

memit by kmeng01

Transformer memory mass-editor (ICLR 2023 research paper)

ericciarla:

bobvanluijt:

winglian:

Created 3 years ago

Updated 1 year ago

baize-chatbot by project-baize

Chat model trained via LoRA, using ChatGPT-generated dialogs

winglian:

pgarbacki:

Ying1123:

tunguz:

Created 2 years ago

Updated 1 year ago

Linly by CVI-SZU

Chinese LLMs and datasets for pretraining/finetuning

Created 2 years ago

Updated 1 year ago

peft by huggingface

Parameter-efficient fine-tuning (PEFT) library

tobi:

gakonst:

chiphuyen:

Ying1123:

Created 3 years ago

Updated 1 week ago

LoRA by microsoft

PyTorch library for low-rank adaptation (LoRA) of LLMs

chiphuyen:

patrickvonplaten:

hammer:

zhiyuan8:

Created 4 years ago

Updated 11 months ago

gpt_academic by binary-husky

LLM tool for paper reading/polishing/writing, optimized UI

yiranwu0:

soldni:

JustinLin610:

osanseviero:

Created 2 years ago

Updated 2 months ago

BELLE by LianjiaTech

Chinese LLM engine for democratized access and instruction tuning

JustinLin610:

omarsar:

Created 2 years ago

Updated 1 year ago

TaskMatrix by chenfei-wu

Visual ChatGPT connects LLMs to visual foundation models

chiphuyen:

taranjeet:

parano:

torantulino:

Created 2 years ago

Updated 1 year ago

chatgpt_please_improve_my_paper_writing by ashawkey

Thin wrapper for academic paper refinement

Created 2 years ago

Updated 2 years ago

mend by eric-mitchell

Fast model editing for LLMs

hammer:

Created 4 years ago

Updated 2 years ago

adapters by adapter-hub

Unified library for parameter-efficient transfer learning in NLP

Ying1123:

chiphuyen:

ogabrielluiz:

osanseviero:

Created 5 years ago

Updated 1 month ago

ContinualLM by UIC-Liu-Lab

PyTorch framework for continual learning of language models

Created 2 years ago

Updated 1 year ago

ConSERT by yym6472

Research paper code for contrastive self-supervised sentence representation transfer

Created 4 years ago

Updated 4 years ago

datasets by huggingface

Access and process large AI datasets efficiently

clmnt:

chiphuyen:

transitive-bullshit:

gakonst:

Created 5 years ago

Updated 3 days ago

aim by aimhubio

Experiment tracker for AI model training runs

amin3141:

patrick-kidger:

transitive-bullshit:

yang-song:

Created 6 years ago

Updated 1 day ago

pytorch3d by facebookresearch

PyTorch3D is a PyTorch library for 3D deep learning research

aravindsrinivas:

chiphuyen:

gkioxari:

codekansas:

Created 6 years ago

Updated 2 days ago

vit-pytorch by lucidrains

PyTorch library for Vision Transformer variants and related techniques

karpathy:

lucidrains:

chiphuyen:

forresti:

Created 5 years ago

Updated 3 days ago

CLIP_prefix_caption by rmokady

Image captioning model using CLIP embeddings as a prefix

chiphuyen:

collin-burns:

Created 4 years ago

Updated 1 year ago

NL-Augmenter by GEM-benchmark

Framework for natural language dataset augmentation

hammer:

rodrigosnader:

omarsar:

Edward-Sun:

Created 4 years ago

Updated 1 year ago

pytorch-image-models by huggingface

PyTorch image model collection with training, eval, and inference scripts

clmnt:

karpathy:

osanseviero:

Jiayi-Pan:

Created 6 years ago

Updated 3 days ago

vision_transformer by google-research

Vision Transformer and MLP-Mixer models in JAX/Flax

aravindsrinivas:

gakonst:

jn2clark:

merrymercy:

Created 5 years ago

Updated 9 months ago

graph4nlp by graph4ai

SDK for graph neural networks in NLP

huybery:

Created 5 years ago

Updated 1 year ago

TextAttack by QData

Python framework for NLP adversarial attacks, data augmentation, and model training

chiphuyen:

ankane:

lewtun:

eugeneyan:

Created 6 years ago

Updated 4 months ago

Text_Classification by kk7nc

Survey paper for text classification algorithms

mlabonne:

Created 7 years ago

Updated 8 months ago

robustbench by RobustBench

Standardized benchmark for adversarial robustness research

Created 5 years ago

Updated 8 months ago

CLIP by openai

Image-text matching model for zero-shot prediction

aravindsrinivas:

transitive-bullshit:

Edward-Sun:

mholt:

Created 5 years ago

Updated 1 year ago

CVPR2025-Papers-with-Code by amusi

Curated list of CVPR 2025 papers with code

yiranwu0:

huybery:

Created 5 years ago

Updated 5 months ago

TAADpapers by thunlp

Curated list of must-read papers on textual adversarial attack and defense

Created 6 years ago

Updated 5 months ago

flax by google

NN library for JAX, designed for flexibility in neural network research

charliermarsh:

codekansas:

Jiayi-Pan:

jaredpalmer:

Created 5 years ago

Updated 3 days ago

Real-Time-Voice-Cloning by CorentinJ

Voice cloning for real-time speech generation

willingc:

tjbck:

claforte:

nirga:

Created 6 years ago

Updated 2 months ago

Chinese-ELECTRA by ymcui

Chinese ELECTRA pre-trained language models

Created 5 years ago

Updated 4 months ago

electra by google-research

Text encoder pre-training via GAN-like discriminator

hammer:

codekansas:

lysandrejik:

forresti:

Created 5 years ago

Updated 1 year ago

apex by NVIDIA

PyTorch extension for streamlined mixed precision & distributed training

aravindsrinivas:

JohannesHa:

spartee:

tgaddair:

Created 7 years ago

Updated 2 days ago

unilm by microsoft

Foundation models for language, vision, speech, and multimodal tasks

jph00:

AlexCheema:

chiphuyen:

osanseviero:

Created 6 years ago

Updated 5 months ago

ICLR2019-OpenReviewData by shaohua0116

Data and visualizations for ICLR 2019 OpenReview submissions

aravindsrinivas:

Created 7 years ago

Updated 6 years ago

ICLR2020-OpenReviewData by shaohua0116

Data crawler for ICLR OpenReview webpages

chenlin9:

Created 6 years ago

Updated 6 years ago

universal-triggers by Eric-Wallace

NLP attack/analysis research paper (EMNLP 2019)

omarsar:

zhuohan123:

thomwolf:

Created 6 years ago

Updated 1 year ago

text-to-text-transfer-transformer by google-research

Unified text-to-text transformer for NLP research

aravindsrinivas:

chiphuyen:

pgarbacki:

patrickvonplaten:

Created 6 years ago

Updated 3 weeks ago

transferlearning by jindongwang

Resource list for transfer learning research and development

rstojnic:

huybery:

parano:

youkaichao:

Created 8 years ago

Updated 9 months ago

naacl_transfer_learning_tutorial by huggingface

NLP transfer learning tutorial code

clmnt:

julien-c:

hammer:

soumith:

Created 6 years ago

Updated 6 years ago

RAdam by LiyuanLucasLiu

Optimizer for neural network training, addressing adaptive learning rate variance

danielhanchen:

millionintegrals:

rwightman:

jwyang:

Created 6 years ago

Updated 4 years ago

Pytorch-UNet by milesial

PyTorch implementation for image semantic segmentation

aangelopoulos:

chenlin9:

Created 8 years ago

Updated 1 year ago

higgsfield by higgsfield-ai

ML framework for large model training and GPU orchestration

aravindsrinivas:

luiscape:

pgarbacki:

chiphuyen:

Created 7 years ago

Updated 1 year ago

DQN_pytorch by dxyang

PyTorch implementations of DQN variants

Created 8 years ago

Updated 7 years ago

transformers by huggingface

ML library for pretrained model inference and training

clmnt:

lilianweng:

karpathy:

tjbck:

Created 7 years ago

Updated 1 day ago

bertviz by jessevig

Interactive tool for visualizing attention in Transformer language models

clmnt:

chiphuyen:

vincentweisser:

borzunov:

Created 7 years ago

Updated 6 months ago

ABSA-PyTorch by songyouwei

PyTorch implementations for aspect-based sentiment analysis

Created 7 years ago

Updated 2 years ago

deeplearning-papernotes by dennybritz

Deep learning paper notes and summaries

gakonst:

jinze1994:

ilblackdragon:

nottombrown:

Created 10 years ago

Updated 7 years ago

bert by google-research

TensorFlow code and pre-trained models for BERT

aravindsrinivas:

pgarbacki:

jn2clark:

evhub:

Created 7 years ago

Updated 1 year ago

nmt by tensorflow

Build state-of-the-art Neural Machine Translation systems

antiagainst:

philschmid:

parano:

jasonwei20:

Created 8 years ago

Updated 3 years ago

tensorflow by tensorflow

Open-source ML framework

norvig:

aravindsrinivas:

karpathy:

bcherny:

Created 10 years ago

Updated 1 day ago
