Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Yaowei Zheng
Yaowei Zheng
Author of LLaMA-Factory
GitHub
X
Authored Projects (4)
Starred
by
Tony Lee
(Author of HELM; Research Engineer at Meta)
,
Lysandre Debut
(Chief Open-Source Officer at Hugging Face)
,
Gregor Zunic
(Cofounder of Browser Use)
,
Calvin French-Owen
(Cofounder of Segment),
and
23 more.
LLaMA-Factory
by
hiyouga
0.6%
60k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Fine-tune LLaMA, Mistral, Qwen, Gemma, etc., via CLI/Web UI.
Supports pre-training, SFT, reward modeling, PPO, DPO, KTO, ORPO.
Offers LoRA, QLoRA, GaLore, BAdam, FlashAttention-2, Unsloth, and more.
Enables multi-turn dialogue, tool use, image/video/audio understanding.
Created 2 years ago
Updated 1 day ago
Starred
by
Shizhe Diao
(Author of LMFlow; Research Scientist at NVIDIA)
and
Alex Chen
(Cofounder of Nexa AI)
.
EasyR1
by
hiyouga
1.3%
4k
RL training framework for multi-modality models
Supports Llama3, Qwen, DeepSeek-R1 language models; Qwen2-VL vision language models.
Implements GRPO, Reinforce++, ReMax, RLOO algorithms.
Enables padding-free training, checkpoint resuming, and Wandb/SwanLab/MLflow tracking.
Uses vLLM's SPMD mode for efficient, scalable training.
Created 7 months ago
Updated 1 day ago
ChatGLM-Efficient-Tuning
by
hiyouga
0.1%
4k
Fine-tuning tool for ChatGLM-6B
PEFT (LoRA, P-Tuning V2, Freeze) for efficient adaptation.
Supports full parameter fine-tuning, quantization (4/8-bit).
Includes RLHF training, reward modeling, and evaluation scripts.
Web UI, API, and CLI demos for interaction.
Created 2 years ago
Updated 2 years ago
Starred
by
Chip Huyen
(Author of "AI Engineering", "Designing Machine Learning Systems")
and
Shizhe Diao
(Author of LMFlow; Research Scientist at NVIDIA)
.
FastEdit
by
hiyouga
0%
1k
Tool for fast edits to large language models
Injects fresh knowledge into LLMs using Rank-One Model Editing (ROME).
Supports models like LLaMA, Falcon, Baichuan, InternLM, and GPT-J.
Edits models in FP16, with reported times around ~10 seconds.
Created 2 years ago
Updated 2 years ago
Starred Projects (403)
SkyRL
by
NovaSky-AI
5.3%
1k
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
Starred by
+11
Created 5 months ago
Updated 1 day ago
gem
by
axon-rl
19.3%
305
Agentic LLM training environment for interactive reinforcement learning
Starred by
Created 4 months ago
Updated 2 days ago
tunix
by
google
5.2%
2k
JAX-native library for efficient LLM post-training
Starred by
Created 6 months ago
Updated 21 hours ago
tilelang
by
tile-ai
9.7%
4k
DSL for high-performance GPU/CPU kernel development (GEMM, attention, etc.)
Starred by
+1
Created 1 year ago
Updated 22 hours ago
codex
by
openai
2.3%
47k
Coding agent CLI tool for terminal-based chat-driven development
Starred by
+30
Created 6 months ago
Updated 20 hours ago
atropos
by
NousResearch
1.7%
713
RL environment framework for LLM trajectory collection/evaluation
Starred by
+1
Created 5 months ago
Updated 1 day ago
checkpoint-engine
by
MoonshotAI
1.8%
767
Middleware for efficient LLM weight updates during inference
Starred by
+3
Created 1 month ago
Updated 20 hours ago
LESS
by
princeton-nlp
0%
496
Data selection research paper for targeted instruction tuning
Starred by
Created 1 year ago
Updated 1 year ago
DataFlow
by
OpenDCAI
0.9%
1k
Data preparation and LLM training system
Created 1 year ago
Updated 20 hours ago
trae-agent
by
bytedance
0.6%
10k
LLM-powered CLI for software engineering tasks
Starred by
+1
Created 4 months ago
Updated 3 weeks ago
VeOmni
by
ByteDance-Seed
1.6%
1k
Framework for scaling multimodal model training across accelerators
Starred by
Created 6 months ago
Updated 22 hours ago
llama.cpp
by
ggml-org
0.5%
88k
C/C++ library for local LLM inference
Starred by
+51
Created 2 years ago
Updated 1 day ago
DFT
by
yongliang-wu
0.6%
467
Improving SFT generalization with reward rectification
Starred by
Created 2 months ago
Updated 2 weeks ago
harmony
by
openai
0.5%
4k
Renderer for OpenAI's harmony response format
Starred by
+9
Created 2 months ago
Updated 2 months ago
gpt-oss-recipes
by
huggingface
0.7%
456
OpenAI GPT-OSS model optimization and fine-tuning
Starred by
Created 2 months ago
Updated 1 month ago
gpt-oss
by
openai
0.5%
19k
Open-weight LLMs for reasoning and agents
Starred by
+15
Created 3 months ago
Updated 1 week ago
ARPO
by
RUC-NLPIR
2.3%
649
Agentic RL for LLM tool use
Created 2 months ago
Updated 3 weeks ago
gemini-cli
by
google-gemini
1.2%
79k
AI agent for terminal workflows
Starred by
+27
Created 6 months ago
Updated 21 hours ago
qwen-code
by
QwenLM
2.5%
14k
AI coding agent for complex codebases
Starred by
+1
Created 3 months ago
Updated 21 hours ago
higgs-audio
by
boson-ai
0.4%
7k
Expressive text-to-audio generation model
Starred by
Created 2 months ago
Updated 4 weeks ago
Show-o
by
showlab
0.8%
2k
Unified transformer research paper for multimodal tasks
Created 1 year ago
Updated 3 days ago
Kimi-K2
by
MoonshotAI
0.4%
8k
State-of-the-art MoE language model
Starred by
+4
Created 3 months ago
Updated 1 month ago
Skywork-R1V
by
SkyworkAI
0.1%
3k
Multimodal model for advanced visual/text reasoning, using chain-of-thought
Starred by
Created 7 months ago
Updated 2 months ago
12-factor-agents
by
humanlayer
1.3%
16k
Principles for reliable LLM application development
Starred by
+4
Created 6 months ago
Updated 3 weeks ago
GraphGen
by
open-sciencelab
2.4%
389
Framework for LLM fine-tuning with knowledge-driven synthetic data
Created 9 months ago
Updated 20 hours ago
GLM-V
by
zai-org
1.1%
2k
Multimodal reasoning model with a "thinking" paradigm
Created 3 months ago
Updated 3 weeks ago
POLARIS
by
ChenxinAn-fdu
2.2%
610
Scaling RL for advanced reasoning models
Created 3 months ago
Updated 2 months ago
slime
by
THUDM
3.5%
2k
LLM post-training framework for RL scaling
Starred by
+2
Created 3 months ago
Updated 1 day ago
python-sdk
by
modelcontextprotocol
1.1%
19k
Python SDK for Model Context Protocol (MCP) servers/clients
Starred by
+4
Created 1 year ago
Updated 22 hours ago
flash-linear-attention
by
fla-org
1.5%
3k
Efficient Torch/Triton implementations for linear attention models
Starred by
+8
Created 1 year ago
Updated 2 days ago
gemini-fullstack-langgraph-quickstart
by
google-gemini
0.4%
17k
Full-stack agent quickstart
Starred by
+1
Created 4 months ago
Updated 1 month ago
DeepEyes
by
Visual-Agent
1.6%
862
Agentic RL training framework
Created 7 months ago
Updated 1 month ago
fastmcp
by
jlowin
1.6%
19k
Pythonic SDK for building Model Context Protocol (MCP) servers/clients
Starred by
+5
Created 10 months ago
Updated 1 day ago
langfun
by
google
0.2%
862
Library for object-oriented LLM prompting
Starred by
Created 2 years ago
Updated 6 days ago
GPTQModel
by
ModelCloud
1.4%
827
LLM compression toolkit for accelerated CPU/GPU inference
Starred by
Created 1 year ago
Updated 21 hours ago
WeClone
by
xming521
0.2%
16k
Digital twin one-stop solution
Starred by
Created 1 year ago
Updated 3 days ago
arena-hard-auto
by
lmarena
0.5%
940
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 1 year ago
Updated 3 months ago
vllm-ascend
by
vllm-project
1.7%
1k
Hardware plugin for vLLM on Ascend NPU
Created 8 months ago
Updated 21 hours ago
deer-flow
by
bytedance
0.9%
17k
Deep research framework combining language models with specialized tools
Starred by
+3
Created 5 months ago
Updated 2 days ago
aci
by
aipotheosis-labs
0.3%
5k
Open-source infra for AI-agent tool use
Starred by
Created 1 year ago
Updated 2 weeks ago
FramePack
by
lllyasviel
0.3%
16k
Desktop software for video generation via next-frame prediction
Starred by
Created 6 months ago
Updated 3 months ago
Seed-Thinking-v1.5
by
ByteDance-Seed
0%
816
Reasoning model for STEM, coding, and general tasks
Starred by
Created 6 months ago
Updated 4 months ago
AWorld
by
inclusionAI
3.3%
881
Multi-agent runtime for self-improvement
Created 7 months ago
Updated 20 hours ago
Tina
by
shangshang-wang
2.0%
296
LoRA reasoning models
Created 6 months ago
Updated 3 weeks ago
markitdown
by
microsoft
1.0%
81k
Python tool for converting files to Markdown for LLM text analysis
Starred by
+17
Created 11 months ago
Updated 1 month ago
MagiAttention
by
SandAI-org
0.6%
532
Distributed attention mechanism research paper for ultra-long context, heterogeneous data training
Starred by
Created 5 months ago
Updated 21 hours ago
cooragent
by
LeapLabTHU
0.2%
2k
AI agent collaboration community for building agents and workflows
Created 6 months ago
Updated 1 month ago
llm-foundry
by
mosaicml
0.1%
4k
LLM training code for Databricks foundation models
Starred by
+14
Created 2 years ago
Updated 2 days ago
Kimi-VL
by
MoonshotAI
0.4%
1k
Vision-language model for multimodal reasoning and agent tasks
Created 6 months ago
Updated 3 months ago
prismatic-vlms
by
TRI-ML
1.0%
817
VLM codebase for training visually-conditioned language models
Starred by
Created 1 year ago
Updated 1 year ago
UI-TARS-desktop
by
bytedance
0.4%
19k
GUI agent app for computer control via natural language
Starred by
Created 8 months ago
Updated 22 hours ago
DAPO
by
BytedTsinghua-SIA
0.9%
2k
Open-source RL system for large-scale LLM training
Starred by
Created 7 months ago
Updated 5 months ago
ml-cross-entropy
by
apple
0%
529
PyTorch module for memory-efficient cross-entropy in LLMs
Starred by
+1
Created 11 months ago
Updated 3 weeks ago
Light-R1
by
Qihoo360
0.3%
745
Math model research paper using curriculum SFT, DPO, and RL
Created 7 months ago
Updated 1 month ago
easy-dataset
by
ConardLi
0.8%
11k
Dataset tool for LLM fine-tuning
Created 7 months ago
Updated 2 weeks ago
GamingAgent
by
lmgame-org
0.5%
778
SDK for LLM/VLM gaming agents, enabling model evaluation via games
Starred by
Created 7 months ago
Updated 1 month ago
Gymnasium
by
Farama-Foundation
0.6%
10k
Python API standard for single-agent reinforcement learning environments
Starred by
+3
Created 3 years ago
Updated 1 week ago
Awesome-LLM-Post-training
by
mbzuai-oryx
0.6%
2k
Curated list of LLM post-training resources
Created 8 months ago
Updated 1 day ago
Search-R1
by
PeterGriffinJin
1.6%
3k
RL framework for training LLMs to use search engines
Starred by
+2
Created 7 months ago
Updated 1 week ago
Wan2.1
by
Wan-Video
0.6%
14k
Video foundation model for text-to-video, image-to-video, and video editing
Starred by
Created 7 months ago
Updated 2 months ago
FlashMLA
by
deepseek-ai
0.2%
12k
Efficient CUDA kernels for MLA decoding
Starred by
+5
Created 7 months ago
Updated 2 weeks ago
VLM-R1
by
om-ai-lab
0.3%
6k
VLM for visual understanding via reinforced VLMs
Created 8 months ago
Updated 1 month ago
Open-Reasoner-Zero
by
Open-Reasoner-Zero
0.2%
2k
Open-source RL training for scalable reasoning on base models
Created 7 months ago
Updated 4 months ago
open-infra-index
by
deepseek-ai
0.0%
8k
AI infrastructure tools for efficient AGI development
Starred by
+15
Created 7 months ago
Updated 5 months ago
Awesome-ML-SYS-Tutorial
by
zhaochenyang20
1.4%
4k
ML SYS learning notes and code
Starred by
Created 11 months ago
Updated 1 week ago
demystify-long-cot
by
eddycmu
0.6%
321
Research code for long chain-of-thought reasoning in LLMs
Starred by
Created 8 months ago
Updated 4 months ago
rllm
by
rllm-org
1.1%
4k
Framework for post-training language agents via reinforcement learning
Starred by
+2
Created 8 months ago
Updated 22 hours ago
Logic-RL
by
Unakar
0.1%
2k
LLM reasoning via rule-based reinforcement learning, research paper
Created 8 months ago
Updated 6 months ago
s1
by
simplescaling
0.1%
7k
Test-time scaling recipe for strong reasoning performance
Starred by
+8
Created 8 months ago
Updated 3 months ago
open-thoughts
by
open-thoughts
0.2%
2k
Open dataset for training reasoning models
Starred by
+1
Created 8 months ago
Updated 1 month ago
oumi
by
oumi-ai
0.3%
9k
Open-source platform for end-to-end foundation model lifecycle
Starred by
+1
Created 1 year ago
Updated 22 hours ago
curator
by
bespokelabsai
0.8%
2k
Synthetic data curation tool for post-training and structured data extraction
Starred by
Created 11 months ago
Updated 2 months ago
TinyZero
by
Jiayi-Pan
0.2%
12k
Minimal reproduction of DeepSeek R1 Zero for countdown/multiplication tasks
Starred by
+8
Created 8 months ago
Updated 5 months ago
DeepSeek-R1
by
deepseek-ai
0.1%
91k
Reasoning models research paper
Starred by
+16
Created 8 months ago
Updated 3 months ago
simpleRL-reason
by
hkust-nlp
0.4%
4k
RL recipe for reasoning ability in models
Starred by
+1
Created 8 months ago
Updated 2 months ago
open-r1
by
huggingface
0.1%
26k
SDK for reproducing DeepSeek-R1
Starred by
+17
Created 8 months ago
Updated 1 month ago
Math-Verify
by
huggingface
1.1%
966
Math evaluator for LLM outputs in mathematical tasks
Starred by
Created 9 months ago
Updated 3 months ago
SkyThought
by
NovaSky-AI
0.1%
3k
Training recipes for Sky-T1 family of models
Starred by
+4
Created 9 months ago
Updated 3 months ago
GUI-Agents-Paper-List
by
OSU-NLP-Group
1.1%
523
Paper list for GUI agents
Starred by
Created 11 months ago
Updated 4 days ago
UI-TARS
by
bytedance
0.7%
8k
Multimodal agent for GUI interaction in virtual worlds (research paper)
Starred by
+3
Created 8 months ago
Updated 4 days ago
rStar
by
microsoft
1.5%
1k
Research paper repo for math reasoning in small LLMs via deep thinking
Starred by
Created 1 year ago
Updated 1 month ago
FastVideo
by
hao-ai-lab
1.6%
2k
Framework for accelerated video generation
Starred by
Created 11 months ago
Updated 1 day ago
llm.c
by
karpathy
0.2%
28k
LLM training in pure C/CUDA, no PyTorch needed
Starred by
+26
Created 1 year ago
Updated 3 months ago
modded-nanogpt
by
KellerJordan
1.4%
3k
Language model training speedrun on 8x H100 GPUs
Starred by
+6
Created 1 year ago
Updated 2 months ago
coconut
by
facebookresearch
0.5%
1k
Research paper implementation for LLM reasoning in latent space
Starred by
Created 9 months ago
Updated 2 months ago
UFO
by
microsoft
0.2%
8k
Desktop AgentOS for automating Windows workflows via natural language
Starred by
Created 1 year ago
Updated 1 month ago
audiocraft
by
facebookresearch
0.1%
23k
PyTorch library for audio processing and generation research
Starred by
+15
Created 2 years ago
Updated 7 months ago
Kiln
by
Kiln-AI
0.9%
4k
AI prototyping and dataset collaboration tool
Starred by
Created 1 year ago
Updated 20 hours ago
MiniMax-01
by
MiniMax-AI
1.1%
3k
Large language & vision-language models based on linear attention
Starred by
+2
Created 9 months ago
Updated 3 months ago
ReaLHF
by
openpsi-project
0.3%
320
Efficient RLHF training system for LLMs using parameter reallocation
Created 1 year ago
Updated 5 months ago
grade-school-math
by
openai
0.2%
1k
Dataset for grade school math word problems
Starred by
Created 4 years ago
Updated 1 year ago
browser-use
by
browser-use
0.5%
71k
SDK for AI agent browser control
Starred by
+28
Created 11 months ago
Updated 22 hours ago
math-evaluation-harness
by
ZubinGou
0.4%
258
Benchmarking toolkit for LLM mathematical reasoning
Starred by
Created 1 year ago
Updated 1 year ago
PRIME
by
PRIME-RL
0.3%
2k
Scalable RL solution for advanced reasoning of language models
Starred by
+3
Created 9 months ago
Updated 7 months ago
Qwen2.5-Math
by
QwenLM
0.2%
1k
Math LLM for solving math problems in Chinese and English
Starred by
Created 1 year ago
Updated 9 months ago
libai
by
Oneflow-Inc
0%
407
Large-scale distributed parallel training toolbox
Starred by
Created 4 years ago
Updated 2 months ago
OS-Agent-Survey
by
OS-Agent-Survey
0%
354
Survey paper on OS Agents using MLLMs for computer, phone, and browser automation
Created 10 months ago
Updated 1 month ago
DeepSeek-V3
by
deepseek-ai
0.1%
100k
MoE language model research paper with 671B total parameters
Starred by
+13
Created 9 months ago
Updated 1 month ago
prm800k
by
openai
0.1%
2k
Dataset of LLM solutions to math problems with step-level correctness labels
Starred by
+4
Created 2 years ago
Updated 2 years ago
SwanLab
by
SwanHubX
2.2%
3k
AI training tracking and visualization tool
Created 1 year ago
Updated 2 days ago
Hermes-Function-Calling
by
NousResearch
0.4%
1k
Function-calling code for LLMs, demoing financial queries
Starred by
Created 1 year ago
Updated 1 year ago
ZhiLight
by
zhihu
0%
900
LLM inference engine for Llama and variants, optimized for PCIe GPUs
Starred by
Created 10 months ago
Updated 3 months ago
Infini-Megrez
by
infinigence
0.3%
337
AI model for edge-side intelligence, optimized for speed
Created 1 year ago
Updated 4 days ago
stable-dreamfusion
by
ashawkey
0.1%
9k
Text-to-3D model using NeRF and diffusion
Starred by
+10
Created 3 years ago
Updated 1 year ago
APOLLO
by
zhuhanqing
0.4%
257
Memory-efficient optimizer for LLM training
Created 10 months ago
Updated 5 months ago
one-api
by
songquanpeng
0.4%
28k
LLM API management/redistribution system for OpenAI, Gemini, Claude, etc
Starred by
Created 2 years ago
Updated 2 months ago
smol-course
by
huggingface
0.3%
6k
Practical course for aligning small language models
Starred by
+5
Created 10 months ago
Updated 1 week ago
smollm
by
huggingface
0.3%
3k
Lightweight AI models for text and vision tasks
Starred by
+7
Created 11 months ago
Updated 4 weeks ago
EasyRAG
by
BUAADreamer
0.2%
583
RAG framework for network automation, CCF AIOps challenge solution
Created 1 year ago
Updated 11 months ago
Qwen3-Coder
by
QwenLM
0.7%
14k
Code LLM for code completion, generation, and assistant use cases
Starred by
+8
Created 1 year ago
Updated 2 months ago
verl
by
volcengine
1.5%
14k
RL training library for LLMs
Starred by
+13
Created 11 months ago
Updated 1 day ago
openr
by
openreasoner
0.1%
2k
Open-source framework for advanced LLM reasoning
Starred by
Created 1 year ago
Updated 9 months ago
inference
by
xorbitsai
0.3%
9k
Model serving library for language, speech, and multimodal models
Starred by
Created 2 years ago
Updated 1 day ago
CUDATutorial
by
PaddleJitLab
0.3%
751
CUDA tutorial for high-performance programming
Created 3 years ago
Updated 3 months ago
megablocks
by
databricks
0.2%
1k
Lightweight library for mixture-of-experts (MoE) training
Starred by
+15
Created 2 years ago
Updated 3 months ago
bc-omni
by
westlake-baichuan-mllm
0%
269
Open-source research paper for multimodal LLM
Created 1 year ago
Updated 8 months ago
O1-Journey
by
GAIR-NLP
0.1%
2k
Research paper on replicating O1 via "journey learning"
Starred by
+1
Created 1 year ago
Updated 9 months ago
Open-O1
by
Open-Source-O1
0%
1k
AI model for matching OpenAI O1 capabilities with open-source alternatives
Starred by
Created 1 year ago
Updated 10 months ago
AutoIF
by
QwenLM
0.7%
308
Research paper for improving LLM instruction-following via self-play with execution feedback
Starred by
Created 1 year ago
Updated 1 year ago
onediff
by
siliconflow
0.1%
2k
Acceleration library for diffusion models
Starred by
Created 3 years ago
Updated 5 months ago
ao
by
pytorch
0.7%
2k
PyTorch library for quantization and sparsity in training/inference
Starred by
+10
Created 1 year ago
Updated 1 day ago
auto-round
by
intel
2.0%
660
Quantization algorithm for LLMs and VLMs
Starred by
Created 1 year ago
Updated 20 hours ago
mini-omni
by
gpt-omni
0.1%
3k
Open-source multimodal LLM for real-time speech interaction
Starred by
Created 1 year ago
Updated 11 months ago
VILA
by
NVlabs
0.4%
4k
Open-source VLMs for efficient video/multi-image understanding
Starred by
+1
Created 1 year ago
Updated 2 months ago
Qwen3-VL
by
QwenLM
3.7%
14k
Multimodal LLM for vision-language tasks, document parsing, and agent functionality
Starred by
+5
Created 1 year ago
Updated 21 hours ago
long-context-attention
by
feifeibear
0.3%
574
Unified sequence parallel attention for long context LLM training/inference
Starred by
Created 1 year ago
Updated 1 day ago
Liger-Kernel
by
linkedin
0.5%
6k
Triton kernels for efficient LLM training
Starred by
+8
Created 1 year ago
Updated 23 hours ago
cambrian
by
cambrian-mllm
0.1%
2k
Multimodal LLM research paper with vision-centric design
Starred by
+2
Created 1 year ago
Updated 11 months ago
MAP-NEO
by
multimodal-art-projection
0.2%
964
Open-source LLM with pretraining data, pipeline, scripts, and alignment code
Starred by
Created 1 year ago
Updated 8 months ago
MobileLLM
by
facebookresearch
0.1%
1k
Sub-billion parameter LLM training code for on-device use
Starred by
+2
Created 1 year ago
Updated 5 months ago
m2
by
HazyResearch
0%
560
Sub-quadratic architecture research paper
Starred by
+1
Created 2 years ago
Updated 9 months ago
LLM-workshop-2024
by
rasbt
0%
1k
Coding workshop for understanding LLM implementation and usage
Created 1 year ago
Updated 9 months ago
Mooncake
by
kvcache-ai
0.8%
4k
Research paper on a disaggregated architecture for LLM serving
Starred by
+2
Created 1 year ago
Updated 20 hours ago
cookbook
by
mistralai
0.3%
2k
Cookbook with examples using Mistral models
Starred by
Created 1 year ago
Updated 2 weeks ago
DoRA
by
NVlabs
0.2%
866
PyTorch code for weight-decomposed low-rank adaptation (DoRA)
Starred by
Created 1 year ago
Updated 1 year ago
LLM101n
by
karpathy
0.4%
35k
Educational resource for building a Storyteller AI LLM
Starred by
+15
Created 1 year ago
Updated 1 year ago
magpie
by
magpie-align
0.6%
780
Synthetic data pipeline for LLM alignment (ICLR 2025 paper)
Starred by
Created 1 year ago
Updated 7 months ago
OpenRLHF
by
OpenRLHF
1.0%
8k
RLHF framework for scalable training of large language models
Starred by
+8
Created 2 years ago
Updated 6 days ago
Index-1.9B
by
bilibili
0.1%
998
Multilingual LLM for chat, translation, and role-playing
Created 1 year ago
Updated 2 months ago
LanguageBind
by
PKU-YuanGroup
0.4%
833
Multimodal pretraining research paper using language-based semantic alignment
Starred by
Created 2 years ago
Updated 1 year ago
EasyContext
by
jzhang38
0.1%
747
Recipes for language model context length extrapolation to 1M tokens
Starred by
+2
Created 1 year ago
Updated 1 year ago
MixEval
by
JinjieNi
0%
250
Dynamic LLM evaluation suite for accurate, cost-effective benchmarking
Starred by
Created 1 year ago
Updated 11 months ago
GLM-4
by
zai-org
0.2%
7k
Open multilingual multimodal chat LMs for dialogue, reasoning, and rumination
Created 1 year ago
Updated 3 months ago
ChatTTS
by
2noise
0.1%
38k
Generative speech model for daily dialogue
Starred by
Created 1 year ago
Updated 3 months ago
LangGPT
by
langgptai
1.7%
11k
Structured prompting framework for LLM prompt engineering
Created 2 years ago
Updated 2 days ago
MiniCPM-V
by
OpenBMB
0.2%
22k
MLLM for vision, speech, and multimodal live streaming on your phone
Starred by
+7
Created 1 year ago
Updated 2 weeks ago
RLHF-Reward-Modeling
by
RLHFlow
0.2%
1k
Recipes to train reward models for RLHF
Starred by
Created 1 year ago
Updated 5 months ago
HALOs
by
ContextualAI
0.1%
889
Library for aligning LLMs using human-aware loss functions
Starred by
Created 1 year ago
Updated 2 weeks ago
Yi-1.5
by
01-ai
0%
556
Yi-1.5: upgraded open-source language model series
Starred by
Created 1 year ago
Updated 11 months ago
distilabel
by
argilla-io
0.2%
3k
Framework for synthetic data and AI feedback pipelines
Starred by
+12
Created 2 years ago
Updated 1 day ago
ollama
by
ollama
0.3%
154k
CLI tool for running LLMs locally
Starred by
+45
Created 2 years ago
Updated 21 hours ago
InternVL
by
OpenGVLab
0.4%
9k
Open-source MLLM alternative to GPT-4o
Starred by
Created 1 year ago
Updated 3 weeks ago
MetaMath
by
meta-math
0%
445
Math question generation for LLM training and evaluation
Created 2 years ago
Updated 1 year ago
GPTS-Prompt-Collection
by
B3o
0.3%
2k
Prompt collection for GPTS Store
Created 1 year ago
Updated 4 months ago
torchtitan
by
pytorch
0.6%
5k
PyTorch platform for generative AI model training research
Starred by
+11
Created 1 year ago
Updated 21 hours ago
Llama3-Chinese-Chat
by
Shenzhi-Wang
0%
323
Chinese chat model fine-tuned from Llama3-8B-Instruct
Created 1 year ago
Updated 1 year ago
llama3-chinese
by
seanzhang-zhichen
0%
296
Large language model for Chinese language tasks
Created 1 year ago
Updated 1 year ago
InfiniTransformer
by
Beomi
0%
369
PyTorch implementation of Infini-attention for efficient, infinite context Transformers
Created 1 year ago
Updated 1 year ago
llama3
by
meta-llama
0.1%
29k
*Deprecated* minimal example for loading and running Llama 3 models
Starred by
+13
Created 1 year ago
Updated 8 months ago
LLMTest_NeedleInAHaystack
by
gkamradt
0.3%
2k
LLM testing tool for evaluating in-context retrieval accuracy
Starred by
+3
Created 1 year ago
Updated 1 year ago
BAdam
by
Ledzy
0%
272
Memory-efficient optimizer for large language model finetuning
Starred by
Created 1 year ago
Updated 7 months ago
ragas
by
explodinggradients
0.7%
11k
Toolkit for LLM application evaluation
Starred by
+12
Created 2 years ago
Updated 1 day ago
pyreft
by
stanfordnlp
0.1%
2k
Python library for representation finetuning (ReFT) of language models
Starred by
Created 1 year ago
Updated 8 months ago
ragflow
by
infiniflow
0.5%
66k
Open-source RAG engine for deep document understanding
Starred by
+6
Created 1 year ago
Updated 20 hours ago
orpo
by
xfactlab
0%
463
Preference optimization without a reference model
Starred by
Created 1 year ago
Updated 1 year ago
hqq
by
mobiusml
0.5%
883
Model quantizer for fast, accurate post-training quantization, skipping calibration
Starred by
Created 1 year ago
Updated 1 month ago
ray
by
ray-project
0.2%
39k
AI compute engine for scaling Python and AI applications
Starred by
+52
Created 9 years ago
Updated 21 hours ago
torchtune
by
meta-pytorch
0.2%
6k
PyTorch library for LLM post-training and experimentation
Starred by
+12
Created 2 years ago
Updated 1 day ago
veScale
by
volcengine
0.2%
874
PyTorch-native framework for LLM training
Starred by
+1
Created 1 year ago
Updated 1 month ago
grok-1
by
xai-org
0.0%
51k
JAX example code for loading and running Grok-1 open-weights model
Starred by
+22
Created 1 year ago
Updated 1 year ago
LLM-Training-Puzzles
by
srush
0.1%
1k
Hands-on puzzles for large language model training
Starred by
+8
Created 2 years ago
Updated 1 year ago
Awesome-Efficient-LLM
by
horseee
0.1%
2k
Curated list for efficient LLMs
Created 2 years ago
Updated 4 months ago
fsdp_qlora
by
AnswerDotAI
0%
2k
Training script for LLMs using QLoRA + FSDP
Starred by
+3
Created 1 year ago
Updated 11 months ago
streaming
by
mosaicml
0.2%
1k
Data streaming library for efficient neural network training
Starred by
+4
Created 3 years ago
Updated 2 weeks ago
GaLore
by
jiaweizzhao
0%
2k
Memory-efficient training for large language models via gradient low-rank projection
Starred by
Created 1 year ago
Updated 11 months ago
openvino
by
openvinotoolkit
0.5%
9k
Open source toolkit for optimizing and deploying AI inference
Starred by
Created 7 years ago
Updated 21 hours ago
SakuraLLM
by
SakuraLLM
0.5%
4k
Japanese-to-Chinese translation model for light novels/Galgame
Created 2 years ago
Updated 8 months ago
AQLM
by
Vahe1994
0%
1k
PyTorch code for LLM compression via Additive Quantization (AQLM)
Starred by
+2
Created 1 year ago
Updated 2 months ago
llm-awq
by
mit-han-lab
0.4%
3k
Weight quantization research paper for LLM compression/acceleration
Starred by
+4
Created 2 years ago
Updated 2 months ago
relora
by
Guitaricet
0.2%
465
PEFT pretraining code for ReLoRA research paper
Starred by
Created 2 years ago
Updated 1 year ago
gptq
by
IST-DASLab
0.2%
2k
Code for GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers
Starred by
+3
Created 3 years ago
Updated 1 year ago
Long-Context-Data-Engineering
by
FranxYao
0.4%
476
Research paper implementation for long-context data engineering
Starred by
Created 1 year ago
Updated 1 year ago
minbpe
by
karpathy
0.1%
10k
Minimal BPE encoder/decoder for LLM tokenization
Starred by
+11
Created 1 year ago
Updated 1 year ago
code-act
by
xingyaoww
0.4%
1k
Research paper on executable code actions for LLM agents
Starred by
Created 1 year ago
Updated 1 year ago
SPIN
by
uclaml
0.2%
1k
Self-Play Fine-Tuning (SPIN) research paper implementation
Starred by
Created 1 year ago
Updated 1 year ago
Qwen3
by
QwenLM
0.3%
25k
Large language model series by Qwen team, Alibaba Cloud
Starred by
+11
Created 1 year ago
Updated 2 days ago
Yi
by
01-ai
0.0%
8k
Open-source bilingual LLMs trained from scratch
Starred by
+7
Created 1 year ago
Updated 10 months ago
Machine-Mindset
by
PKU-YuanGroup
0.6%
504
Research paper exploring LLMs through the lens of MBTI personality types
Created 1 year ago
Updated 1 year ago
sglang
by
sgl-project
0.9%
19k
Fast serving framework for LLMs and vision language models
Starred by
+32
Created 1 year ago
Updated 20 hours ago
functionary
by
MeetKai
0.2%
2k
Chat language model for tool use and result interpretation
Starred by
+2
Created 2 years ago
Updated 4 weeks ago
datatrove
by
huggingface
0.3%
3k
Data processing library for large-scale text data
Starred by
+9
Created 2 years ago
Updated 6 days ago
nanotron
by
huggingface
0.3%
2k
Minimalistic library for large language model pretraining
Starred by
+11
Created 2 years ago
Updated 1 month ago
infinity
by
michaelfeil
0.6%
2k
REST API for high-throughput, low-latency embedding and reranking
Starred by
+8
Created 2 years ago
Updated 1 week ago
DeepSeek-MoE
by
deepseek-ai
0.3%
2k
MoE language model for research purposes
Starred by
Created 1 year ago
Updated 1 year ago
RAG-Survey
by
Tongji-KGLLM
0.1%
2k
RAG survey and knowledge base
Starred by
Created 1 year ago
Updated 1 year ago
QAnything
by
netease-youdao
0.1%
14k
Anything Q&A system for local knowledge bases, supporting diverse file formats
Starred by
Created 1 year ago
Updated 6 months ago
helm
by
stanford-crfm
0.4%
3k
Open-source Python framework for holistic evaluation of foundation models
Starred by
+10
Created 3 years ago
Updated 1 day ago
neurips_llm_efficiency_challenge
by
llm-efficiency-challenge
0%
256
Competition toolkit for efficient LLM inference on a single GPU
Starred by
Created 2 years ago
Updated 2 years ago
TensorRT-LLM
by
NVIDIA
0.4%
12k
LLM inference optimization SDK for NVIDIA GPUs
Starred by
+17
Created 2 years ago
Updated 20 hours ago
deita
by
hkust-nlp
0%
571
Data-efficient instruction tuning for LLM alignment (ICLR 2024)
Starred by
Created 2 years ago
Updated 10 months ago
ATLAS
by
VILA-Lab
0%
971
Instruction benchmark for effective LLM queries and prompts
Starred by
Created 1 year ago
Updated 1 year ago
llama-moe
by
pjlab-sys4nlp
0.1%
991
MoE model from LLaMA with continual pre-training
Starred by
Created 2 years ago
Updated 10 months ago
long-llms-learning
by
Strivin0311
0%
268
Literature repository for long-context LLM methodologies
Starred by
Created 1 year ago
Updated 1 year ago
quip-sharp
by
Cornell-RelaxML
0.2%
559
LLM quantization for extreme compression
Starred by
Created 1 year ago
Updated 11 months ago
DeepSpeed-MII
by
deepspeedai
0.1%
2k
Python library for high-throughput, low-latency, and cost-effective model inference
Starred by
+5
Created 3 years ago
Updated 3 months ago
H2O
by
FMInference
0.4%
480
KV cache eviction research paper for efficient LLM inference
Starred by
Created 2 years ago
Updated 1 year ago
chroma
by
chroma-core
0.5%
24k
Open-source embedding database for building LLM apps with memory
Starred by
+31
Created 3 years ago
Updated 1 day ago
Qwen-Agent
by
QwenLM
0.9%
12k
Agent framework for LLM application development
Starred by
+5
Created 2 years ago
Updated 2 weeks ago
llm-inference-benchmark
by
ninehills
0.2%
427
LLM inference benchmark for comparing frameworks
Created 1 year ago
Updated 1 year ago
tensor_parallel
by
BlackSamorez
0%
656
PyTorch module for multi-GPU model parallelism
Starred by
Created 3 years ago
Updated 1 year ago
Data-Copilot
by
zwq2018
0.1%
2k
LLM-based system for autonomous data workflows
Created 2 years ago
Updated 1 year ago
URIAL
by
Re-Align
0%
312
ICL method for LLM alignment, no tuning required
Created 1 year ago
Updated 1 year ago
mamba
by
state-spaces
0.5%
16k
Mamba SSM architecture for sequence modeling
Starred by
+22
Created 1 year ago
Updated 5 days ago
clip-interrogator
by
pharmapsychotic
0.1%
3k
Image-to-prompt tool for text-to-image models
Starred by
+3
Created 3 years ago
Updated 1 year ago
unicom
by
deepglint
0.1%
698
Visual representation model for multimodal LLMs
Created 2 years ago
Updated 2 weeks ago
unsloth
by
unslothai
0.5%
47k
Finetuning tool for LLMs, targeting speed and memory efficiency
Starred by
+36
Created 1 year ago
Updated 1 day ago
gpt-fast
by
meta-pytorch
0.2%
6k
PyTorch text generation for efficient transformer inference
Starred by
+20
Created 2 years ago
Updated 1 month ago
gpt_paper_assistant
by
tatsu-lab
0%
535
ArXiv scanner using GPT-4 for personalized paper recommendations
Starred by
Created 1 year ago
Updated 1 year ago
DeepSeek-LLM
by
deepseek-ai
0.1%
7k
Large language model for research/commercial use
Starred by
Created 1 year ago
Updated 1 year ago
llm-course
by
mlabonne
0.6%
65k
LLM course with roadmaps and notebooks
Starred by
+14
Created 2 years ago
Updated 4 months ago
Yuan-2.0
by
IEIT-Yuan
0%
688
Large language model for research, fine-tuning, and deployment
Created 1 year ago
Updated 1 year ago
generative-ai-for-beginners
by
microsoft
0.3%
100k
Course for learning generative AI application development
Starred by
+1
Created 2 years ago
Updated 1 day ago
ML-Papers-Explained
by
dair-ai
0.1%
8k
ML papers explained: key concepts demystified
Starred by
Created 2 years ago
Updated 3 months ago
ML-Papers-of-the-Week
by
dair-ai
0.2%
12k
Weekly ML papers, top picks
Starred by
+2
Created 2 years ago
Updated 2 months ago
Awesome-Chinese-LLM
by
HqWu-HITCS
0.3%
21k
Chinese LLM collection for smaller, privatizable models with lower training costs
Created 2 years ago
Updated 4 months ago
MergeLM
by
yule-BUAA
0.1%
850
Codebase for merging language models via parameter averaging
Starred by
Created 1 year ago
Updated 1 year ago
video-subtitle-remover
by
YaoFANGUK
1.1%
8k
AI-powered tool for video subtitle and watermark removal
Created 2 years ago
Updated 3 months ago
chat-langchain
by
langchain-ai
0.1%
6k
Chatbot for question answering over LangChain documentation
Starred by
+3
Created 2 years ago
Updated 1 day ago
rag-demystified
by
pchunduri6
0.2%
854
LLM-powered RAG pipeline for question answering, built from scratch
Starred by
Created 2 years ago
Updated 1 year ago
generative-ai
by
GoogleCloudPlatform
0.4%
12k
GenAI samples and notebooks for Google Cloud Vertex AI
Starred by
Created 2 years ago
Updated 1 day ago
data-juicer
by
modelscope
0.8%
5k
Data-Juicer: Data processing system for foundation models
Starred by
+2
Created 2 years ago
Updated 1 day ago
axolotl
by
axolotl-ai-cloud
0.5%
11k
CLI tool for streamlined post-training of AI models
Starred by
+25
Created 2 years ago
Updated 1 day ago
LongMem
by
Victorwz
0%
808
Research paper implementation for augmenting language models with long-term memory
Created 2 years ago
Updated 1 year ago
LongChat
by
DachengLi1
0%
532
Long-context LLM chatbot training and evaluation framework
Starred by
+2
Created 2 years ago
Updated 1 year ago
self-instruct
by
yizhongw
0.2%
4k
Self-Instruct: Research paper for aligning language models with self-generated instructions
Starred by
+3
Created 2 years ago
Updated 2 years ago
LLMLingua
by
microsoft
0.3%
5k
Prompt compression for accelerated LLM inference
Starred by
+1
Created 2 years ago
Updated 7 months ago
alignment-handbook
by
huggingface
0.1%
5k
Handbook for aligning language models with human/AI preferences
Starred by
+11
Created 2 years ago
Updated 1 month ago
FireAct
by
anchen1011
0%
280
Language agent fine-tuning research paper
Starred by
Created 2 years ago
Updated 2 years ago
streaming-llm
by
mit-han-lab
0.1%
7k
Framework for efficient LLM streaming
Starred by
+2
Created 2 years ago
Updated 1 year ago
NexusRaven
by
nexusflowai
0%
316
Evaluation framework for function-calling LLM, NexusRaven-13B
Starred by
Created 2 years ago
Updated 2 years ago
LMOps
by
microsoft
0.1%
4k
AI research initiative for building AI products with foundation models
Starred by
+7
Created 2 years ago
Updated 3 months ago
LongLoRA
by
dvlab-research
0.0%
3k
LongLoRA: Efficient fine-tuning for long-context LLMs
Starred by
+1
Created 2 years ago
Updated 1 year ago
lm-evaluation-harness
by
EleutherAI
0.5%
10k
Framework for few-shot language model evaluation
Starred by
+18
Created 5 years ago
Updated 3 days ago
DreamLLM
by
RunpeiDong
0%
459
Multimodal LLM framework for comprehension and creation
Starred by
Created 2 years ago
Updated 10 months ago
Awesome-Embodied-Robotics-and-Agent
by
zchoi
0.5%
2k
Curated list for embodied AI/robotics research using VLMs & LLMs
Created 2 years ago
Updated 3 weeks ago
vits_chinese
by
PlayVoice
0%
1k
TTS best practice based on BERT and VITS
Created 4 years ago
Updated 1 year ago
LLM-Agent-Paper-List
by
WooooDyy
0.1%
8k
Paper list for LLM-based agents
Starred by
+2
Created 2 years ago
Updated 1 month ago
calculate-flops.pytorch
by
MrYxJ
0.1%
881
PyTorch tool to calculate FLOPs, MACs, and parameters for neural networks
Created 2 years ago
Updated 1 year ago
lagent
by
InternLM
0.1%
2k
Framework for building LLM-based agents
Created 2 years ago
Updated 2 months ago
Baichuan2
by
baichuan-inc
0.0%
4k
LLM for research/commercial use (license required for some commercial use cases)
Created 2 years ago
Updated 11 months ago
yarn
by
jquesnelle
0.3%
2k
Context window extension method for LLMs (research paper, models)
Starred by
+4
Created 2 years ago
Updated 1 year ago
LLM-Agent-Survey
by
Paitesanshi
0.1%
3k
Survey paper on LLM-based autonomous agents
Created 2 years ago
Updated 7 months ago
codellama
by
meta-llama
0%
16k
Inference code for CodeLlama models
Starred by
+12
Created 2 years ago
Updated 1 year ago
Lemur
by
OpenLemur
0%
555
Open language model for language agents
Starred by
Created 2 years ago
Updated 1 year ago
LLaMA-Factory
by
hiyouga
0.6%
60k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Starred by
+23
Created 2 years ago
Updated 1 day ago
llm-hallucination-survey
by
HillZhang1999
0.5%
1k
Survey of hallucination in LLMs
Starred by
Created 2 years ago
Updated 2 weeks ago
Zhongjing
by
SupritYoung
0%
378
Chinese medical chatbot based on LLaMa, trained with RLHF
Created 2 years ago
Updated 1 year ago
ToolBench
by
OpenBMB
0.3%
5k
Open platform for LLM tool learning (ICLR'24 spotlight)
Starred by
+6
Created 2 years ago
Updated 4 months ago
llm-attacks
by
llm-attacks
0.3%
4k
Attack framework for aligned LLMs, based on a research paper
Starred by
+3
Created 2 years ago
Updated 1 year ago
LLM-Agents-Papers
by
AGI-Edgerunners
0.6%
2k
Paper list for LLM-based agents
Starred by
Created 2 years ago
Updated 3 months ago
AgentBench
by
THUDM
0.7%
3k
Benchmark for evaluating LLMs as agents across diverse environments
Starred by
+6
Created 2 years ago
Updated 5 days ago
ChatPLUG
by
X-PLUG
0%
324
Chinese dialogue system for open-domain conversation and digital human applications
Created 2 years ago
Updated 2 years ago
Safety-Prompts
by
thu-coai
0.3%
1k
Chinese safety prompts for LLM evaluation/alignment
Created 2 years ago
Updated 1 year ago
CValues
by
X-PLUG
0%
536
Chinese LLM value alignment research
Created 2 years ago
Updated 2 years ago
lorahub
by
sail-sg
0.5%
654
Framework for efficient cross-task generalization via dynamic LoRA composition
Starred by
Created 2 years ago
Updated 1 year ago
XVERSE-13B
by
xverse-ai
0%
645
Multilingual LLM for chat, knowledge QA, and code generation
Created 2 years ago
Updated 1 year ago
LightLLM
by
ModelTC
0.5%
4k
Python framework for LLM inference and serving
Starred by
+6
Created 2 years ago
Updated 22 hours ago
Qwen
by
QwenLM
0.3%
19k
Chat & pretrained LLM by Alibaba Cloud
Starred by
+12
Created 2 years ago
Updated 2 weeks ago
langui
by
LangbaseInc
0.0%
3k
Open-source UI components for generative AI projects
Starred by
Created 2 years ago
Updated 1 year ago
Chinese-LLaMA-Alpaca-2
by
ymcui
0.1%
7k
Chinese LLaMA/Alpaca-2: LLMs with long context for Chinese language
Starred by
Created 2 years ago
Updated 3 months ago
llama
by
meta-llama
0.1%
59k
Inference code for Llama 2 models (deprecated)
Starred by
+38
Created 2 years ago
Updated 8 months ago
flash-attention
by
Dao-AILab
0.6%
20k
Fast, memory-efficient attention implementation
Starred by
+31
Created 3 years ago
Updated 1 day ago
instruct-eval
by
declare-lab
0%
548
Evaluation code for instruction-tuned LLMs
Starred by
Created 2 years ago
Updated 1 year ago
ToolAlpaca
by
tangqiaoyu
0%
881
Tool-learning framework for language models, research paper
Starred by
Created 2 years ago
Updated 11 months ago
h2o-llmstudio
by
h2oai
0.5%
5k
LLM Studio: framework for LLM fine-tuning via GUI or CLI
Starred by
+4
Created 2 years ago
Updated 2 weeks ago
FastEdit
by
hiyouga
0%
1k
Tool for fast edits to large language models
Starred by
Created 2 years ago
Updated 2 years ago
Baichuan-13B
by
baichuan-inc
0%
3k
LLM for both pretraining and chat
Created 2 years ago
Updated 2 years ago
UER-py
by
dbiir
0.2%
3k
PyTorch toolkit for pre-training and fine-tuning NLP models
Starred by
Created 6 years ago
Updated 1 year ago
lmdeploy
by
InternLM
0.3%
7k
Toolkit for LLM compression, deployment, and serving
Starred by
+8
Created 2 years ago
Updated 1 day ago
InternLM
by
InternLM
0.1%
7k
LLM series (InternLM, InternLM2, InternLM2.5, InternLM3) official release
Starred by
+4
Created 2 years ago
Updated 2 months ago
ChatLaw
by
PKU-YuanGroup
0.1%
7k
LLM for Chinese legal applications, research paper
Starred by
Created 2 years ago
Updated 9 months ago
direct-preference-optimization
by
eric-mitchell
0.1%
3k
Reference implementation for Direct Preference Optimization (DPO)
Starred by
Created 2 years ago
Updated 1 year ago
server
by
triton-inference-server
0.3%
10k
AI model inference serving optimized for cloud and edge
Starred by
+12
Created 7 years ago
Updated 1 day ago
GPTQ-for-LLaMa
by
qwopqwop200
0.1%
3k
4-bit quantization for LLaMA models using GPTQ
Starred by
+2
Created 2 years ago
Updated 1 year ago
AutoGPTQ
by
AutoGPTQ
0.3%
5k
LLM quantization package using GPTQ algorithm
Starred by
+12
Created 2 years ago
Updated 6 months ago
openchat
by
imoneoi
0.1%
5k
Open-source LLM fine-tuned with C-RLFT, inspired by offline reinforcement learning
Starred by
+4
Created 2 years ago
Updated 1 year ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 2 years ago
Updated 4 months ago
ChatGLM2-6B
by
zai-org
0.0%
16k
Bilingual chat LLM for research/commercial use (after registration)
Starred by
Created 2 years ago
Updated 1 year ago
vllm
by
vllm-project
0.8%
60k
LLM serving engine for high-throughput, memory-efficient inference
Starred by
+57
Created 2 years ago
Updated 22 hours ago
BIG-bench
by
google
0.1%
3k
Collaborative benchmark for probing and extrapolating LLM capabilities
Starred by
+11
Created 4 years ago
Updated 1 year ago
GAOKAO-Bench
by
OpenLMLab
0.3%
687
Evaluation framework for assessing LLMs using Chinese GAOKAO (college entrance exam) questions
Created 2 years ago
Updated 9 months ago
BayLing
by
ictnlp
0%
317
Multilingual LLM for cross-lingual alignment and instruction following
Created 2 years ago
Updated 10 months ago
HugNLP
by
HugAILab
0.3%
391
NLP library based on HuggingFace Transformers
Created 2 years ago
Updated 1 year ago
NBCE
by
bojone
0%
324
Context extension technique for LLMs (research paper)
Created 2 years ago
Updated 10 months ago
Baichuan-7B
by
baichuan-inc
0%
6k
7B-parameter LLM for commercial use
Starred by
Created 2 years ago
Updated 1 year ago
MiniChain
by
srush
0%
1k
Tiny library for coding with large language models
Starred by
+7
Created 2 years ago
Updated 1 year ago
WizardLM
by
nlpxucan
0.1%
9k
LLMs built using Evol-Instruct for complex instruction following
Starred by
+15
Created 2 years ago
Updated 4 months ago
rome
by
kmeng01
0.3%
673
Model editing research paper for GPT-2 and GPT-J
Starred by
Created 3 years ago
Updated 1 year ago
CodeGen
by
salesforce
0.1%
5k
Open-source model family for program synthesis
Starred by
+8
Created 3 years ago
Updated 8 months ago
uniem
by
wangyuxinwhy
0%
871
Unified embedding model for Chinese text
Created 2 years ago
Updated 2 years ago
big-AGI
by
enricoros
0.1%
7k
AI suite for advanced AI/AGI functions, deployable on-prem or cloud
Starred by
Created 2 years ago
Updated 22 hours ago
InternLM-techreport
by
InternLM
0%
904
Multilingual LLM research paper with 104B parameters
Starred by
Created 2 years ago
Updated 2 years ago
FlagAI
by
FlagAI-Open
0%
4k
Toolkit for large-scale model training, fine-tuning, and deployment
Starred by
Created 3 years ago
Updated 1 month ago
awesome-pretrained-chinese-nlp-models
by
lonePatient
0.3%
5k
Resource list: Chinese NLP pretrained models, LLMs, multimodal models
Starred by
Created 6 years ago
Updated 3 days ago
ceval
by
hkust-nlp
0.2%
2k
Chinese eval suite for foundation models (NeurIPS 2023)
Starred by
Created 2 years ago
Updated 2 months ago
YuLan-Chat
by
RUC-GSAI
0%
633
Open-source LLM for chat, instruction-following, and general language tasks
Created 2 years ago
Updated 9 months ago
langchain
by
langchain-ai
0.4%
117k
Framework for building LLM-powered applications
Starred by
+83
Created 3 years ago
Updated 1 day ago
TigerBot
by
TigerResearch
0%
2k
LLM foundation for multi-language, multi-task applications
Created 2 years ago
Updated 9 months ago
GalTransl
by
GalTransl
0.7%
2k
Tool for visual novel translation using LLMs
Created 2 years ago
Updated 3 days ago
Sophia
by
Liuhong99
0%
976
Optimizer for language model pre-training (research paper)
Starred by
+1
Created 2 years ago
Updated 1 year ago
Chain-of-ThoughtsPapers
by
Timothyxxx
0.1%
2k
List of research papers on chain-of-thought prompting
Starred by
Created 3 years ago
Updated 2 years ago
document.ai
by
GanymedeNil
0.0%
4k
Local knowledge base solution using vector DB and GPT-3.5
Starred by
Created 2 years ago
Updated 2 years ago
LLM-ToolMaker
by
ctlllll
0.1%
1k
Research paper on LLMs creating their own tools
Starred by
Created 2 years ago
Updated 2 years ago
pyllama
by
henrywoo
0%
3k
Hacked LLaMA version for single consumer-grade GPU inference
Starred by
Created 2 years ago
Updated 1 year ago
lawyer-llama
by
AndrewZhe
0%
962
Chinese legal LLaMA for law knowledge and consultation
Created 2 years ago
Updated 1 year ago
HuatuoGPT
by
FreedomIntelligence
0.3%
1k
Medical LLM for doctor-patient consultation
Created 2 years ago
Updated 10 months ago
ml_timeline
by
osanseviero
0%
587
Curated timeline of recent ML model releases, code, and papers
Starred by
Created 2 years ago
Updated 2 years ago
MeZO
by
princeton-nlp
0.2%
1k
Research paper implementation for memory-efficient LM fine-tuning
Starred by
Created 2 years ago
Updated 1 year ago
KnowledgeEditingPapers
by
zjunlp
0.8%
1k
Curated list of must-read research papers on knowledge editing for LLMs
Starred by
Created 2 years ago
Updated 3 months ago
CPM-Bee
by
OpenBMB
0%
2k
Bilingual base model for research/commercial use
Created 2 years ago
Updated 2 years ago
qlora
by
artidoro
0.1%
11k
Finetuning tool for quantized LLMs
Starred by
+19
Created 2 years ago
Updated 1 year ago
LaWGPT
by
pengxiao-song
0.1%
6k
Chinese LLaMA tuned for legal use
Starred by
Created 2 years ago
Updated 1 year ago
ColossalAI
by
hpcaitech
0.0%
41k
AI system for large-scale parallel training
Starred by
+24
Created 4 years ago
Updated 1 day ago
BiLLa
by
Neutralzz
0%
417
Bilingual LLaMA enhances reasoning
Created 2 years ago
Updated 2 years ago
MiniGPT-4
by
Vision-CAIR
0.0%
26k
Vision-language model for multi-task learning
Starred by
+15
Created 2 years ago
Updated 1 year ago
Fengshenbang-LM
by
IDEA-CCNL
0.1%
4k
Chinese foundation model ecosystem for AI infrastructure
Starred by
Created 4 years ago
Updated 1 year ago
ChatWaifu_Mobile
by
Voine
0.2%
1k
Mobile app for AI character chat
Created 2 years ago
Updated 2 years ago
awesome-llm-human-preference-datasets
by
glgh
0.3%
380
Curated list of human preference datasets for LLM training
Created 2 years ago
Updated 2 years ago
LLMsPracticalGuide
by
Mooler0410
0.1%
10k
Curated list of LLM practical guide resources (tree, examples, papers)
Starred by
+6
Created 2 years ago
Updated 1 year ago
auto-evaluator
by
rlancemartin
0%
1k
Evaluation tool for LLM QA chains
Starred by
+3
Created 2 years ago
Updated 2 years ago
trl
by
huggingface
0.7%
16k
Library for transformer RL
Starred by
+28
Created 5 years ago
Updated 1 day ago
whisper
by
openai
0.4%
89k
Speech recognition model for multilingual transcription/translation
Starred by
+40
Created 3 years ago
Updated 1 month ago
UltraChat
by
thunlp
0.1%
3k
Multi-round dialogue dataset and models for chat language model training
Starred by
Created 2 years ago
Updated 1 year ago
awesome-RLHF
by
opendilab
0.2%
4k
Curated list of RLHF resources for language model alignment
Starred by
Created 2 years ago
Updated 3 weeks ago
MOSS
by
OpenMOSS
0.0%
12k
Open-source tool-augmented conversational language model
Starred by
+2
Created 2 years ago
Updated 1 year ago
TencentPretrain
by
Tencent
0.2%
1k
PyTorch framework for multimodal pre-training and fine-tuning
Created 3 years ago
Updated 1 year ago
LMFlow
by
OptimalScale
0.0%
8k
Toolkit for finetuning and inference of large foundation models
Starred by
+9
Created 2 years ago
Updated 2 months ago
Alpaca-CoT
by
PhoebusSi
0.1%
3k
IFT platform for instruction collection, parameter-efficient methods, and LLMs
Starred by
Created 2 years ago
Updated 1 year ago
FindTheChatGPTer
by
chenking2020
0%
2k
Collection of ChatGPT open-source alternatives
Starred by
Created 2 years ago
Updated 2 years ago
RRHF
by
GanjinZero
0%
811
RRHF for aligning LLMs to human preferences
Starred by
Created 2 years ago
Updated 2 years ago
Instructgpt-prompts
by
kevinamiri
0.2%
534
Instruction-following prompts for ChatGPT, GPT-3.5, GPT-4
Starred by
Created 2 years ago
Updated 1 week ago
ChatGLM-Efficient-Tuning
by
hiyouga
0.1%
4k
Fine-tuning tool for ChatGLM-6B
Created 2 years ago
Updated 2 years ago
DeepSpeed
by
deepspeedai
0.2%
40k
Deep learning optimization library for distributed training and inference
Starred by
+36
Created 5 years ago
Updated 1 day ago
InstructGLM
by
yanqiangmiffy
0%
654
LoRA tuning script for ChatGLM-6B
Created 2 years ago
Updated 2 years ago
ChatGLM-finetune-LoRA
by
lich99
0.1%
720
LoRA finetuning code for ChatGLM-6b
Starred by
Created 2 years ago
Updated 2 years ago
ChatGLM-LLaMA-chinese-insturct
by
27182812
0%
388
Fine-tuning exploration for ChatGLM, LLaMA on Chinese instruction data
Created 2 years ago
Updated 2 years ago
chat-dataset-baseline
by
hikariming
0.1%
1k
Fine-tuned chat model and dataset for Chinese dialogue
Created 2 years ago
Updated 5 months ago
nlp_chinese_corpus
by
brightmart
0.1%
10k
Chinese NLP corpus for pre-training and language model tasks
Starred by
Created 6 years ago
Updated 1 month ago
stanford_alpaca
by
tatsu-lab
0.1%
30k
Instruction-following LLaMA model training and data generation
Starred by
+25
Created 2 years ago
Updated 1 year ago
alpaca-lora
by
tloen
0.0%
19k
LoRA fine-tuning for LLaMA
Starred by
+22
Created 2 years ago
Updated 1 year ago
Chinese-LLaMA-Alpaca
by
ymcui
0.1%
19k
Chinese LLaMA & Alpaca: LLMs for Chinese NLP research
Starred by
Created 2 years ago
Updated 3 months ago
GPT-4-LLM
by
Instruction-Tuning-with-GPT-4
0.0%
4k
GPT-4 data for instruction-tuning LLMs via supervised/RL
Starred by
+5
Created 2 years ago
Updated 2 years ago
memit
by
kmeng01
0.6%
519
Transformer memory mass-editor (ICLR 2023 research paper)
Starred by
Created 3 years ago
Updated 1 year ago
baize-chatbot
by
project-baize
0%
3k
Chat model trained via LoRA, using ChatGPT-generated dialogs
Starred by
+3
Created 2 years ago
Updated 1 year ago
Linly
by
CVI-SZU
0%
3k
Chinese LLMs and datasets for pretraining/finetuning
Created 2 years ago
Updated 1 year ago
peft
by
huggingface
0.3%
20k
Parameter-efficient fine-tuning (PEFT) library
Starred by
+16
Created 2 years ago
Updated 1 day ago
LoRA
by
microsoft
0.2%
13k
PyTorch library for low-rank adaptation (LoRA) of LLMs
Starred by
+12
Created 4 years ago
Updated 10 months ago
gpt_academic
by
binary-husky
0.1%
69k
LLM tool for paper reading/polishing/writing, optimized UI
Starred by
+2
Created 2 years ago
Updated 3 weeks ago
BELLE
by
LianjiaTech
0.1%
8k
Chinese LLM engine for democratized access and instruction tuning
Starred by
Created 2 years ago
Updated 1 year ago
TaskMatrix
by
chenfei-wu
0.0%
34k
Visual ChatGPT connects LLMs to visual foundation models
Starred by
+13
Created 2 years ago
Updated 1 year ago
chatgpt_please_improve_my_paper_writing
by
ashawkey
0%
254
Thin wrapper for academic paper refinement
Created 2 years ago
Updated 2 years ago
mend
by
eric-mitchell
0%
251
Fast model editing for LLMs
Starred by
Created 4 years ago
Updated 2 years ago
adapters
by
adapter-hub
0.1%
3k
Unified library for parameter-efficient transfer learning in NLP
Starred by
+7
Created 5 years ago
Updated 2 days ago
ContinualLM
by
UIC-Liu-Lab
0.3%
286
PyTorch framework for continual learning of language models
Created 2 years ago
Updated 1 year ago
ConSERT
by
yym6472
0%
543
Research paper code for contrastive self-supervised sentence representation transfer
Created 4 years ago
Updated 3 years ago
datasets
by
huggingface
0.1%
21k
Access and process large AI datasets efficiently
Starred by
+23
Created 5 years ago
Updated 1 day ago
aim
by
aimhubio
0.1%
6k
Experiment tracker for AI model training runs
Starred by
+14
Created 6 years ago
Updated 1 day ago
pytorch3d
by
facebookresearch
0.2%
10k
PyTorch3D is a PyTorch library for 3D deep learning research
Starred by
+10
Created 6 years ago
Updated 5 days ago
vit-pytorch
by
lucidrains
0.2%
24k
PyTorch library for Vision Transformer variants and related techniques
Starred by
+10
Created 5 years ago
Updated 1 day ago
CLIP_prefix_caption
by
rmokady
0%
1k
Image captioning model using CLIP embeddings as a prefix
Starred by
Created 4 years ago
Updated 1 year ago
NL-Augmenter
by
GEM-benchmark
0.1%
788
Framework for natural language dataset augmentation
Starred by
Created 4 years ago
Updated 1 year ago
pytorch-image-models
by
huggingface
0.1%
35k
PyTorch image model collection with training, eval, and inference scripts
Starred by
+23
Created 6 years ago
Updated 2 days ago
vision_transformer
by
google-research
0.2%
12k
Vision Transformer and MLP-Mixer models in JAX/Flax
Starred by
+5
Created 5 years ago
Updated 7 months ago
graph4nlp
by
graph4ai
0%
2k
SDK for graph neural networks in NLP
Starred by
Created 5 years ago
Updated 1 year ago
TextAttack
by
QData
0.1%
3k
Python framework for NLP adversarial attacks, data augmentation, and model training
Starred by
+4
Created 6 years ago
Updated 3 months ago
Text_Classification
by
kk7nc
0.1%
2k
Survey paper for text classification algorithms
Starred by
Created 7 years ago
Updated 6 months ago
robustbench
by
RobustBench
0.1%
744
Standardized benchmark for adversarial robustness research
Created 5 years ago
Updated 6 months ago
CLIP
by
openai
0.3%
31k
Image-text matching model for zero-shot prediction
Starred by
+28
Created 4 years ago
Updated 1 year ago
CVPR2025-Papers-with-Code
by
amusi
0.3%
21k
Curated list of CVPR 2025 papers with code
Starred by
Created 5 years ago
Updated 3 months ago
TAADpapers
by
thunlp
0.1%
2k
Curated list of must-read papers on textual adversarial attack and defense
Created 6 years ago
Updated 4 months ago
flax
by
google
0.1%
7k
NN library for JAX, designed for flexibility in neural network research
Starred by
+19
Created 5 years ago
Updated 1 day ago
Real-Time-Voice-Cloning
by
CorentinJ
0.8%
59k
Voice cloning for real-time speech generation
Starred by
+9
Created 6 years ago
Updated 3 weeks ago
Chinese-ELECTRA
by
ymcui
0.1%
1k
Chinese ELECTRA pre-trained language models
Created 5 years ago
Updated 3 months ago
electra
by
google-research
0.1%
2k
Text encoder pre-training via GAN-like discriminator
Starred by
+3
Created 5 years ago
Updated 1 year ago
apex
by
NVIDIA
0.1%
9k
PyTorch extension for streamlined mixed precision & distributed training
Starred by
+18
Created 7 years ago
Updated 1 week ago
unilm
by
microsoft
0.1%
22k
Foundation models for language, vision, speech, and multimodal tasks
Starred by
+19
Created 6 years ago
Updated 3 months ago
ICLR2019-OpenReviewData
by
shaohua0116
0%
387
Data & visualizations for ICLR 2019 OpenReview data, a research paper
Starred by
Created 7 years ago
Updated 5 years ago
ICLR2020-OpenReviewData
by
shaohua0116
0%
461
Data crawler for ICLR OpenReview webpages
Starred by
Created 6 years ago
Updated 5 years ago
universal-triggers
by
Eric-Wallace
0%
297
NLP attack/analysis research paper (EMNLP 2019)
Starred by
Created 6 years ago
Updated 1 year ago
text-to-text-transfer-transformer
by
google-research
0.1%
6k
Unified text-to-text transformer for NLP research
Starred by
+13
Created 6 years ago
Updated 5 months ago
transferlearning
by
jindongwang
0.1%
14k
Resource list for transfer learning research and development
Starred by
Created 8 years ago
Updated 7 months ago
naacl_transfer_learning_tutorial
by
huggingface
0%
720
NLP transfer learning tutorial code
Starred by
+2
Created 6 years ago
Updated 6 years ago
RAdam
by
LiyuanLucasLiu
0%
3k
Optimizer for neural network training, addressing adaptive learning rate variance
Starred by
+1
Created 6 years ago
Updated 4 years ago
Pytorch-UNet
by
milesial
0.3%
11k
PyTorch implementation for image semantic segmentation
Starred by
Created 8 years ago
Updated 1 year ago
higgsfield
by
higgsfield-ai
0.1%
3k
ML framework for large model training and GPU orchestration
Starred by
+10
Created 7 years ago
Updated 1 year ago
DQN_pytorch
by
dxyang
0%
536
PyTorch implementations of DQN variants
Created 8 years ago
Updated 7 years ago
transformers
by
huggingface
0.2%
151k
ML library for pretrained model inference and training
Starred by
+96
Created 7 years ago
Updated 20 hours ago
bertviz
by
jessevig
0.1%
8k
Interactive tool for visualizing attention in Transformer language models
Starred by
+6
Created 6 years ago
Updated 4 months ago
ABSA-PyTorch
by
songyouwei
0.1%
2k
PyTorch implementations for aspect-based sentiment analysis
Created 7 years ago
Updated 2 years ago
deeplearning-papernotes
by
dennybritz
0%
4k
Deep learning paper notes and summaries
Starred by
+3
Created 9 years ago
Updated 7 years ago
bert
by
google-research
0.1%
40k
TensorFlow code and pre-trained models for BERT
Starred by
+26
Created 7 years ago
Updated 1 year ago
nmt
by
tensorflow
0.0%
6k
Build state-of-the-art Neural Machine Translation systems
Starred by
+3
Created 8 years ago
Updated 3 years ago
tensorflow
by
tensorflow
0.1%
192k
Open-source ML framework
Starred by
+97
Created 10 years ago
Updated 20 hours ago
Feedback? Help us improve.