Yaowei Zheng

@hiyouga

Author of LLaMA-Factory

Starred Projects (384)

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Theo Browne (Founder of Ping.gg), and 8 more.

harmony by openai

10.6%
3k
Renderer for OpenAI's harmony response format
created 2 weeks ago
updated 2 days ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 12 more.

gpt-oss by openai

10.5%
17k
Open-weight LLMs for reasoning and agents
created 1 month ago
updated 18 hours ago
Starred by Koray Kavukcuoglu (Chief AI Architect at Google; CTO of Google DeepMind), John Resig (Author of jQuery; Chief Software Architect at Khan Academy), and 19 more.

gemini-cli by google-gemini

2.4%
70k
AI agent for terminal workflows
created 4 months ago
updated 18 hours ago
Starred by Wei-Lin Chiang (Cofounder of LMArena), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 1 more.

qwen-code by QwenLM

26.4%
10k
AI coding agent for complex codebases
created 1 month ago
updated 1 day ago
Starred by Lianmin Zheng (Author of SGLang), Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), and 2 more.

Kimi-K2 by MoonshotAI

1.7%
8k
State-of-the-art MoE language model
created 1 month ago
updated 2 weeks ago
Starred by Didier Lopes (Founder of OpenBB), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 3 more.

fastmcp by jlowin

2.5%
16k
Pythonic SDK for building Model Context Protocol (MCP) servers/clients
created 8 months ago
updated 22 hours ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems").

WeClone by xming521

0.5%
15k
Digital twin one-stop solution
created 1 year ago
updated 22 hours ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Elvis Saravia (Founder of DAIR.AI), and 13 more.

markitdown by microsoft

1.4%
72k
Python tool for converting files to Markdown for LLM text analysis
created 9 months ago
updated 4 days ago
Starred by Patrick von Platen (Research Engineer at Mistral; Author of Hugging Face Diffusers), Dmytro Ivchenko (Cofounder of Fireworks AI), and 2 more.

ml-cross-entropy by apple

0.4%
512
PyTorch module for memory-efficient cross-entropy in LLMs
created 9 months ago
updated 2 weeks ago
Starred by Chris Lattner (Cofounder of Modular; Author of LLVM, Clang, Swift, Mojo, MLIR), Vincent Weisser (Cofounder of Prime Intellect), and 14 more.

open-infra-index by deepseek-ai

0.1%
8k
AI infrastructure tools for efficient AGI development
created 5 months ago
updated 3 months ago
Starred by Lilian Weng (Cofounder of Thinking Machines Lab), Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), and 1 more.

Awesome-ML-SYS-Tutorial by zhaochenyang20

3.4%
3k
ML SYS learning notes and code
created 9 months ago
updated 2 days ago
Starred by Lewis Tunstall (Research Engineer at Hugging Face), Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), and 5 more.

s1 by simplescaling

0.1%
7k
Test-time scaling recipe for strong reasoning performance
created 6 months ago
updated 1 month ago
Starred by George Hotz (Author of tinygrad; Founder of the tiny corp, comma.ai), Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), and 8 more.

TinyZero by Jiayi-Pan

0.2%
12k
Minimal reproduction of DeepSeek R1 Zero for countdown/multiplication tasks
created 6 months ago
updated 3 months ago
Starred by Michael Han (Cofounder of Unsloth), Sebastian Raschka (Author of "Build a Large Language Model (From Scratch)"), and 10 more.

DeepSeek-R1 by deepseek-ai

0.1%
91k
Reasoning models research paper
created 6 months ago
updated 1 month ago
Starred by Vincent Weisser (Cofounder of Prime Intellect), Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), and 3 more.

simpleRL-reason by hkust-nlp

0.4%
4k
RL recipe for reasoning ability in models
created 6 months ago
updated 1 week ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Jeff Hammerbacher (Cofounder of Cloudera), and 15 more.

open-r1 by huggingface

0.3%
25k
SDK for reproducing DeepSeek-R1
created 6 months ago
updated 4 days ago
Starred by Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), Thomas Wolf (Cofounder of Hugging Face), and 1 more.

Math-Verify by huggingface

0.9%
886
Math evaluator for LLM outputs in mathematical tasks
created 7 months ago
updated 1 month ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Alex Yu (Research Scientist at OpenAI; Former Cofounder of Luma AI), and 2 more.

SkyThought by NovaSky-AI

0.1%
3k
Training recipes for Sky-T1 family of models
created 7 months ago
updated 1 month ago
Starred by Peter Norvig (Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Didier Lopes (Founder of OpenBB), and 23 more.

llm.c by karpathy

0.2%
27k
LLM training in pure C/CUDA, no PyTorch needed
created 1 year ago
updated 1 month ago
Starred by George Hotz (Author of tinygrad; Founder of the tiny corp, comma.ai), Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), and 7 more.

modded-nanogpt by KellerJordan

1.3%
3k
Language model training speedrun on 8x H100 GPUs
created 1 year ago
updated 4 weeks ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Daniel Han (Cofounder of Unsloth).

Kiln by Kiln-AI

0.3%
4k
AI prototyping and dataset collaboration tool
created 1 year ago
updated 1 day ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Patrick von Platen (Research Engineer at Mistral; Author of Hugging Face Diffusers), and 10 more.

stable-dreamfusion by ashawkey

0.1%
9k
Text-to-3D model using NeRF and diffusion
created 2 years ago
updated 1 year ago
Starred by Jason Knight (Director AI Compilers at NVIDIA; Cofounder of OctoML), Vincent Weisser (Cofounder of Prime Intellect), and 8 more.

verl by volcengine

2.2%
12k
RL training library for LLMs
created 9 months ago
updated 1 day ago
Starred by Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow) and Alex Yu (Research Scientist at OpenAI; Former Cofounder of Luma AI).

VILA by NVlabs

0.5%
3k
Open-source VLMs for efficient video/multi-image understanding
created 1 year ago
updated 1 week ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Jiayi Pan (Author of SWE-Gym; MTS at xAI), and 8 more.

Liger-Kernel by linkedin

0.8%
6k
Triton kernels for efficient LLM training
created 1 year ago
updated 1 day ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Ying Sheng (Author of SGLang).

DoRA by NVlabs

0.7%
829
PyTorch code for weight-decomposed low-rank adaptation (DoRA)
created 1 year ago
updated 10 months ago
Starred by Vincent Weisser (Cofounder of Prime Intellect), Jared Palmer (Ex-VP AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), and 10 more.

LLM101n by karpathy

0.2%
34k
Educational resource for building a Storyteller AI LLM
created 1 year ago
updated 1 year ago
Starred by Jeff Hammerbacher (Cofounder of Cloudera), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 5 more.

MiniCPM-o by OpenBMB

0.4%
20k
MLLM for vision, speech, and multimodal live streaming on your phone
created 1 year ago
updated 4 days ago
Starred by Wing Lian (Founder of Axolotl AI) and Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake).

HALOs by ContextualAI

0.1%
878
Library for aligning LLMs using human-aware loss functions
created 1 year ago
updated 1 month ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Zhuohan Li (Author of vLLM), and 6 more.

torchtitan by pytorch

1.2%
4k
PyTorch platform for generative AI model training research
created 1 year ago
updated 17 hours ago
Starred by Lewis Tunstall (Research Engineer at Hugging Face), Patrick von Platen (Research Engineer at Mistral; Author of Hugging Face Diffusers), and 10 more.

torchtune by pytorch

0.5%
5k
PyTorch library for LLM post-training and experimentation
created 1 year ago
updated 1 day ago
Starred by Junyang Lin (Core Maintainer of Alibaba Qwen), Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and 2 more.

veScale by volcengine

1.1%
852
PyTorch-native framework for LLM training
created 1 year ago
updated 1 month ago
Starred by Jiayi Pan (Author of SWE-Gym; MTS at xAI), Albert Gu (Cofounder of Cartesia; Professor at CMU), and 9 more.

LLM-Training-Puzzles by srush

0.3%
1k
Hands-on puzzles for large language model training
created 2 years ago
updated 1 year ago
Starred by Lysandre Debut (Chief Open-Source Officer at Hugging Face), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 1 more.

AQLM by Vahe1994

0.2%
1k
PyTorch code for LLM compression via Additive Quantization (AQLM)
created 1 year ago
updated 1 week ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Jeremy Howard (Cofounder of fast.ai), and 4 more.

llm-awq by mit-han-lab

0.5%
3k
Weight quantization research paper for LLM compression/acceleration
created 2 years ago
updated 4 weeks ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Georgios Konstantopoulos (CTO, General Partner at Paradigm), and 3 more.

gptq by IST-DASLab

0.1%
2k
Code for GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers
created 2 years ago
updated 1 year ago
Starred by Georgi Gerganov (Author of llama.cpp, whisper.cpp), Alex Yu (Research Scientist at OpenAI; Former Cofounder of Luma AI), and 10 more.

Qwen3 by QwenLM

0.9%
24k
Large language model series by Qwen team, Alibaba Cloud
created 1 year ago
updated 1 week ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Simon Willison (Author of Django), and 7 more.

Yi by 01-ai

0.1%
8k
Open-source bilingual LLMs trained from scratch
created 1 year ago
updated 8 months ago
Starred by Lewis Tunstall (Research Engineer at Hugging Face), Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), and 7 more.

datatrove by huggingface

0.9%
3k
Data processing library for large-scale text data
created 2 years ago
updated 2 days ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Luca Antiga (CTO of Lightning AI), and 8 more.

helm by stanford-crfm

0.5%
2k
Open-source Python framework for holistic evaluation of foundation models
created 3 years ago
updated 20 hours ago
Starred by Jeff Hammerbacher (Cofounder of Cloudera), Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), and 12 more.

TensorRT-LLM by NVIDIA

0.5%
11k
LLM inference optimization SDK for NVIDIA GPUs
created 2 years ago
updated 20 hours ago
Starred by George Hotz (Author of tinygrad; Founder of the tiny corp, comma.ai), Zhiyuan Li (Cofounder of Nexa AI), and 18 more.

mamba by state-spaces

0.3%
16k
Mamba SSM architecture for sequence modeling
created 1 year ago
updated 4 weeks ago
Starred by Tobi Lutke (Cofounder of Shopify), Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), and 25 more.

unsloth by unslothai

1.2%
44k
Finetuning tool for LLMs, targeting speed and memory efficiency
created 1 year ago
updated 1 day ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Lianmin Zheng (Author of SGLang), and 15 more.

gpt-fast by meta-pytorch

0.1%
6k
PyTorch text generation for efficient transformer inference
created 1 year ago
updated 4 months ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Alexander Wettig (Author of SWE-bench, SWE-agent), and 3 more.

data-juicer by modelscope

0.9%
5k
Data-Juicer: Data processing system for foundation models
created 2 years ago
updated 1 day ago
Starred by Tobi Lutke (Cofounder of Shopify), Patrick von Platen (Research Engineer at Mistral; Author of Hugging Face Diffusers), and 20 more.

axolotl by axolotl-ai-cloud

0.6%
10k
CLI tool for streamlined post-training of AI models
created 2 years ago
updated 17 hours ago
Starred by Georgios Konstantopoulos (CTO, General Partner at Paradigm), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 2 more.

streaming-llm by mit-han-lab

0.2%
7k
Framework for efficient LLM streaming
created 1 year ago
updated 1 year ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), and 2 more.

LongLoRA by dvlab-research

0.1%
3k
LongLoRA: Efficient fine-tuning for long-context LLMs
created 1 year ago
updated 1 year ago
Starred by Vincent Weisser (Cofounder of Prime Intellect), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 3 more.

yarn by jquesnelle

0.4%
2k
Context window extension method for LLMs (research paper, models)
created 2 years ago
updated 1 year ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Vincent Weisser (Cofounder of Prime Intellect), and 10 more.

codellama by meta-llama

0.0%
16k
Inference code for CodeLlama models
created 2 years ago
updated 1 year ago
Starred by Ying Sheng (Author of SGLang), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 6 more.

ToolBench by OpenBMB

0.4%
5k
Open platform for LLM tool learning (ICLR'24 spotlight)
created 2 years ago
updated 2 months ago
Starred by Elie Bursztein (Cybersecurity Lead at Google DeepMind), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 4 more.

llm-attacks by llm-attacks

0.5%
4k
Attack framework for aligned LLMs, based on a research paper
created 2 years ago
updated 1 year ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Junyang Lin (Core Maintainer of Alibaba Qwen), and 4 more.

LightLLM by ModelTC

0.9%
4k
Python framework for LLM inference and serving
created 2 years ago
updated 1 day ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Jiayi Pan (Author of SWE-Gym; MTS at xAI), and 25 more.

flash-attention by Dao-AILab

0.7%
19k
Fast, memory-efficient attention implementation
created 3 years ago
updated 22 hours ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow).

FastEdit by hiyouga

0%
1k
Tool for fast edits to large language models
created 2 years ago
updated 2 years ago
Starred by Sebastian Raschka (Author of "Build a Large Language Model (From Scratch)"), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 1 more.

direct-preference-optimization by eric-mitchell

0.6%
3k
Reference implementation for Direct Preference Optimization (DPO)
created 2 years ago
updated 1 year ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Georgios Konstantopoulos (CTO, General Partner at Paradigm), and 3 more.

GPTQ-for-LLaMa by qwopqwop200

0.1%
3k
4-bit quantization for LLaMA models using GPTQ
created 2 years ago
updated 1 year ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Clement Delangue (Cofounder of Hugging Face), and 41 more.

vllm by vllm-project

1.4%
55k
LLM serving engine for high-throughput, memory-efficient inference
created 2 years ago
updated 17 hours ago
Starred by Vincent Weisser (Cofounder of Prime Intellect), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 15 more.

WizardLM by nlpxucan

0.1%
9k
LLMs built using Evol-Instruct for complex instruction following
created 2 years ago
updated 2 months ago
Starred by Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), Evan Hubinger (Head of Alignment Stress-Testing at Anthropic), and 1 more.

rome by kmeng01

0.1%
655
Model editing research paper for GPT-2 and GPT-J
created 3 years ago
updated 1 year ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Carol Willing (Core Contributor to CPython, Jupyter), and 54 more.

langchain by langchain-ai

0.4%
114k
Framework for building LLM-powered applications
created 2 years ago
updated 17 hours ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Vincent Weisser (Cofounder of Prime Intellect), and 3 more.

Sophia by Liuhong99

0.2%
966
Optimizer for language model pre-training (research paper)
created 2 years ago
updated 1 year ago
Starred by Tobi Lutke (Cofounder of Shopify), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 16 more.

qlora by artidoro

0.1%
11k
Finetuning tool for quantized LLMs
created 2 years ago
updated 1 year ago
Starred by Tobi Lutke (Cofounder of Shopify), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 14 more.

ColossalAI by hpcaitech

0.1%
41k
AI system for large-scale parallel training
created 3 years ago
updated 1 day ago
Starred by Boris Cherny (Creator of Claude Code; MTS at Anthropic), Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), and 29 more.

whisper by openai

0.5%
87k
Speech recognition model for multilingual transcription/translation
created 2 years ago
updated 1 month ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Omar Sanseviero (DevRel at Google DeepMind), and 3 more.

MOSS by OpenMOSS

0.0%
12k
Open-source tool-augmented conversational language model
created 2 years ago
updated 1 year ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Junyang Lin (Core Maintainer of Alibaba Qwen), and 1 more.

Alpaca-CoT by PhoebusSi

0.1%
3k
IFT platform for instruction collection, parameter-efficient methods, and LLMs
created 2 years ago
updated 1 year ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), John Yang (Author of SWE-bench, SWE-agent), and 20 more.

stanford_alpaca by tatsu-lab

0.0%
30k
Instruction-following LLaMA model training and data generation
created 2 years ago
updated 1 year ago
Starred by Junyang Lin (Core Maintainer of Alibaba Qwen), Vincent Weisser (Cofounder of Prime Intellect), and 15 more.

alpaca-lora by tloen

0.1%
19k
LoRA fine-tuning for LLaMA
created 2 years ago
updated 1 year ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Patrick von Platen (Research Engineer at Mistral; Author of Hugging Face Diffusers), and 10 more.

LoRA by microsoft

0.4%
13k
PyTorch library for low-rank adaptation (LoRA) of LLMs
created 4 years ago
updated 8 months ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Taranjeet Singh (Cofounder of Mem0), and 9 more.

TaskMatrix by chenfei-wu

0.0%
34k
Visual ChatGPT connects LLMs to visual foundation models
created 2 years ago
updated 1 year ago
Starred by Ying Sheng (Author of SGLang), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 6 more.

adapters by adapter-hub

0.1%
3k
Unified library for parameter-efficient transfer learning in NLP
created 5 years ago
updated 1 day ago
Starred by Clement Delangue (Cofounder of Hugging Face), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 19 more.

datasets by huggingface

0.2%
21k
Access and process large AI datasets efficiently
created 5 years ago
updated 3 days ago
Starred by Aravind Srinivas (Cofounder of Perplexity), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 8 more.

pytorch3d by facebookresearch

0.2%
9k
PyTorch3D is a PyTorch library for 3D deep learning research
created 5 years ago
updated 2 days ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Phil Wang (Prolific Research Paper Implementer), and 10 more.

vit-pytorch by lucidrains

0.3%
24k
PyTorch library for Vision Transformer variants and related techniques
created 4 years ago
updated 2 days ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Collin Burns (MTS at Anthropic; Author of MMLU).

CLIP_prefix_caption by rmokady

0%
1k
Image captioning model using CLIP embeddings as a prefix
created 3 years ago
updated 1 year ago
Starred by Aravind Srinivas (Cofounder of Perplexity), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 9 more.

higgsfield by higgsfield-ai

0.2%
3k
ML framework for large model training and GPU orchestration
created 7 years ago
updated 1 year ago
Starred by Clement Delangue (Cofounder of Hugging Face), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 7 more.

bertviz by jessevig

0.2%
8k
Interactive tool for visualizing attention in Transformer language models
created 6 years ago
updated 2 months ago
Starred by Peter Norvig (Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Aravind Srinivas (Cofounder of Perplexity), and 71 more.

tensorflow by tensorflow

0.1%
191k
Open-source ML framework
created 9 years ago
updated 17 hours ago