Discover and explore top open-source AI tools and projects—updated daily.
Wing Lian
Founder of Axolotl AI
Starred Projects (384)
TileKernels
by
deepseek-ai
N/A
1k
Optimized GPU kernels for LLM operations
Starred by
Created 6 days ago
Updated 5 days ago
ml-intern
by
huggingface
1130.8%
7k
AI engineer for autonomous ML research, training, and deployment
Starred by
+4
Created 5 months ago
Updated 14 hours ago
code-review-graph
by
tirth8205
9.3%
13k
Codebase knowledge graph for AI assistants
Created 2 months ago
Updated 6 days ago
rtk
by
rtk-ai
16.9%
37k
CLI proxy for massive LLM token reduction
Starred by
Created 3 months ago
Updated 1 day ago
autoagent
by
kevinrgu
1.8%
4k
Autonomous agent harness engineering framework
Starred by
Created 3 weeks ago
Updated 3 weeks ago
mini-swe-agent
by
SWE-agent
3.6%
4k
AI agent for solving GitHub issues and command-line tasks
Starred by
+5
Created 10 months ago
Updated 14 hours ago
Adan
by
sail-sg
0.2%
816
PyTorch implementation of Adan optimizer for faster deep model training
Starred by
Created 3 years ago
Updated 10 months ago
ASI-Evolve
by
GAIR-NLP
18.6%
554
Agentic framework for autonomous scientific discovery and optimization
Starred by
Created 1 month ago
Updated 1 week ago
turboquant_plus
by
TheTom
2.6%
7k
LLM KV cache compression for efficient local inference
Starred by
Created 1 month ago
Updated 2 days ago
ATLAS
by
itigges22
0.8%
2k
Boosts frozen LLM performance for efficient, self-hosted AI
Starred by
Created 2 months ago
Updated 9 hours ago
agent-orchestrator
by
ComposioHQ
2.4%
7k
Orchestrating parallel AI coding agents for autonomous software development
Created 2 months ago
Updated 4 hours ago
autokernel
by
RightNow-AI
4.2%
1k
Autonomous GPU kernel optimization for PyTorch
Starred by
Created 1 month ago
Updated 1 month ago
KernelAgent
by
meta-pytorch
2.4%
384
Autonomous GPU kernel generation and optimization via AI agents
Starred by
Created 9 months ago
Updated 4 days ago
cookbook
by
Liquid4All
2.3%
2k
On-device AI models and SDK for edge applications
Starred by
Created 6 months ago
Updated 16 hours ago
simple-evals
by
openai
0.4%
4k
Lightweight library for evaluating language models
Starred by
+15
Created 2 years ago
Updated 5 days ago
Awesome-ML-SYS-Tutorial
by
zhaochenyang20
0.9%
6k
ML SYS learning notes and code
Starred by
+1
Created 1 year ago
Updated 5 days ago
ANE
by
maderix
0.4%
7k
Direct neural network training on Apple Neural Engine
Starred by
Created 1 month ago
Updated 1 month ago
multi-agent-coding-system
by
Danau5tin
0.2%
1k
AI coding system with orchestrator, explorer, and coder agents
Starred by
Created 8 months ago
Updated 5 months ago
CUDA-Agent
by
BytedTsinghua-SIA
0.7%
938
Agentic RL for high-performance CUDA kernel generation
Starred by
Created 2 months ago
Updated 1 month ago
OpenClaw-RL
by
Gen-Verse
1.7%
5k
Personalize AI agents through conversational reinforcement learning
Starred by
+1
Created 2 months ago
Updated 9 hours ago
crush
by
charmbracelet
1.2%
24k
AI coding agent for your terminal
Starred by
+4
Created 11 months ago
Updated 8 hours ago
slowrun
by
qlabs-eng
2.8%
447
LLM training benchmark prioritizing deep learning over speed
Starred by
Created 2 months ago
Updated 1 day ago
superpowers
by
obra
4.6%
170k
AI assistant superpowers via a comprehensive skills library
Starred by
+15
Created 6 months ago
Updated 7 hours ago
kvpress
by
NVIDIA
0.5%
1k
LLM KV cache compression made easy
Starred by
Created 1 year ago
Updated 5 days ago
SkillRL
by
aiming-lab
2.0%
684
Recursive skill-augmented reinforcement learning for evolving LLM agents
Created 2 months ago
Updated 2 weeks ago
Open-AgentRL
by
Gen-Verse
2.1%
473
Reinforcement learning for LLM agents
Created 6 months ago
Updated 2 months ago
discover
by
test-time-training
0.5%
547
Learning to discover at test time
Starred by
Created 3 months ago
Updated 4 weeks ago
mHC-manifold-constrained-hyper-connections
by
tokenbender
0.9%
350
Research implementation of manifold-constrained hyper-connections for deep learning models
Created 3 months ago
Updated 2 months ago
IQuest-Coder-V1
by
IQuestLab
0.1%
1k
Code LLMs for autonomous software engineering
Created 3 months ago
Updated 1 month ago
OpenTinker
by
open-tinker
0.5%
664
RL-as-a-Service infrastructure for foundation models
Starred by
+2
Created 4 months ago
Updated 1 month ago
punica
by
punica-ai
0.2%
1k
LoRA serving system (research paper) for multi-tenant LLM inference
Starred by
+3
Created 2 years ago
Updated 2 years ago
mLoRA
by
TUDB-Labs
0%
376
Framework for efficient LoRA fine-tuning of multiple LLMs
Created 2 years ago
Updated 1 year ago
miles
by
radixark
5.2%
1k
Enterprise RL for large-scale MoE models
Starred by
+5
Created 6 months ago
Updated 7 hours ago
ROLL
by
alibaba
0.5%
3k
RL library for large language models
Starred by
Created 11 months ago
Updated 11 hours ago
Kimi-Linear
by
MoonshotAI
0.4%
1k
Efficient linear attention architecture for accelerating long-context LLMs
Created 6 months ago
Updated 5 months ago
Fast-dLLM
by
NVlabs
1.5%
947
Diffusion LLM inference acceleration framework
Starred by
Created 11 months ago
Updated 2 weeks ago
auto-round
by
intel
2.9%
1k
Quantization algorithm for LLMs and VLMs
Starred by
Created 2 years ago
Updated 4 hours ago
luminal
by
luminal-ai
0.2%
3k
Deep learning library using composable compilers for high performance
Starred by
Created 2 years ago
Updated 15 hours ago
MARS
by
AGI-Arena
0%
718
Optimization framework for training large models
Created 1 year ago
Updated 1 month ago
DeepResearch
by
Alibaba-NLP
0.3%
19k
Benchmark for LLMs in web traversal
Starred by
+2
Created 1 year ago
Updated 1 month ago
gemlite
by
dropbox
0.4%
445
Triton kernels for efficient low-bit matrix multiplication
Starred by
Created 1 year ago
Updated 19 hours ago
AgentGym-RL
by
WooooDyy
0.8%
711
Train LLM agents for long-horizon, multi-turn decision-making
Starred by
Created 7 months ago
Updated 2 months ago
LlamaGym
by
KhoomeiK
0%
1k
SDK for fine-tuning LLM agents with online reinforcement learning
Starred by
Created 2 years ago
Updated 2 years ago
flame
by
fla-org
0.8%
380
Minimal, efficient framework for LLM training
Starred by
Created 1 year ago
Updated 5 days ago
Soft-Thinking
by
eric-ai-lab
0%
334
Enhancing LLM reasoning via continuous concept spaces
Created 11 months ago
Updated 3 months ago
DFT
by
yongliang-wu
0.2%
558
Improving SFT generalization with reward rectification
Starred by
Created 9 months ago
Updated 3 months ago
dion
by
microsoft
0.2%
468
Orthonormal updates for faster distributed ML training
Created 11 months ago
Updated 10 hours ago
mixture_of_recursions
by
raymin0223
0.7%
568
Adaptive LLM computation with dynamic recursion
Created 10 months ago
Updated 7 months ago
gem
by
axon-rl
0%
477
Agentic LLM training environment for interactive reinforcement learning
Starred by
Created 11 months ago
Updated 3 months ago
cc
by
kn1026
0.3%
710
Starred by
Created 9 months ago
Updated 9 months ago
HRM
by
sapientinc
0.1%
12k
Hierarchical reasoning for complex tasks
Starred by
Created 9 months ago
Updated 3 weeks ago
RL2
by
ChenmienTan
0.2%
1k
Reinforcement learning for large language models
Starred by
+1
Created 1 year ago
Updated 1 month ago
matmulfreellm
by
ridgerchu
0%
3k
MatMul-free language models
Starred by
+2
Created 2 years ago
Updated 4 months ago
applied-ai
by
meta-pytorch
0.3%
320
Applied AI experiments and examples for PyTorch
Starred by
Created 2 years ago
Updated 8 months ago
COAT
by
NVlabs
0%
261
FP8 training framework for memory efficiency
Created 1 year ago
Updated 8 months ago
SkyRL
by
NovaSky-AI
1.0%
2k
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
Starred by
+14
Created 1 year ago
Updated 6 hours ago
scattermoe
by
shawntan
0%
273
Triton-based Sparse Mixture-of-Experts for efficient deep learning
Starred by
Created 2 years ago
Updated 6 months ago
rStar
by
microsoft
0.1%
1k
Research paper repo for math reasoning in small LLMs via deep thinking
Starred by
Created 1 year ago
Updated 7 months ago
Skills
by
NVIDIA-NeMo
0.7%
938
LLM skill-improvement pipelines for synthetic data generation, training, and evaluation
Starred by
Created 2 years ago
Updated 7 hours ago
Absolute-Zero-Reasoner
by
LeapLabTHU
0.1%
2k
Self-play reasoning framework needing zero data
Starred by
Created 1 year ago
Updated 8 months ago
DeepSeekRL-Extended
by
brendanhogan
0%
252
GRPO implementation for scaled RL research
Starred by
Created 1 year ago
Updated 8 months ago
open-webui
by
open-webui
1.1%
135k
Self-hosted AI platform for local LLM deployment
Starred by
+24
Created 2 years ago
Updated 4 days ago
TTRL
by
PRIME-RL
0.6%
1k
RL technique for unlabeled data, especially test data
Created 1 year ago
Updated 1 week ago
R2E-Gym
by
R2E-Gym
0.4%
265
Scaling open-weight SWE agents with procedural environments and hybrid verifiers
Starred by
Created 1 year ago
Updated 9 months ago
axolotl
by
axolotl-ai-cloud
0.4%
12k
CLI tool for streamlined post-training of AI models
Starred by
+26
Created 3 years ago
Updated 19 hours ago
agno
by
agno-agi
0.4%
40k
Lightweight library for building AI Agents with memory, knowledge, and reasoning
Starred by
+9
Created 4 years ago
Updated 11 hours ago
github-mcp-server
by
github
0.6%
29k
MCP server for GitHub API automation and interaction
Starred by
Created 1 year ago
Updated 16 hours ago
loong
by
camel-ai
0%
502
Synthetic data generation project using LLM agents
Created 1 year ago
Updated 1 week ago
SWE-Gym
by
SWE-Gym
0%
671
Environment for training software engineering agents
Starred by
+2
Created 1 year ago
Updated 9 months ago
GamingAgent
by
lmgame-org
1.0%
921
SDK for LLM/VLM gaming agents, enabling model evaluation via games
Starred by
Created 1 year ago
Updated 5 months ago
LLaDA
by
ML-GSAI
0.4%
4k
LLM research paper exploring masked diffusion language models
Starred by
Created 1 year ago
Updated 5 months ago
recurrent-pretraining
by
seal-rg
0.3%
875
Pretraining code for depth-recurrent language model research
Starred by
Created 1 year ago
Updated 4 months ago
TransArch
by
MuLabPKU
0%
436
Post-training method that converts GQA-based LLMs to MLA models
Created 1 year ago
Updated 3 weeks ago
MLGym
by
facebookresearch
0.2%
596
Gym environment for ML research agents
Starred by
Created 1 year ago
Updated 8 months ago
native-sparse-attention-triton
by
XunhaoLai
0%
275
Efficient sparse attention for LLMs
Created 1 year ago
Updated 11 months ago
coconut
by
facebookresearch
0.4%
2k
Research paper implementation for LLM reasoning in latent space
Starred by
Created 1 year ago
Updated 2 weeks ago
native-sparse-attention-pytorch
by
lucidrains
0.3%
798
Sparse attention implementation from DeepSeek's research paper
Created 1 year ago
Updated 8 months ago
ReasonFlux
by
Gen-Verse
0%
532
LLM post-training algorithms for data selection, RL, and inference
Created 1 year ago
Updated 7 months ago
LIMO
by
GAIR-NLP
0.1%
1k
Reasoning model using less data
Starred by
Created 1 year ago
Updated 9 months ago
s1
by
simplescaling
0.0%
7k
Test-time scaling recipe for strong reasoning performance
Starred by
+8
Created 1 year ago
Updated 10 months ago
reasoning-gym
by
open-thought
0.2%
1k
Procedural dataset generator for reasoning models
Starred by
+6
Created 1 year ago
Updated 1 week ago
curator
by
bespokelabsai
0.3%
2k
Synthetic data curation tool for post-training and structured data extraction
Starred by
+1
Created 1 year ago
Updated 1 week ago
RAGEN
by
mll-lab-nu
0.4%
3k
Train LLM agents with reinforcement learning in interactive environments
Starred by
Created 1 year ago
Updated 2 weeks ago
SkyThought
by
NovaSky-AI
0.1%
3k
Training recipes for Sky-T1 family of models
Starred by
+5
Created 1 year ago
Updated 9 months ago
search-and-learn
by
huggingface
0.1%
1k
Recipes to scale inference-time compute of open models
Starred by
+1
Created 1 year ago
Updated 3 weeks ago
buffer-of-thought-llm
by
YangLing0818
0%
676
Research paper implementation for thought-augmented LLM reasoning
Created 1 year ago
Updated 10 months ago
HuatuoGPT-o1
by
FreedomIntelligence
0.4%
1k
Medical LLM for advanced reasoning
Created 1 year ago
Updated 1 year ago
LayerSkip
by
facebookresearch
0.3%
368
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding" research paper
Created 2 years ago
Updated 2 weeks ago
markitdown
by
microsoft
3.2%
118k
Python tool for converting files to Markdown for LLM text analysis
Starred by
+18
Created 1 year ago
Updated 1 week ago
NeMo-Aligner
by
NVIDIA
0%
853
Toolkit for efficient model alignment
Starred by
+1
Created 2 years ago
Updated 6 months ago
agency-swarm
by
VRSEN
0.5%
4k
Agentic framework built on OpenAI Assistants API for automating AI workflows
Starred by
Created 2 years ago
Updated 3 days ago
instructlab
by
instructlab
0%
1k
CLI tool for LLM alignment tuning via synthetic data
Starred by
Created 2 years ago
Updated 4 weeks ago
flash-linear-attention
by
fla-org
1.2%
5k
Efficient Torch/Triton implementations for linear attention models
Starred by
+8
Created 2 years ago
Updated 19 hours ago
TokenFormer
by
Haiyang-W
0%
588
Research paper on a fully attention-based neural network with tokenized model parameters
Created 1 year ago
Updated 1 year ago
evaluation-guidebook
by
huggingface
0.3%
2k
LLM evaluation guide for practitioners
Starred by
+3
Created 1 year ago
Updated 4 months ago
dynasaur
by
adobe-research
0%
358
LLM agent framework using dynamic action creation via Python code generation
Starred by
Created 1 year ago
Updated 1 year ago
Marco-o1
by
AIDC-AI
0.1%
2k
Open reasoning model for real-world problem solving
Created 1 year ago
Updated 2 months ago
SageAttention
by
thu-ml
0.3%
3k
Attention kernel for plug-and-play inference acceleration
Starred by
Created 1 year ago
Updated 3 months ago
metaflow
by
Netflix
0.1%
10k
Framework for building and managing AI/ML systems
Starred by
+10
Created 6 years ago
Updated 6 hours ago
Muon
by
KellerJordan
1.3%
3k
Optimizer for neural network hidden layers
Starred by
Created 1 year ago
Updated 3 months ago
MathBlackBox
by
trotsky1997
0%
1k
Research paper for mathematical reasoning via LLMs
Starred by
+1
Created 1 year ago
Updated 1 year ago
BitNet
by
microsoft
0.4%
39k
Inference framework for 1-bit LLMs
Starred by
+8
Created 1 year ago
Updated 1 month ago
Aria
by
rhymes-ai
0.1%
1k
Multimodal MoE model for video, document understanding, and dialog
Starred by
Created 1 year ago
Updated 1 year ago
Hands-On-Large-Language-Models
by
HandsOnLLM
0.5%
25k
Code examples for "Hands-On Large Language Models" book
Starred by
Created 1 year ago
Updated 4 days ago
modded-nanogpt
by
KellerJordan
0.6%
5k
Language model training speedrun on 8x H100 GPUs
Starred by
+8
Created 1 year ago
Updated 1 day ago
ogx
by
ogx-ai
0.1%
8k
Composable building blocks for Llama apps
Starred by
+7
Created 1 year ago
Updated 22 hours ago
Adam-mini
by
zyushun
0%
457
PyTorch implementation of Adam-mini optimizer from a research paper
Starred by
Created 1 year ago
Updated 11 months ago
optillm
by
algorithmicsuperintelligence
0.1%
3k
Optimizing inference proxy for LLMs
Starred by
+8
Created 1 year ago
Updated 1 month ago
LLM-Blender
by
yuchenlin
0.2%
980
LLM ensembling framework using pairwise ranking and generative fusion
Starred by
+3
Created 2 years ago
Updated 1 year ago
EvolKit
by
arcee-ai
0.4%
259
LLM instruction enhancement framework
Starred by
Created 1 year ago
Updated 1 year ago
LLMs-Planning
by
karthikv792
0.4%
462
Benchmark for evaluating LLMs on planning tasks
Created 3 years ago
Updated 7 months ago
rStar
by
zhentingqi
0%
970
Research paper for improving small LLM reasoning via mutual reasoning
Starred by
Created 1 year ago
Updated 1 year ago
distributed-training-guide
by
LambdaLabsML
0%
608
PyTorch guide for distributed training of large language models
Starred by
Created 1 year ago
Updated 6 months ago
nyuntam
by
nyunAI
0%
665
CLI tool for LLM compression via pruning, quantization, and distillation
Created 1 year ago
Updated 1 year ago
distillm
by
jongwooko
0.4%
258
Streamlined LLM distillation for efficient model training
Starred by
Created 2 years ago
Updated 1 year ago
MisguidedAttention
by
cpldcpu
0.2%
472
LLM reasoning benchmark for evaluating responses to misleading prompts
Starred by
Created 1 year ago
Updated 9 months ago
Open-Reasoning-Tasks
by
NousResearch
0.2%
482
Reasoning tasks collection for LLMs
Starred by
+3
Created 1 year ago
Updated 1 year ago
long-context-attention
by
feifeibear
0.3%
666
Unified sequence parallel attention for long context LLM training/inference
Starred by
Created 2 years ago
Updated 3 months ago
DistillKit
by
arcee-ai
0.2%
931
Open-source toolkit for LLM distillation research
Starred by
Created 1 year ago
Updated 1 month ago
do-not-answer
by
Libr-AI
0%
323
Dataset for evaluating LLM safety mechanisms
Starred by
Created 2 years ago
Updated 1 year ago
fms-fsdp
by
foundation-model-stack
0.3%
285
Efficiently train foundation models with PyTorch
Starred by
Created 2 years ago
Updated 5 months ago
OLMo
by
allenai
0.1%
6k
Open language model code for training, evaluation, and inference
Starred by
+4
Created 3 years ago
Updated 5 months ago
BAdam
by
Ledzy
0%
285
Memory-efficient optimizer for large language model finetuning
Starred by
Created 2 years ago
Updated 1 year ago
snowflake-arctic
by
Snowflake-Labs
0%
561
AI research project for efficient LLM training and inference
Starred by
Created 2 years ago
Updated 1 year ago
open-instruct
by
allenai
0.2%
4k
Training codebase for instruction-following language models
Starred by
+10
Created 2 years ago
Updated 11 hours ago
mdistiller
by
megvii-research
0.1%
899
PyTorch library for knowledge distillation research
Created 4 years ago
Updated 2 years ago
augmentoolkit
by
e-p-armstrong
0.1%
2k
Data toolkit for custom LLM creation using open-source AI
Starred by
+3
Created 2 years ago
Updated 3 days ago
awesome-synthetic-datasets
by
davanstrien
0%
330
Curated list of synthetic text/vision datasets and generation tools
Created 2 years ago
Updated 3 months ago
calm
by
zeux
0.3%
637
Single-GPU inference engine for rapid LLM prototyping
Starred by
Created 2 years ago
Updated 11 months ago
MobileLLM
by
facebookresearch
0.3%
1k
Sub-billion parameter LLM training code for on-device use
Starred by
+2
Created 1 year ago
Updated 1 year ago
phoenix
by
Arize-ai
0.8%
9k
AI observability platform for experimentation, evaluation, and troubleshooting
Starred by
+6
Created 3 years ago
Updated 10 hours ago
SPPO
by
uclaml
0%
587
Self-Play Preference Optimization (SPPO) aligns language models via self-play
Starred by
Created 1 year ago
Updated 1 year ago
WildBench
by
allenai
0%
251
Benchmarking LLMs with challenging real-user tasks
Starred by
Created 2 years ago
Updated 1 year ago
AutoIF
by
QwenLM
0%
330
Research paper for improving LLM instruction-following via self-play with execution feedback
Starred by
Created 1 year ago
Updated 1 year ago
refusal_direction
by
andyrdt
0%
379
Research paper code for analyzing refusal in language models
Starred by
Created 1 year ago
Updated 10 months ago
YaFSDP
by
yandex
0.1%
986
Sharded data parallelism framework for transformer-like neural networks
Starred by
Created 1 year ago
Updated 1 week ago
chat_templates
by
chujiezheng
0%
717
Chat templates for HuggingFace LLMs
Starred by
Created 2 years ago
Updated 1 year ago
LESS
by
princeton-nlp
0.6%
526
Data selection research paper for targeted instruction tuning
Starred by
Created 2 years ago
Updated 1 year ago
MixEval
by
JinjieNi
0%
256
Dynamic LLM evaluation suite for accurate, cost-effective benchmarking
Starred by
Created 1 year ago
Updated 1 year ago
MoRA
by
kongds
0.3%
362
Parameter-efficient fine-tuning via high-rank updating (MoRA)
Starred by
Created 1 year ago
Updated 1 year ago
SimPO
by
princeton-nlp
0.3%
954
Preference optimization algorithm for LLMs (NeurIPS 2024 paper)
Starred by
Created 1 year ago
Updated 1 year ago
qodo-cover
by
qodo-ai
0.1%
5k
CLI tool for AI-powered test generation and code coverage enhancement
Starred by
Created 1 year ago
Updated 3 weeks ago
mlx-bitnet
by
exo-explore
0.7%
273
Efficient LLM inference on Apple Silicon
Starred by
Created 2 years ago
Updated 1 year ago
gemma-2B-10M
by
mustafaaljadery
0%
936
Gemma 2B with 10M context length using Infini-attention
Starred by
Created 2 years ago
Updated 1 year ago
xtuner
by
InternLM
0.1%
5k
LLM fine-tuning toolkit for research
Starred by
+2
Created 2 years ago
Updated 4 hours ago
GLiNER
by
urchade
0.9%
3k
NER model for identifying any entity type using bidirectional transformer
Starred by
Created 2 years ago
Updated 2 days ago
contriever
by
facebookresearch
0%
777
Unsupervised dense information retrieval via contrastive learning
Starred by
Created 4 years ago
Updated 3 years ago
prometheus-eval
by
prometheus-eval
0.2%
1k
LLM evaluation framework using open LLMs
Starred by
Created 2 years ago
Updated 1 year ago
LLMTest_NeedleInAHaystack
by
gkamradt
0.4%
2k
LLM testing tool for evaluating in-context retrieval accuracy
Starred by
+3
Created 2 years ago
Updated 1 year ago
selfcodealign
by
bigcode-project
0%
323
Research paper for self-alignment in code generation
Starred by
Created 2 years ago
Updated 1 year ago
llm-datasets
by
mlabonne
1.0%
4k
Curated datasets/tools for LLM post-training
Starred by
+1
Created 2 years ago
Updated 1 day ago
rerope
by
bojone
0%
395
Position embeddings research paper
Starred by
Created 2 years ago
Updated 1 year ago
LaVague
by
lavague-ai
0.1%
6k
Web agent framework for automating web processes
Starred by
+7
Created 2 years ago
Updated 1 year ago
ring-flash-attention
by
zhuzilin
0%
1k
FlashAttention extension for ring attention
Starred by
+2
Created 2 years ago
Updated 7 months ago
llamaduo
by
deep-diver
0%
318
LLMOps pipeline for fine-tuning small LLMs as a fallback for service LLM outages
Starred by
Created 2 years ago
Updated 9 months ago
cohere-toolkit
by
cohere-ai
0.1%
3k
RAG toolkit for LLM application development and deployment
Starred by
+4
Created 2 years ago
Updated 3 weeks ago
uptrain
by
uptrain-ai
0%
2k
Open-source platform to evaluate and improve GenAI apps
Starred by
+5
Created 3 years ago
Updated 1 year ago
BitBLAS
by
microsoft
0.4%
762
Library for mixed-precision matrix multiplications, targeting quantized LLM deployment
Created 2 years ago
Updated 8 months ago
arena-hard-auto
by
lmarena
0.2%
1k
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 2 years ago
Updated 10 months ago
ChunkLlama
by
HKUNLP
0%
450
Training-free method for extending LLM context windows
Created 2 years ago
Updated 1 year ago
dstack
by
dstackai
0.4%
2k
Open-source tool for simplifying GPU allocation and AI workload orchestration
Starred by
+3
Created 4 years ago
Updated 4 hours ago
rho
by
microsoft
0%
465
LLM pretraining research paper using selective language modeling (SLM)
Starred by
Created 2 years ago
Updated 2 years ago
dify
by
langgenius
0.5%
139k
Open-source LLM app development platform
Starred by
+17
Created 3 years ago
Updated 4 hours ago
attorch
by
BobMcDear
0%
600
PyTorch nn module subset, implemented in Python using Triton
Starred by
+2
Created 2 years ago
Updated 8 months ago
mixtral-offloading
by
dvmazur
0%
2k
Inference optimization for Mixtral-8x7B models
Starred by
Created 2 years ago
Updated 2 years ago
auto-code-rover
by
AutoCodeRoverSG
0.1%
3k
Autonomous software engineer for program improvement
Starred by
+3
Created 2 years ago
Updated 1 year ago
BitNet-Transformers
by
Beomi
0%
315
HuggingFace Transformers implementation of BitNet scaling for LLMs
Created 2 years ago
Updated 2 years ago
EasyContext
by
jzhang38
0.1%
757
Recipes for language model context length extrapolation to 1M tokens
Starred by
+2
Created 2 years ago
Updated 1 year ago
pyreft
by
stanfordnlp
0.1%
2k
Python library for representation finetuning (ReFT) of language models
Starred by
Created 2 years ago
Updated 1 month ago
hlb-gpt
by
tysam-code
0.3%
356
Researcher's toolbench for GPT model exploration
Starred by
Created 3 years ago
Updated 1 year ago
aideml
by
WecoAI
0.9%
1k
ML engineering agent for automated AI R&D, surpassing human experts
Starred by
Created 2 years ago
Updated 6 days ago
BitNet
by
kyegomez
0.1%
2k
PyTorch implementation of BitNet research paper
Starred by
Created 2 years ago
Updated 1 day ago
horovod
by
horovod
0.0%
15k
Distributed training framework for TF, Keras, PyTorch, and MXNet
Starred by
+19
Created 8 years ago
Updated 4 months ago
dataverse
by
UpstageAI
0%
564
ETL pipeline for LLM data processing
Starred by
Created 2 years ago
Updated 1 year ago
hqq
by
dropbox
0.1%
929
Model quantizer for fast, accurate post-training quantization, skipping calibration
Starred by
Created 2 years ago
Updated 2 months ago
Triton-Puzzles
by
gpu-mode
0.4%
2k
Interactive puzzles for learning Triton
Starred by
Created 2 years ago
Updated 3 weeks ago
repeng
by
vgel
0.4%
722
Python library for representation engineering control vectors
Starred by
Created 2 years ago
Updated 7 months ago
cobra
by
OpenHelix-Team
0%
294
Multimodal LLM research paper extending Mamba for efficient inference
Created 2 years ago
Updated 1 year ago
hackathon
by
mistralai-sf24
0%
446
Minimal code for running and finetuning a 7B transformer model
Starred by
Created 2 years ago
Updated 2 years ago
raft
by
rapidsai
0.6%
1k
CUDA-accelerated primitives for ML/data mining algorithms
Starred by
Created 6 years ago
Updated 11 hours ago
maestro
by
Doriandarko
0.1%
4k
Framework for Claude Opus to orchestrate subagents
Starred by
Created 2 years ago
Updated 1 year ago
quiet-star
by
ezelikman
0%
740
Research code for self-teaching language models
Starred by
Created 2 years ago
Updated 1 year ago
ml-engineering
by
stas00
0.3%
18k
Open book for LLM/VLM training engineers
Starred by
+18
Created 5 years ago
Updated 1 month ago
chatbot-ui
by
mckaywrigley
0.0%
33k
Open-source AI chat app
Starred by
+14
Created 3 years ago
Updated 1 year ago
orpo
by
xfactlab
0.4%
483
Preference optimization without a reference model
Starred by
Created 2 years ago
Updated 1 year ago
SWE-bench
by
SWE-bench
1.1%
5k
Benchmark for evaluating LLMs on real-world GitHub issues
Starred by
+12
Created 2 years ago
Updated 3 weeks ago
OpenHands
by
OpenHands
0.8%
72k
AI platform for software development agents
Starred by
+36
Created 2 years ago
Updated 6 hours ago
FastV
by
pkunlp-icler
0%
574
Inference acceleration for large vision-language models (research paper)
Created 2 years ago
Updated 1 year ago
airllm
by
lyogavin
1.3%
17k
Inference optimization for LLMs on low-resource hardware
Starred by
Created 2 years ago
Updated 1 month ago
daytona
by
daytonaio
0.2%
72k
Infrastructure for running AI-generated code
Starred by
+5
Created 2 years ago
Updated 3 hours ago
VisionLLaMA
by
Meituan-AutoML
0%
392
Vision transformer research paper
Created 2 years ago
Updated 1 year ago
fsdp_qlora
by
AnswerDotAI
0.1%
2k
Training script for LLMs using QLoRA + FSDP
Starred by
+3
Created 2 years ago
Updated 1 year ago
h2o-llmstudio
by
h2oai
0.1%
5k
LLM Studio: framework for LLM fine-tuning via GUI or CLI
Starred by
+5
Created 3 years ago
Updated 3 days ago
ChatMusician
by
hf-lin
0.7%
308
LLM for music understanding and generation
Created 2 years ago
Updated 2 years ago
AnyGPT
by
OpenMOSS
0.1%
879
Multimodal LLM research paper for any-to-any modality conversion
Starred by
Created 2 years ago
Updated 1 year ago
FlagEmbedding
by
FlagOpen
0.3%
12k
Toolkit for retrieval and RAG applications
Starred by
+8
Created 2 years ago
Updated 5 days ago
self-rewarding-lm-pytorch
by
lucidrains
0%
1k
Training framework for self-rewarding language models
Starred by
+4
Created 2 years ago
Updated 2 years ago
crewAI
by
crewAIInc
1.4%
50k
Framework for autonomous AI agent orchestration via role-playing and collaboration
Starred by
+18
Created 2 years ago
Updated 5 hours ago
resource-stream
by
gpu-mode
0.4%
2k
CUDA resource collection for GPU programming
Starred by
Created 2 years ago
Updated 1 month ago
metal-flash-attention
by
philipturner
0%
599
Metal port of FlashAttention for Apple silicon
Starred by
+2
Created 2 years ago
Updated 1 year ago
LLMs-from-scratch
by
rasbt
0.5%
92k
Educational resource for LLM construction in PyTorch
Starred by
+11
Created 2 years ago
Updated 1 week ago
mlx-examples
by
ml-explore
0.3%
9k
Examples using the MLX framework
Starred by
+7
Created 2 years ago
Updated 3 weeks ago
ai-codereviewer
by
villesau
0.1%
1k
GitHub Action for AI-powered code review
Starred by
Created 3 years ago
Updated 1 year ago
deita
by
hkust-nlp
0%
594
Data-efficient instruction tuning for LLM alignment (ICLR 2024)
Starred by
Created 2 years ago
Updated 1 year ago
AutoAWQ
by
casper-hansen
0.2%
2k
Tool for 4-bit quantized LLM inference
Starred by
+5
Created 2 years ago
Updated 11 months ago
ProxyAI
by
carlrobertoh
0.3%
2k
JetBrains IDE copilot for coding assistance
Starred by
Created 3 years ago
Updated 1 week ago
EAGLE
by
SafeAILab
0.6%
2k
Speculative decoding research paper for faster LLM inference
Starred by
+5
Created 2 years ago
Updated 2 months ago
HALOs
by
ContextualAI
0.1%
904
Library for aligning LLMs using human-aware loss functions
Starred by
Created 2 years ago
Updated 7 months ago
mamba
by
state-spaces
0.3%
18k
Mamba SSM architecture for sequence modeling
Starred by
+22
Created 2 years ago
Updated 1 day ago
modelz-llm
by
tensorchord
0%
277
Inference server for open-source LLMs, offering an OpenAI-compatible API
Created 2 years ago
Updated 2 years ago
unsloth
by
unslothai
1.3%
63k
Finetuning tool for LLMs, targeting speed and memory efficiency
Starred by
+38
Created 2 years ago
Updated 4 hours ago
gpt-researcher
by
assafelovic
0.6%
27k
Autonomous agent for web/local research, generating cited reports
Starred by
+9
Created 3 years ago
Updated 1 week ago
functionary
by
MeetKai
0%
2k
Chat language model for tool use and result interpretation
Starred by
+2
Created 2 years ago
Updated 4 months ago
Logic-LLM
by
teacherpeterpan
0%
393
Logic-LM: Framework for improved logical reasoning via LLMs and symbolic solvers
Created 2 years ago
Updated 1 year ago
LLMSurvey
by
RUCAIBox
0.1%
12k
Survey paper for large language models
Starred by
+2
Created 3 years ago
Updated 1 year ago
distilabel
by
argilla-io
0.3%
3k
Framework for synthetic data and AI feedback pipelines
Starred by
+12
Created 2 years ago
Updated 14 hours ago
long-llms-learning
by
Strivin0311
0%
274
Literature repository for long-context LLM methodologies
Starred by
Created 2 years ago
Updated 1 year ago
MergeLM
by
yule-BUAA
0.2%
865
Codebase for merging language models via parameter averaging
Starred by
Created 2 years ago
Updated 2 years ago
Video-LLaVA
by
PKU-YuanGroup
0.1%
3k
Video-LLaVA: Multimodal model for video/image understanding via LLM
Starred by
Created 2 years ago
Updated 1 year ago
medAlpaca
by
kbressem
0.2%
559
LLM finetuned for medical question answering
Starred by
Created 3 years ago
Updated 2 years ago
intel-extension-for-transformers
by
intel
0%
2k
Transformer toolkit for GenAI/LLM acceleration on Intel platforms
Starred by
Created 3 years ago
Updated 1 year ago
representation-engineering
by
andyzoujm
0.3%
988
AI transparency via representation engineering
Starred by
Created 2 years ago
Updated 1 year ago
multimodal
by
facebookresearch
0%
2k
PyTorch library for multimodal multi-task model training
Starred by
+1
Created 4 years ago
Updated 21 hours ago
S-LoRA
by
S-LoRA
0%
2k
System for scalable LoRA adapter serving
Starred by
+1
Created 2 years ago
Updated 2 years ago
DeepSpeed
by
deepspeedai
0.1%
42k
Deep learning optimization library for distributed training and inference
Starred by
+36
Created 6 years ago
Updated 4 days ago
continue
by
continuedev
0.4%
33k
IDE extension for custom AI code assistants
Starred by
+16
Created 2 years ago
Updated 1 day ago
llama-cookbook
by
meta-llama
0.1%
18k
Guide for building with Llama models
Starred by
+15
Created 2 years ago
Updated 6 days ago
finetuner
by
jina-ai
0%
2k
Cloud tool for task-oriented embedding finetuning of models like BERT and CLIP
Starred by
+3
Created 4 years ago
Updated 2 years ago
ludwig
by
ludwig-ai
0.1%
12k
Low-code framework for custom AI models (LLMs, neural networks)
Starred by
+17
Created 7 years ago
Updated 4 hours ago
img2dataset
by
rom1504
0.0%
4k
CLI tool for creating large image datasets from URLs
Starred by
+12
Created 4 years ago
Updated 6 months ago
distilling-step-by-step
by
google-research
0.3%
590
Code for research paper on knowledge distillation
Starred by
Created 2 years ago
Updated 2 years ago
Cherry_LLM
by
tianyi-lab
0%
413
Research paper for LLM instruction tuning via self-guided data selection
Created 2 years ago
Updated 10 months ago
Reflection_Tuning
by
tianyi-lab
0%
366
Research paper for LLM instruction tuning via data recycling
Starred by
Created 2 years ago
Updated 1 year ago
instructor
by
567-labs
0.3%
13k
SDK for structured LLM outputs using Pydantic models
Starred by
+27
Created 2 years ago
Updated 5 days ago
YiVal
by
YiVal
0.1%
2k
Prompt engineering assistant for GenAI apps
Starred by
Created 2 years ago
Updated 2 years ago
LLM-Shearing
by
princeton-nlp
0%
643
Code for LLM pre-training acceleration via structured pruning (ICLR 2024)
Starred by
+1
Created 2 years ago
Updated 2 years ago
letta
by
letta-ai
0.6%
22k
Agent framework for stateful agents with memory, reasoning, and context management
Starred by
+19
Created 2 years ago
Updated 2 weeks ago
CogVLM
by
zai-org
0.0%
7k
VLM for image understanding and multi-turn dialogue
Starred by
+4
Created 2 years ago
Updated 1 year ago
ragas
by
vibrantlabsai
0.8%
14k
Toolkit for LLM application evaluation
Starred by
+12
Created 3 years ago
Updated 2 months ago
NEFTune
by
neelsjain
0%
412
Technique to improve instruction finetuning of LLMs
Starred by
Created 2 years ago
Updated 1 year ago
FireAct
by
anchen1011
0%
292
Language agent fine-tuning research paper
Starred by
Created 2 years ago
Updated 2 years ago
LLaVA
by
haotian-liu
0.1%
25k
Multimodal assistant with GPT-4 level capabilities
Starred by
+16
Created 3 years ago
Updated 1 year ago
alignment-handbook
by
huggingface
0.2%
6k
Handbook for aligning language models with human/AI preferences
Starred by
+11
Created 2 years ago
Updated 2 weeks ago
autolabel
by
refuel-ai
0.0%
2k
Python library to label text datasets using LLMs
Starred by
+4
Created 3 years ago
Updated 1 year ago
EmpatheticDialogues
by
facebookresearch
0%
546
PyTorch code for empathetic dialogue research
Starred by
Created 6 years ago
Updated 4 years ago
world-models
by
wesg52
0%
260
Research paper code for extracting spatial/temporal world models from LLMs
Starred by
Created 2 years ago
Updated 2 years ago
OpenGPT
by
CogStack
0.3%
362
Framework for grounded instruction datasets and domain-expert LLMs
Starred by
Created 3 years ago
Updated 2 years ago
Medusa
by
FasterDecoding
0.1%
3k
Framework for accelerating LLM generation using multiple decoding heads
Starred by
+6
Created 2 years ago
Updated 1 year ago
open_flamingo
by
mlfoundations
0.0%
4k
Open-source framework for training large multimodal models
Starred by
+7
Created 3 years ago
Updated 1 year ago
textbook_quality
by
VikParuchuri
0%
508
Synthetic data generator for LLM pretraining
Starred by
Created 2 years ago
Updated 2 years ago
tree-of-thought-llm
by
princeton-nlp
0.2%
6k
Research paper implementation for Tree of Thoughts (ToT) prompting
Starred by
+7
Created 2 years ago
Updated 1 year ago
LongLoRA
by
JIA-Lab-research
0%
3k
LongLoRA: Efficient fine-tuning for long-context LLMs
Starred by
+1
Created 2 years ago
Updated 1 year ago
kani
by
zhudotexe
0%
599
Microframework for chat-based language models with tool use/function calling
Starred by
Created 2 years ago
Updated 1 month ago
DoLa
by
voidism
0.5%
551
Decoding strategy research paper for improving factuality in LLMs
Starred by
Created 2 years ago
Updated 1 year ago
varuna
by
microsoft
0%
251
Tool for efficient large DNN model training on commodity hardware
Starred by
Created 4 years ago
Updated 1 year ago
BLoRA
by
sabetAI
0%
351
Inference optimization for batched LoRA adapters
Starred by
Created 2 years ago
Updated 2 years ago
TinyLlama
by
jzhang38
0.1%
9k
Tiny pretraining project for a 1.1B Llama model
Starred by
+18
Created 2 years ago
Updated 2 years ago
sparsegpt
by
IST-DASLab
0%
879
Code for massive language model one-shot pruning (ICML 2023 paper)
Starred by
Created 3 years ago
Updated 1 year ago
LLM-Pruner
by
horseee
0.1%
1k
LLM structural pruner for model compression
Created 2 years ago
Updated 1 year ago
graph-of-thoughts
by
spcl
0.2%
3k
Graph-of-Thoughts: LLM framework for complex problem-solving
Starred by
+1
Created 2 years ago
Updated 1 month ago
tensor_parallel
by
BlackSamorez
0%
654
PyTorch module for multi-GPU model parallelism
Starred by
Created 3 years ago
Updated 2 years ago
relora
by
Guitaricet
0.2%
474
PEFT pretraining code for ReLoRA research paper
Starred by
Created 3 years ago
Updated 2 years ago
wandbot
by
wandb
0%
309
Support bot for Weights & Biases' AI tools, running in Discord, Slack, ChatGPT, and Zendesk
Starred by
Created 3 years ago
Updated 2 months ago
LightLLM
by
ModelTC
0.3%
4k
Python framework for LLM inference and serving
Starred by
+6
Created 2 years ago
Updated 4 hours ago
lmdeploy
by
InternLM
0.3%
8k
Toolkit for LLM compression, deployment, and serving
Starred by
+8
Created 2 years ago
Updated 8 hours ago
llama-chat
by
replicate
0%
835
Next.js app for Llama 3 chat UI development
Created 2 years ago
Updated 4 months ago
llama2-chatbot
by
a16z-infra
0.1%
1k
Streamlit chatbot app for interacting with LLMs
Starred by
Created 2 years ago
Updated 2 years ago
IncognitoPilot
by
silvanmelchior
0%
440
AI code interpreter for local data processing, like ChatGPT Code Interpreter
Created 2 years ago
Updated 2 years ago
ai-town
by
a16z-infra
0.4%
10k
AI town starter kit for building a virtual world
Starred by
+12
Created 2 years ago
Updated 1 month ago
octopack
by
bigcode-project
0.2%
479
Code LLM instruction tuning research paper
Starred by
+2
Created 3 years ago
Updated 1 year ago
outlines
by
dottxt-ai
0.3%
14k
SDK for structured LLM text generation
Starred by
+34
Created 3 years ago
Updated 1 week ago
bubogpt
by
magic-research
0%
511
Multi-modal LLM for joint text, vision, and audio understanding
Created 2 years ago
Updated 2 years ago
MetaGPT
by
FoundationAgents
0.3%
67k
Multi-agent framework for collaborative AI software development
Starred by
+10
Created 2 years ago
Updated 3 months ago
pykoi
by
CambioML
0%
411
Python library for reinforcement learning with human feedback (RLHF)
Starred by
Created 2 years ago
Updated 7 months ago
ChainFury
by
NimbleBoxAI
0%
451
Open-source chaining engine for production AI apps
Starred by
Created 3 years ago
Updated 2 years ago
candle
by
huggingface
0.3%
20k
Minimalist ML framework for Rust, emphasizing performance and ease of use
Starred by
+23
Created 2 years ago
Updated 22 hours ago
Megatron-LLM
by
epfLLM
0%
589
Distributed trainer for LLMs
Starred by
Created 2 years ago
Updated 1 year ago
ToolBench
by
OpenBMB
0.2%
6k
Open platform for LLM tool learning (ICLR'24 spotlight)
Starred by
+6
Created 2 years ago
Updated 11 months ago
gpt-engineer
by
AntonOsika
0.0%
55k
CLI platform for code generation experimentation
Starred by
+17
Created 3 years ago
Updated 11 months ago
RRHF
by
GanjinZero
0%
808
RRHF for aligning LLMs to human preferences
Starred by
Created 3 years ago
Updated 2 years ago
LlamaFactory
by
hiyouga
0.4%
71k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Starred by
+25
Created 2 years ago
Updated 9 hours ago
exllama
by
turboderp
0.1%
3k
Llama implementation for memory-efficient quantized weights
Starred by
+6
Created 3 years ago
Updated 2 years ago
doremi
by
sangmichaelxie
0.3%
352
PyTorch code for optimizing data mixtures in language model datasets
Starred by
Created 2 years ago
Updated 2 years ago
UltraChat
by
thunlp
0.3%
3k
Multi-round dialogue dataset and models for chat language model training
Starred by
Created 3 years ago
Updated 2 years ago
RealChar
by
Shaunwei
0.1%
6k
Real-time AI character/companion creation and interaction codebase
Starred by
+3
Created 2 years ago
Updated 3 months ago
serve
by
jina-ai
0.0%
22k
Framework for building cloud-native multimodal AI apps
Starred by
+17
Created 6 years ago
Updated 1 year ago
aider
by
Aider-AI
0.8%
44k
AI pair programming in your terminal
Starred by
+38
Created 3 years ago
Updated 2 days ago
LMFlow
by
OptimalScale
0.1%
8k
Toolkit for finetuning and inference of large foundation models
Starred by
+9
Created 3 years ago
Updated 3 days ago
baize-chatbot
by
project-baize
0%
3k
Chat model trained via LoRA, using ChatGPT-generated dialogs
Starred by
+3
Created 3 years ago
Updated 2 years ago
ToolQA
by
night-chen
0%
285
Dataset for evaluating LLMs using external tools
Created 2 years ago
Updated 2 years ago
SuperAGI
by
TransformerOptimus
0.1%
17k
Open-source framework for autonomous AI agent development
Starred by
+4
Created 3 years ago
Updated 1 year ago
audiocraft
by
facebookresearch
0.1%
23k
PyTorch library for audio processing and generation research
Starred by
+15
Created 2 years ago
Updated 1 month ago
guidance
by
guidance-ai
0.1%
21k
Guidance is a programming paradigm for steering LLMs
Starred by
+38
Created 3 years ago
Updated 2 weeks ago
open_llama
by
openlm-research
0.0%
8k
Open-source reproduction of LLaMA models
Starred by
+14
Created 3 years ago
Updated 2 years ago
RL4LMs
by
allenai
0.0%
2k
RL library to fine-tune language models to human preferences
Starred by
+3
Created 3 years ago
Updated 2 years ago
SwiftSage
by
SwiftSage
0.3%
326
Agent system for reasoning with LLMs via in-context reinforcement learning
Created 2 years ago
Updated 1 year ago
ctransformers
by
marella
0.1%
2k
Python bindings for fast Transformer model inference
Starred by
+8
Created 2 years ago
Updated 2 years ago
developer
by
smol-ai
0.0%
12k
Agent for embedding a developer in your app
Starred by
+27
Created 3 years ago
Updated 2 years ago
MeZO
by
princeton-nlp
0.1%
1k
Research paper implementation for memory-efficient LM fine-tuning
Starred by
Created 2 years ago
Updated 2 years ago
ImageBind
by
facebookresearch
0.0%
9k
PyTorch implementation for multimodal embeddings research paper
Starred by
+5
Created 3 years ago
Updated 5 months ago
xtreme1
by
xtreme1-io
0.1%
1k
Open-source platform for multimodal training data annotation
Starred by
Created 3 years ago
Updated 9 months ago
sudolang
by
paralleldrive
0.1%
1k
VS Code extension for LLM-based programming with SudoLang
Starred by
Created 3 years ago
Updated 3 months ago
poe-api
by
ading2210
0%
2k
Python API for Quora's Poe (unmaintained)
Created 3 years ago
Updated 2 years ago
Local-LLM-Comparison-Colab-UI
by
Troyanovsky
0.1%
1k
Local LLM comparison via Colab WebUI links
Starred by
Created 3 years ago
Updated 3 months ago
airoboros
by
jondurbin
0.1%
1k
Self-instruct tool for LLM finetuning
Starred by
+3
Created 3 years ago
Updated 2 years ago
PaLM
by
conceptofmind
0%
820
Open-source PaLM implementation for language model research
Starred by
Created 3 years ago
Updated 1 year ago
TruthfulQA
by
sylinrl
0%
908
Benchmark dataset for evaluating truthfulness of language models
Starred by
Created 4 years ago
Updated 1 year ago
private-gpt
by
zylon-ai
0.1%
57k
Private AI API for local document interaction using LLMs
Starred by
+13
Created 3 years ago
Updated 2 months ago
PMC-LLaMA
by
chaoyi-wu
0%
676
Medical LLM for instruction-following in the medical domain
Created 3 years ago
Updated 1 year ago
openlm
by
r2d4
0%
369
OpenAI-compatible Python client for calling LLMs
Starred by
+1
Created 3 years ago
Updated 2 years ago
FasterTransformer
by
NVIDIA
0.1%
6k
Optimized transformer library for inference
Starred by
+12
Created 5 years ago
Updated 2 years ago
unlimiformer
by
abertsch72
0%
1k
Research paper for long-range transformers with unlimited input
Starred by
+1
Created 3 years ago
Updated 2 years ago
gpt-neox
by
EleutherAI
0.0%
7k
Framework for training large-scale autoregressive language models
Starred by
+22
Created 5 years ago
Updated 2 weeks ago
toolformer
by
conceptofmind
0%
383
Open-source implementation of Toolformer research paper
Starred by
Created 3 years ago
Updated 3 years ago
bark
by
suno-ai
0.0%
39k
Generative audio model for realistic speech and sound effects
Starred by
+19
Created 3 years ago
Updated 1 year ago
chat-langchain
by
langchain-ai
0.1%
6k
Chatbot for question answering over LangChain documentation
Starred by
+3
Created 3 years ago
Updated 4 days ago
LaMini-LM
by
mbzuai-nlp
0.1%
822
Small, efficient language models distilled from ChatGPT for research
Starred by
Created 3 years ago
Updated 3 years ago
ChatRWKV
by
BlinkDL
0.0%
10k
Open-source chatbot powered by the RWKV RNN language model
Starred by
+4
Created 3 years ago
Updated 2 months ago
RWKV-LM
by
BlinkDL
0.1%
14k
RNN for LLM, transformer-level performance, parallelizable training
Starred by
+29
Created 4 years ago
Updated 4 hours ago
LocalAI
by
mudler
0.5%
46k
Open-source OpenAI alternative for local AI inference
Starred by
+15
Created 3 years ago
Updated 5 hours ago
WizardLM
by
nlpxucan
0.1%
9k
LLMs built using Evol-Instruct for complex instruction following
Starred by
+15
Created 3 years ago
Updated 10 months ago
chameleon-llm
by
lupantech
0%
1k
Research paper code for plug-and-play compositional reasoning with LLMs
Starred by
Created 3 years ago
Updated 2 years ago
llama-lab
by
run-llama
0%
2k
LlamaIndex projects for LLM data augmentation
Starred by
Created 3 years ago
Updated 2 years ago
EdgeGPT
by
acheong08
0%
8k
Reverse-engineered API for Microsoft Bing Chat (archived)
Starred by
Created 3 years ago
Updated 2 years ago
gisting
by
jayelm
0%
315
Research paper implementation for prompt compression via learned "gist" tokens
Starred by
Created 3 years ago
Updated 1 year ago
gpt-llama.cpp
by
keldenl
0%
595
API wrapper for local LLM inference, emulating OpenAI's GPT endpoints
Starred by
Created 3 years ago
Updated 2 years ago
memit
by
kmeng01
0%
544
Transformer memory mass-editor (ICLR 2023 research paper)
Starred by
Created 3 years ago
Updated 2 years ago
dl4math
by
lupantech
0%
373
DL4MATH: Deep learning resources for mathematical reasoning
Created 3 years ago
Updated 2 years ago
MiniGPT-4
by
Vision-CAIR
0.0%
26k
Vision-language model for multi-task learning
Starred by
+15
Created 3 years ago
Updated 1 year ago
auto-cot
by
amazon-science
0.1%
2k
Research paper implementation for automatic chain-of-thought prompting
Starred by
Created 3 years ago
Updated 2 years ago
OpenChatKit
by
togethercomputer
0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 3 years ago
Updated 2 years ago
PythonProgrammingPuzzles
by
microsoft
0%
997
Python puzzle dataset for AI programming proficiency research
Created 5 years ago
Updated 2 years ago
RedPajama-Data
by
togethercomputer
0.1%
5k
Dataset pipeline for training large language models
Starred by
+8
Created 3 years ago
Updated 1 year ago
unstructured
by
Unstructured-IO
0.3%
15k
ETL solution for structuring unstructured data for language models
Starred by
+12
Created 3 years ago
Updated 1 day ago
whisper
by
openai
0.4%
98k
Speech recognition model for multilingual transcription/translation
Starred by
+42
Created 3 years ago
Updated 1 week ago
LLaMA_MPS
by
jankais3r
0%
584
LLM inference on Apple Silicon GPUs
Starred by
Created 3 years ago
Updated 3 years ago
dolly
by
databrickslabs
0.0%
11k
Instruction-following LLM trained on the Databricks Machine Learning Platform
Starred by
+15
Created 3 years ago
Updated 2 years ago
minimal-llama
by
zphang
0%
456
Code for running and fine-tuning LLaMA models
Starred by
Created 3 years ago
Updated 2 years ago
zero_shot_cot
by
kojima-takeshi188
0%
442
Reasoning framework for LLMs, based on a NeurIPS 2022 paper
Starred by
Created 3 years ago
Updated 2 years ago
safari
by
HazyResearch
0.2%
912
Research paper implementations for sequence modeling with convolutions
Starred by
+2
Created 3 years ago
Updated 1 year ago
EasyLM
by
young-geng
0.0%
3k
LLM training/finetuning framework in JAX/Flax
Starred by
+9
Created 3 years ago
Updated 1 year ago
AlpacaDataCleaned
by
gururise
0.1%
2k
Cleaned dataset for Alpaca LLM training
Starred by
+4
Created 3 years ago
Updated 1 month ago
trl
by
huggingface
0.3%
18k
Library for transformer RL
Starred by
+28
Created 6 years ago
Updated 6 hours ago
ThoughtSource
by
OpenBioLink
0.1%
1k
Framework for chain-of-thought reasoning data and tools
Starred by
Created 3 years ago
Updated 1 year ago
GPT-4-LLM
by
Instruction-Tuning-with-GPT-4
0.0%
4k
GPT-4 data for instruction-tuning LLMs via supervised learning and RL
Starred by
+5
Created 3 years ago
Updated 2 years ago
lit-llama
by
Lightning-AI
0.0%
6k
LLaMA implementation for pretraining, finetuning, and inference
Starred by
+5
Created 3 years ago
Updated 10 months ago
AutoGPT
by
Significant-Gravitas
0.1%
184k
AI agent platform for building, deploying, and running autonomous workflows
Starred by
+56
Created 3 years ago
Updated 3 hours ago
LLaMA-Adapter
by
OpenGVLab
0.0%
6k
Efficient fine-tuning for instruction-following LLaMA models
Starred by
+3
Created 3 years ago
Updated 2 years ago
pygpt4all
by
nomic-ai
0%
1k
Python bindings for local LLM inference (deprecated)
Starred by
Created 3 years ago
Updated 3 years ago
chatllama
by
henrywoo
0%
1k
Open-source implementation for LLaMA-based ChatGPT, runnable on a single GPU
Created 3 years ago
Updated 1 year ago
optimate
by
nebuly-ai
0.0%
8k
Collection of libraries to optimize AI model performance
Starred by
+3
Created 4 years ago
Updated 1 year ago
GPTeacher
by
teknium1
0.1%
2k
GPT-4 generated datasets for instruction tuning
Starred by
+1
Created 3 years ago
Updated 2 years ago
chatgpt-universe
by
cedrickchee
0%
374
Collection of ChatGPT, GPT, and LLM resources
Created 3 years ago
Updated 1 year ago
langchain
by
langchain-ai
0.6%
135k
Framework for building LLM-powered applications
Starred by
+83
Created 3 years ago
Updated 17 hours ago
xTuring
by
stochasticai
0.0%
3k
SDK for fine-tuning and customizing open-source LLMs
Starred by
+3
Created 3 years ago
Updated 1 month ago
ai-pdf-chatbot-langchain
by
mayooear
0.2%
16k
AI chatbot agent for PDF document Q&A using LangChain & LangGraph
Starred by
+3
Created 3 years ago
Updated 1 month ago
natbot
by
nat
0%
2k
Browser automation via GPT-3
Starred by
+7
Created 3 years ago
Updated 1 year ago
ReAct
by
ysymyth
0.6%
4k
GPT-3 prompting code for ReAct research paper
Starred by
+2
Created 3 years ago
Updated 2 years ago
ChatGLM-finetune-LoRA
by
lich99
0%
716
LoRA finetuning code for ChatGLM-6b
Starred by
Created 3 years ago
Updated 2 years ago
Llama-X
by
AetherCortex
0%
2k
Open academic research project for improving LLaMA into a SOTA LLM
Starred by
Created 3 years ago
Updated 2 years ago
flash-attention
by
Dao-AILab
0.4%
24k
Fast, memory-efficient attention implementation
Starred by
+31
Created 3 years ago
Updated 8 hours ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 3 years ago
Updated 10 months ago
text-generation-inference
by
huggingface
0.1%
11k
Rust/Python/gRPC server for fast LLM text generation
Starred by
+35
Created 3 years ago
Updated 1 month ago
ChatDoctor
by
Kent0n-Li
0.0%
4k
Medical chat model fine-tuned on LLaMA for medical domain Q&A
Starred by
Created 3 years ago
Updated 1 year ago
gpt4all
by
nomic-ai
0.1%
77k
Desktop app for local LLM inference, no GPU/API needed
Starred by
+29
Created 3 years ago
Updated 11 months ago
toolformer-pytorch
by
lucidrains
0%
2k
PyTorch implementation of Toolformer for language models using external tools
Starred by
+2
Created 3 years ago
Updated 1 year ago
textgen
by
oobabooga
0.1%
47k
Web UI for LLM text generation
Starred by
+25
Created 3 years ago
Updated 1 day ago
gptq
by
IST-DASLab
0.2%
2k
Code for GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers
Starred by
+3
Created 3 years ago
Updated 2 years ago
PaLM-rlhf-pytorch
by
lucidrains
0%
8k
RLHF implementation on PaLM
Starred by
+5
Created 3 years ago
Updated 6 months ago
trlx
by
CarperAI
0.0%
5k
Distributed RLHF for LLMs
Starred by
+16
Created 3 years ago
Updated 2 years ago
alpaca_lora_4bit
by
johnsmith0031
0%
536
Fine-tuning and inference tool for quantized LLaMA models
Starred by
Created 3 years ago
Updated 2 years ago
chatgpt-retrieval-plugin
by
openai
0.0%
21k
Retrieval plugin for custom GPTs, function calling, or assistants APIs
Starred by
+23
Created 3 years ago
Updated 1 year ago
GPTQ-for-LLaMa
by
qwopqwop200
0%
3k
4-bit quantization for LLaMA models using GPTQ
Starred by
+2
Created 3 years ago
Updated 1 year ago
dalai
by
cocktailpeanut
0%
13k
Local LLM inference via CLI tool and Node.js API
Starred by
+4
Created 3 years ago
Updated 1 year ago
alpaca-lora
by
tloen
0.0%
19k
LoRA fine-tuning for LLaMA
Starred by
+22
Created 3 years ago
Updated 1 year ago
stanford_alpaca
by
tatsu-lab
0.0%
30k
Instruction-following LLaMA model training and data generation
Starred by
+25
Created 3 years ago
Updated 1 year ago
ColossalAI
by
hpcaitech
0.0%
41k
AI system for large-scale parallel training
Starred by
+25
Created 4 years ago
Updated 19 hours ago
agentic
by
transitive-bullshit
0.0%
18k
AI agent stdlib for LLM-based TypeScript tooling
Starred by
+7
Created 3 years ago
Updated 2 months ago
dagger
by
dagger
0.2%
16k
Open-source runtime for composable workflows, ideal for AI agents
Starred by
+8
Created 6 years ago
Updated 11 hours ago
sdk-python
by
temporalio
0.6%
1k
Python SDK for Temporal, a distributed orchestration engine
Starred by
Created 4 years ago
Updated 6 hours ago
docker-lambda
by
lambci
0%
6k
Deprecated: Docker images for replicating the AWS Lambda environment locally
Starred by
+5
Created 10 years ago
Updated 3 years ago
kong
by
Kong
0.1%
43k
Cloud-native API and AI gateway for microservice orchestration
Starred by
+18
Created 11 years ago
Updated 1 month ago
awesome-machine-learning
by
josephmisiti
0.1%
72k
Curated list of ML frameworks, libraries, and software
Starred by
+24
Created 11 years ago
Updated 1 day ago
hackathon-starter
by
sahat
0.0%
35k
Node.js boilerplate for web applications
Starred by
+11
Created 12 years ago
Updated 20 hours ago