Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Wing Lian
Wing Lian
Founder of Axolotl AI
GitHub
X
Starred Projects (371)
autokernel
by
RightNow-AI
N/A
513
Autonomous GPU kernel optimization for PyTorch
Starred by
Created 2 days ago
Updated 1 day ago
KernelAgent
by
meta-pytorch
8.9%
277
Autonomous GPU kernel generation and optimization via AI agents
Starred by
Created 8 months ago
Updated 3 days ago
cookbook
by
Liquid4All
4.6%
1k
On-device AI models and SDK for edge applications
Starred by
Created 5 months ago
Updated 1 day ago
simple-evals
by
openai
0.2%
4k
Lightweight library for evaluating language models
Starred by
+15
Created 1 year ago
Updated 7 months ago
Awesome-ML-SYS-Tutorial
by
zhaochenyang20
2.6%
6k
ML SYS learning notes and code
Starred by
+1
Created 1 year ago
Updated 1 week ago
ANE
by
maderix
4.5%
6k
Direct neural network training on Apple Neural Engine
Starred by
Created 1 week ago
Updated 3 days ago
multi-agent-coding-system
by
Danau5tin
0.4%
1k
AI coding system with orchestrator, explorer, and coder agents
Starred by
Created 6 months ago
Updated 4 months ago
CUDA-Agent
by
BytedTsinghua-SIA
19.7%
792
Agentic RL for high-performance CUDA kernel generation
Starred by
Created 1 month ago
Updated 1 week ago
OpenClaw-RL
by
Gen-Verse
134.3%
2k
Personalize AI agents through conversational reinforcement learning
Starred by
Created 2 weeks ago
Updated 23 hours ago
crush
by
charmbracelet
1.4%
21k
AI coding agent for your terminal
Starred by
+4
Created 9 months ago
Updated 20 hours ago
slowrun
by
qlabs-eng
28.8%
286
LLM training benchmark prioritizing deep learning over speed
Starred by
Created 2 weeks ago
Updated 13 hours ago
superpowers
by
obra
9.4%
79k
AI assistant superpowers via a comprehensive skills library
Starred by
+10
Created 5 months ago
Updated 1 day ago
kvpress
by
NVIDIA
0.9%
951
LLM KV cache compression made easy
Starred by
Created 1 year ago
Updated 1 day ago
SkillRL
by
aiming-lab
17.5%
381
Recursive skill-augmented reinforcement learning for evolving LLM agents
Created 1 month ago
Updated 3 days ago
Open-AgentRL
by
Gen-Verse
8.8%
356
Reinforcement learning for LLM agents
Created 5 months ago
Updated 2 weeks ago
discover
by
test-time-training
1.6%
495
Learning to discover at test time
Created 1 month ago
Updated 2 weeks ago
mHC-manifold-constrained-hyper-connections
by
tokenbender
0.6%
325
Research implementation of manifold-constrained hyper-connections for deep learning models
Created 2 months ago
Updated 3 weeks ago
IQuest-Coder-V1
by
IQuestLab
0.4%
1k
Code LLMs for autonomous software engineering
Created 2 months ago
Updated 1 week ago
OpenTinker
by
open-tinker
0.8%
642
RL-as-a-Service infrastructure for foundation models
Starred by
+2
Created 2 months ago
Updated 1 day ago
punica
by
punica-ai
0.1%
1k
LoRA serving system (research paper) for multi-tenant LLM inference
Starred by
+3
Created 2 years ago
Updated 1 year ago
mLoRA
by
TUDB-Labs
0.3%
373
Framework for efficient LoRA fine-tuning of multiple LLMs
Created 2 years ago
Updated 1 year ago
miles
by
radixark
2.0%
972
Enterprise RL for large-scale MoE models
Starred by
+4
Created 5 months ago
Updated 19 hours ago
ROLL
by
alibaba
1.4%
3k
RL library for large language models
Starred by
Created 9 months ago
Updated 20 hours ago
Kimi-Linear
by
MoonshotAI
0.3%
1k
Efficient linear attention architecture accelerates long-context LLMs
Created 4 months ago
Updated 3 months ago
Fast-dLLM
by
NVlabs
1.5%
881
Diffusion LLM inference acceleration framework
Starred by
Created 9 months ago
Updated 1 month ago
auto-round
by
intel
1.4%
883
Quantization algorithm for LLMs and VLMs
Starred by
Created 2 years ago
Updated 14 hours ago
luminal
by
luminal-ai
0.1%
3k
Deep learning library using composable compilers for high performance
Starred by
Created 2 years ago
Updated 16 hours ago
MARS
by
AGI-Arena
0%
716
Optimization framework for training large models
Created 1 year ago
Updated 1 week ago
DeepResearch
by
Alibaba-NLP
0.3%
18k
Benchmark for LLMs in web traversal
Starred by
+1
Created 1 year ago
Updated 2 weeks ago
gemlite
by
dropbox
0.5%
438
Triton kernels for efficient low-bit matrix multiplication
Starred by
Created 1 year ago
Updated 1 month ago
AgentGym-RL
by
WooooDyy
2.1%
633
Train LLM agents for long-horizon, multi-turn decision-making
Starred by
Created 6 months ago
Updated 3 weeks ago
LlamaGym
by
KhoomeiK
0.1%
1k
SDK for fine-tuning LLM agents with online reinforcement learning
Starred by
Created 2 years ago
Updated 2 years ago
flame
by
fla-org
0.6%
355
Minimal, efficient framework for LLM training
Starred by
Created 1 year ago
Updated 3 months ago
Soft-Thinking
by
eric-ai-lab
1.3%
318
Enhancing LLM reasoning via continuous concept spaces
Created 9 months ago
Updated 1 month ago
DFT
by
yongliang-wu
0.4%
544
Improving SFT generalization with reward rectification
Starred by
Created 7 months ago
Updated 2 months ago
dion
by
microsoft
0.7%
452
Orthonormal updates for faster distributed ML training
Created 9 months ago
Updated 1 month ago
mixture_of_recursions
by
raymin0223
0.2%
548
Adaptive LLM computation with dynamic recursion
Created 9 months ago
Updated 5 months ago
gem
by
axon-rl
1.5%
461
Agentic LLM training environment for interactive reinforcement learning
Starred by
Created 9 months ago
Updated 1 month ago
cc
by
kn1026
0%
704
Starred by
Created 7 months ago
Updated 7 months ago
HRM
by
sapientinc
0.2%
12k
Hierarchical reasoning for complex tasks
Starred by
Created 8 months ago
Updated 6 months ago
RL2
by
ChenmienTan
0.3%
1k
Reinforcement learning for large language models
Starred by
+1
Created 11 months ago
Updated 1 week ago
matmulfreellm
by
ridgerchu
0.1%
3k
MatMul-free language models
Starred by
+2
Created 1 year ago
Updated 3 months ago
applied-ai
by
meta-pytorch
0%
319
Applied AI experiments and examples for PyTorch
Starred by
Created 2 years ago
Updated 6 months ago
COAT
by
NVlabs
0.4%
262
FP8 training framework for memory efficiency
Created 1 year ago
Updated 7 months ago
SkyRL
by
NovaSky-AI
1.4%
2k
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
Starred by
+14
Created 10 months ago
Updated 14 hours ago
scattermoe
by
shawntan
0%
269
Triton-based Sparse Mixture-of-Experts for efficient deep learning
Starred by
Created 2 years ago
Updated 5 months ago
rStar
by
microsoft
0.1%
1k
Research paper repo for math reasoning in small LLMs via deep thinking
Starred by
Created 1 year ago
Updated 6 months ago
Skills
by
NVIDIA-NeMo
1.6%
863
LLM skill-improvement pipelines for synthetic data generation, training, and evaluation
Starred by
Created 2 years ago
Updated 18 hours ago
Absolute-Zero-Reasoner
by
LeapLabTHU
0.1%
2k
Self-play reasoning framework needing zero data
Starred by
Created 10 months ago
Updated 6 months ago
DeepSeekRL-Extended
by
brendanhogan
0%
252
GRPO implementation for scaled RL research
Starred by
Created 1 year ago
Updated 6 months ago
open-webui
by
open-webui
0.8%
127k
Self-hosted AI platform for local LLM deployment
Starred by
+24
Created 2 years ago
Updated 20 hours ago
TTRL
by
PRIME-RL
1.3%
1k
RL technique for unlabeled data, especially test data
Created 10 months ago
Updated 2 days ago
R2E-Gym
by
R2E-Gym
0.8%
253
Scaling open-weight SWE agents with procedural environments and hybrid verifiers
Starred by
Created 11 months ago
Updated 8 months ago
axolotl
by
axolotl-ai-cloud
0.2%
11k
CLI tool for streamlined post-training of AI models
Starred by
+26
Created 2 years ago
Updated 18 hours ago
agno
by
agno-agi
0.5%
39k
Lightweight library for building AI Agents with memory, knowledge, and reasoning
Starred by
+9
Created 3 years ago
Updated 19 hours ago
github-mcp-server
by
github
0.9%
28k
MCP server for GitHub API automation and interaction
Starred by
Created 1 year ago
Updated 13 hours ago
loong
by
camel-ai
0.2%
488
Synthetic data generation project using LLM agents
Created 11 months ago
Updated 1 week ago
SWE-Gym
by
SWE-Gym
0.1%
650
Environment for training software engineering agents
Starred by
+2
Created 1 year ago
Updated 7 months ago
GamingAgent
by
lmgame-org
0.8%
886
SDK for LLM/VLM gaming agents, enabling model evaluation via games
Starred by
Created 1 year ago
Updated 3 months ago
LLaDA
by
ML-GSAI
0.4%
4k
LLM research paper exploring masked diffusion language models
Starred by
Created 1 year ago
Updated 4 months ago
recurrent-pretraining
by
seal-rg
0.1%
866
Pretraining code for depth-recurrent language model research
Starred by
Created 1 year ago
Updated 2 months ago
TransMLA
by
MuLabPKU
0.5%
434
Post-training method converts GQA-based LLMs to MLA models
Created 1 year ago
Updated 1 week ago
MLGym
by
facebookresearch
0.2%
586
Gym environment for ML research agents
Starred by
Created 1 year ago
Updated 7 months ago
native-sparse-attention-triton
by
XunhaoLai
0.4%
269
Efficient sparse attention for LLMs
Created 1 year ago
Updated 9 months ago
coconut
by
facebookresearch
0.1%
2k
Research paper implementation for LLM reasoning in latent space
Starred by
Created 1 year ago
Updated 7 months ago
native-sparse-attention-pytorch
by
lucidrains
0%
798
Sparse attention implementation from Deepseek's research paper
Created 1 year ago
Updated 7 months ago
ReasonFlux
by
Gen-Verse
0%
521
LLM post-training algorithms for data selection, RL, and inference
Created 1 year ago
Updated 5 months ago
LIMO
by
GAIR-NLP
0.2%
1k
Reasoning model using less data
Starred by
Created 1 year ago
Updated 7 months ago
s1
by
simplescaling
0.0%
7k
Test-time scaling recipe for strong reasoning performance
Starred by
+8
Created 1 year ago
Updated 8 months ago
reasoning-gym
by
open-thought
0.3%
1k
Procedural dataset generator for reasoning models
Starred by
+5
Created 1 year ago
Updated 1 month ago
curator
by
bespokelabsai
0.1%
2k
Synthetic data curation tool for post-training and structured data extraction
Starred by
Created 1 year ago
Updated 1 month ago
RAGEN
by
mll-lab-nu
0.3%
3k
Train LLM agents with reinforcement learning in interactive environments
Starred by
Created 1 year ago
Updated 23 hours ago
SkyThought
by
NovaSky-AI
0%
3k
Training recipes for Sky-T1 family of models
Starred by
+4
Created 1 year ago
Updated 8 months ago
search-and-learn
by
huggingface
0%
1k
Recipes to scale inference-time compute of open models
Starred by
+1
Created 1 year ago
Updated 9 months ago
buffer-of-thought-llm
by
YangLing0818
0.3%
675
Research paper implementation for thought-augmented LLM reasoning
Created 1 year ago
Updated 8 months ago
HuatuoGPT-o1
by
FreedomIntelligence
0.1%
1k
Medical LLM for advanced reasoning
Created 1 year ago
Updated 1 year ago
LayerSkip
by
facebookresearch
0%
361
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding" research paper
Created 2 years ago
Updated 1 month ago
markitdown
by
microsoft
0.5%
91k
Python tool for converting files to Markdown for LLM text analysis
Starred by
+17
Created 1 year ago
Updated 3 days ago
NeMo-Aligner
by
NVIDIA
0%
851
Toolkit for efficient model alignment
Starred by
+1
Created 2 years ago
Updated 5 months ago
agency-swarm
by
VRSEN
0.6%
4k
Agentic framework built on OpenAI Assistants API for automating AI workflows
Starred by
Created 2 years ago
Updated 1 day ago
instructlab
by
instructlab
0.1%
1k
CLI tool for LLM alignment tuning via synthetic data
Starred by
Created 2 years ago
Updated 3 weeks ago
flash-linear-attention
by
fla-org
0.8%
5k
Efficient Torch/Triton implementations for linear attention models
Starred by
+8
Created 2 years ago
Updated 1 day ago
TokenFormer
by
Haiyang-W
0%
588
Research paper on a fully attention-based neural network with tokenized model parameters
Created 1 year ago
Updated 1 year ago
evaluation-guidebook
by
huggingface
0.3%
2k
LLM evaluation guide for practitioners
Starred by
+3
Created 1 year ago
Updated 3 months ago
dynasaur
by
adobe-research
0.3%
358
LLM agent framework using dynamic action creation via Python code generation
Starred by
Created 1 year ago
Updated 1 year ago
Marco-o1
by
AIDC-AI
0.1%
2k
Open reasoning model for real-world problem solving
Created 1 year ago
Updated 4 weeks ago
SageAttention
by
thu-ml
0.6%
3k
Attention kernel for plug-and-play inference acceleration
Starred by
Created 1 year ago
Updated 1 month ago
metaflow
by
Netflix
0.2%
10k
Framework for building and managing AI/ML systems
Starred by
+10
Created 6 years ago
Updated 15 hours ago
Muon
by
KellerJordan
0.8%
2k
Optimizer for neural network hidden layers
Starred by
Created 1 year ago
Updated 1 month ago
MathBlackBox
by
trotsky1997
0%
1k
Research paper for mathematical reasoning via LLMs
Starred by
+1
Created 1 year ago
Updated 1 year ago
BitNet
by
microsoft
12.2%
32k
Inference framework for 1-bit LLMs
Starred by
+8
Created 1 year ago
Updated 3 days ago
Aria
by
rhymes-ai
0%
1k
Multimodal MoE model for video, document understanding, and dialog
Starred by
Created 1 year ago
Updated 1 year ago
Hands-On-Large-Language-Models
by
HandsOnLLM
0.5%
23k
Code examples for "Hands-On Large Language Models" book
Starred by
Created 1 year ago
Updated 2 months ago
modded-nanogpt
by
KellerJordan
0.9%
5k
Language model training speedrun on 8x H100 GPUs
Starred by
+8
Created 1 year ago
Updated 2 days ago
llama-stack
by
llamastack
0.1%
8k
Composable building blocks for Llama apps
Starred by
+7
Created 1 year ago
Updated 1 day ago
Adam-mini
by
zyushun
0%
453
PyTorch implementation of Adam-mini optimizer from a research paper
Starred by
Created 1 year ago
Updated 10 months ago
optillm
by
algorithmicsuperintelligence
0.5%
3k
Optimizing inference proxy for LLMs
Starred by
+8
Created 1 year ago
Updated 1 month ago
LLM-Blender
by
yuchenlin
0%
976
LLM ensembling framework using pairwise ranking and generative fusion
Starred by
+3
Created 2 years ago
Updated 1 year ago
EvolKit
by
arcee-ai
0.4%
253
LLM instruction enhancement framework
Starred by
Created 1 year ago
Updated 1 year ago
LLMs-Planning
by
karthikv792
0.2%
454
Benchmark for evaluating LLMs on planning tasks
Created 3 years ago
Updated 5 months ago
rStar
by
zhentingqi
0%
968
Research paper for improving small LLM reasoning via mutual reasoning
Starred by
Created 1 year ago
Updated 1 year ago
distributed-training-guide
by
LambdaLabsML
1.0%
592
PyTorch guide for distributed training of large language models
Starred by
Created 1 year ago
Updated 4 months ago
nyuntam
by
nyunAI
0%
673
CLI tool for LLM compression via pruning, quantization, and distillation
Created 1 year ago
Updated 1 year ago
distillm
by
jongwooko
0.8%
254
Streamlined LLM distillation for efficient model training
Starred by
Created 2 years ago
Updated 1 year ago
MisguidedAttention
by
cpldcpu
0.4%
465
LLM reasoning benchmark for evaluating responses to misleading prompts
Starred by
Created 1 year ago
Updated 7 months ago
Open-Reasoning-Tasks
by
NousResearch
0.9%
463
Reasoning tasks collection for LLMs
Starred by
+3
Created 1 year ago
Updated 1 year ago
long-context-attention
by
feifeibear
0.3%
645
Unified sequence parallel attention for long context LLM training/inference
Starred by
Created 1 year ago
Updated 1 month ago
DistillKit
by
arcee-ai
0.6%
888
Open-source toolkit for LLM distillation research
Starred by
Created 1 year ago
Updated 2 months ago
do-not-answer
by
Libr-AI
0.9%
321
Dataset for evaluating LLM safety mechanisms
Starred by
Created 2 years ago
Updated 1 year ago
fms-fsdp
by
foundation-model-stack
0.7%
282
Efficiently train foundation models with PyTorch
Starred by
Created 2 years ago
Updated 3 months ago
OLMo
by
allenai
0.4%
6k
Open language model code for training, evaluation, and inference
Starred by
+4
Created 3 years ago
Updated 3 months ago
BAdam
by
Ledzy
0%
285
Memory-efficient optimizer for large language model finetuning
Starred by
Created 1 year ago
Updated 1 year ago
snowflake-arctic
by
Snowflake-Labs
0%
559
AI research project for efficient LLM training and inference
Starred by
Created 1 year ago
Updated 1 year ago
open-instruct
by
allenai
0.3%
4k
Training codebase for instruction-following language models
Starred by
+10
Created 2 years ago
Updated 1 day ago
mdistiller
by
megvii-research
0.1%
893
PyTorch library for knowledge distillation research
Created 4 years ago
Updated 2 years ago
augmentoolkit
by
e-p-armstrong
0.1%
2k
Data toolkit for custom LLM creation using open-source AI
Starred by
+3
Created 2 years ago
Updated 4 months ago
awesome-synthetic-datasets
by
davanstrien
0%
325
Curated list of synthetic text/vision datasets and generation tools
Created 2 years ago
Updated 2 months ago
calm
by
zeux
0%
628
Single-GPU inference engine for rapid LLM prototyping
Starred by
Created 2 years ago
Updated 9 months ago
MobileLLM
by
facebookresearch
0.2%
1k
Sub-billion parameter LLM training code for on-device use
Starred by
+2
Created 1 year ago
Updated 10 months ago
phoenix
by
Arize-ai
0.8%
9k
AI observability platform for experimentation, evaluation, and troubleshooting
Starred by
+6
Created 3 years ago
Updated 14 hours ago
SPPO
by
uclaml
0%
583
Self-Play Preference Optimization (SPPO) aligns language models via self-play
Starred by
Created 1 year ago
Updated 1 year ago
AutoIF
by
QwenLM
0.3%
326
Research paper for improving LLM instruction-following via self-play with execution feedback
Starred by
Created 1 year ago
Updated 1 year ago
refusal_direction
by
andyrdt
1.1%
359
Research paper code for analyzing refusal in language models
Starred by
Created 1 year ago
Updated 9 months ago
YaFSDP
by
yandex
0%
984
Sharded data parallelism framework for transformer-like neural networks
Starred by
Created 1 year ago
Updated 1 month ago
chat_templates
by
chujiezheng
0.1%
716
Chat templates for HuggingFace LLMs
Starred by
Created 2 years ago
Updated 1 year ago
LESS
by
princeton-nlp
0.2%
513
Data selection research paper for targeted instruction tuning
Starred by
Created 2 years ago
Updated 1 year ago
MixEval
by
JinjieNi
0%
255
Dynamic LLM evaluation suite for accurate, cost-effective benchmarking
Starred by
Created 1 year ago
Updated 1 year ago
MoRA
by
kongds
0%
361
Parameter-efficient fine-tuning via high-rank updating (MoRA)
Starred by
Created 1 year ago
Updated 1 year ago
SimPO
by
princeton-nlp
0%
946
Preference optimization algorithm for LLMs (NeurIPS 2024 paper)
Starred by
Created 1 year ago
Updated 1 year ago
qodo-cover
by
qodo-ai
0.2%
5k
CLI tool for AI-powered test generation and code coverage enhancement
Starred by
Created 1 year ago
Updated 8 months ago
gemma-2B-10M
by
mustafaaljadery
0%
936
Gemma 2B with 10M context length using Infini-attention
Starred by
Created 1 year ago
Updated 1 year ago
xtuner
by
InternLM
0.1%
5k
LLM fine-tuning toolkit for research
Starred by
+2
Created 2 years ago
Updated 14 hours ago
GLiNER
by
urchade
0.7%
3k
NER model for identifying any entity type using bidirectional transformer
Starred by
Created 2 years ago
Updated 16 hours ago
contriever
by
facebookresearch
0.1%
771
Unsupervised dense information retrieval via contrastive learning
Starred by
Created 4 years ago
Updated 2 years ago
prometheus-eval
by
prometheus-eval
0.2%
1k
LLM evaluation framework using open LLMs
Starred by
Created 1 year ago
Updated 10 months ago
LLMTest_NeedleInAHaystack
by
gkamradt
0.4%
2k
LLM testing tool for evaluating in-context retrieval accuracy
Starred by
+3
Created 2 years ago
Updated 1 year ago
selfcodealign
by
bigcode-project
0%
323
Research paper for self-alignment in code generation
Starred by
Created 1 year ago
Updated 1 year ago
llm-datasets
by
mlabonne
1.0%
4k
Curated datasets/tools for LLM post-training
Starred by
+1
Created 1 year ago
Updated 4 days ago
rerope
by
bojone
0%
388
Position embeddings research paper
Starred by
Created 2 years ago
Updated 1 year ago
LaVague
by
lavague-ai
0.1%
6k
Web agent framework for automating web processes
Starred by
+7
Created 2 years ago
Updated 1 year ago
ring-flash-attention
by
zhuzilin
0.5%
996
FlashAttention extension for ring attention
Starred by
+2
Created 2 years ago
Updated 6 months ago
llamaduo
by
deep-diver
0%
317
LLMOps pipeline to fine-tune small LLMs for service LLM outage prep
Starred by
Created 2 years ago
Updated 8 months ago
cohere-toolkit
by
cohere-ai
0.1%
3k
RAG toolkit for LLM application development and deployment
Starred by
+4
Created 1 year ago
Updated 1 month ago
uptrain
by
uptrain-ai
0.1%
2k
Open-source platform to evaluate and improve GenAI apps
Starred by
+5
Created 3 years ago
Updated 1 year ago
BitBLAS
by
microsoft
0%
753
Library for mixed-precision matrix multiplications, targeting quantized LLM deployment
Created 2 years ago
Updated 7 months ago
arena-hard-auto
by
lmarena
0.3%
1k
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 2 years ago
Updated 8 months ago
ChunkLlama
by
HKUNLP
0%
448
Training-free method for extending LLM context windows
Created 2 years ago
Updated 1 year ago
dstack
by
dstackai
0.1%
2k
Open-source tool for simplifying GPU allocation and AI workload orchestration
Starred by
+3
Created 4 years ago
Updated 13 hours ago
rho
by
microsoft
0%
460
LLM pretraining research paper using selective language modeling (SLM)
Starred by
Created 1 year ago
Updated 1 year ago
dify
by
langgenius
0.9%
133k
Open-source LLM app development platform
Starred by
+17
Created 2 years ago
Updated 13 hours ago
attorch
by
BobMcDear
0.2%
597
PyTorch nn module subset, implemented in Python using Triton
Starred by
+2
Created 2 years ago
Updated 7 months ago
mixtral-offloading
by
dvmazur
0%
2k
Inference optimization for Mixtral-8x7B models
Starred by
Created 2 years ago
Updated 1 year ago
auto-code-rover
by
AutoCodeRoverSG
0.2%
3k
Autonomous software engineer for program improvement
Starred by
+3
Created 1 year ago
Updated 10 months ago
BitNet-Transformers
by
Beomi
0%
313
HuggingFace Transformers implementation of BitNet scaling for LLMs
Created 2 years ago
Updated 2 years ago
EasyContext
by
jzhang38
0.1%
755
Recipes for language model context length extrapolation to 1M tokens
Starred by
+2
Created 1 year ago
Updated 1 year ago
pyreft
by
stanfordnlp
0.2%
2k
Python library for representation finetuning (ReFT) of language models
Starred by
Created 2 years ago
Updated 1 week ago
hlb-gpt
by
tysam-code
0%
355
Researcher's toolbench for GPT model exploration
Starred by
Created 3 years ago
Updated 1 year ago
aideml
by
WecoAI
0.4%
1k
ML engineering agent for automated AI R&D, surpassing human experts
Starred by
Created 1 year ago
Updated 4 weeks ago
BitNet
by
kyegomez
0.2%
2k
PyTorch implementation of BitNet research paper
Starred by
Created 2 years ago
Updated 1 month ago
horovod
by
horovod
0.0%
15k
Distributed training framework for TF, Keras, PyTorch, and MXNet
Starred by
+19
Created 8 years ago
Updated 3 months ago
dataverse
by
UpstageAI
0.2%
566
ETL pipeline for LLM data processing
Starred by
Created 2 years ago
Updated 1 year ago
hqq
by
dropbox
0.2%
917
Model quantizer for fast, accurate post-training quantization, skipping calibration
Starred by
Created 2 years ago
Updated 2 weeks ago
Triton-Puzzles
by
srush
0.3%
2k
Interactive puzzles for learning Triton
Starred by
Created 2 years ago
Updated 1 year ago
repeng
by
vgel
0.1%
693
Python library for representation engineering control vectors
Starred by
Created 2 years ago
Updated 5 months ago
cobra
by
h-zhao1997
0%
293
Multimodal LLM research paper extending Mamba for efficient inference
Created 2 years ago
Updated 1 year ago
hackathon
by
mistralai-sf24
0%
446
Minimal code for running and finetuning a 7B transformer model
Starred by
Created 2 years ago
Updated 1 year ago
raft
by
rapidsai
0.1%
988
CUDA-accelerated primitives for ML/data mining algorithms
Starred by
Created 6 years ago
Updated 17 hours ago
maestro
by
Doriandarko
0.1%
4k
Framework for Claude Opus to orchestrate subagents
Starred by
Created 2 years ago
Updated 1 year ago
quiet-star
by
ezelikman
0%
741
Research code for self-teaching language models
Starred by
Created 2 years ago
Updated 1 year ago
ml-engineering
by
stas00
0.4%
17k
Open book for LLM/VLM training engineers
Starred by
+17
Created 5 years ago
Updated 2 days ago
chatbot-ui
by
mckaywrigley
0.1%
33k
Open-source AI chat app
Starred by
+14
Created 3 years ago
Updated 1 year ago
orpo
by
xfactlab
0%
472
Preference optimization without a reference model
Starred by
Created 2 years ago
Updated 1 year ago
SWE-bench
by
SWE-bench
1.0%
4k
Benchmark for evaluating LLMs on real-world GitHub issues
Starred by
+12
Created 2 years ago
Updated 2 days ago
OpenHands
by
OpenHands
0.5%
69k
AI platform for software development agents
Starred by
+36
Created 2 years ago
Updated 15 hours ago
FastV
by
pkunlp-icler
0.5%
560
Inference acceleration for large vision-language models (research paper)
Created 2 years ago
Updated 1 year ago
airllm
by
lyogavin
1.9%
14k
Inference optimization for LLMs on low-resource hardware
Starred by
Created 2 years ago
Updated 3 days ago
daytona
by
daytonaio
3.0%
64k
Infrastructure for running AI-generated code
Starred by
+5
Created 2 years ago
Updated 1 day ago
VisionLLaMA
by
Meituan-AutoML
0%
392
Vision transformer research paper
Created 2 years ago
Updated 1 year ago
fsdp_qlora
by
AnswerDotAI
0.1%
2k
Training script for LLMs using QLoRA + FSDP
Starred by
+3
Created 2 years ago
Updated 1 year ago
h2o-llmstudio
by
h2oai
0.1%
5k
LLM Studio: framework for LLM fine-tuning via GUI or CLI
Starred by
+5
Created 2 years ago
Updated 1 day ago
ChatMusician
by
hf-lin
0%
304
LLM for music understanding and generation
Created 2 years ago
Updated 1 year ago
AnyGPT
by
OpenMOSS
0.2%
873
Multimodal LLM research paper for any-to-any modality conversion
Starred by
Created 2 years ago
Updated 1 year ago
FlagEmbedding
by
FlagOpen
0.3%
11k
Toolkit for retrieval and RAG applications
Starred by
+8
Created 2 years ago
Updated 3 days ago
self-rewarding-lm-pytorch
by
lucidrains
0%
1k
Training framework for self-rewarding language models
Starred by
+4
Created 2 years ago
Updated 1 year ago
crewAI
by
crewAIInc
1.3%
46k
Framework for autonomous AI agent orchestration via role-playing and collaboration
Starred by
+18
Created 2 years ago
Updated 15 hours ago
resource-stream
by
gpu-mode
1.3%
2k
CUDA resource collection for GPU programming
Starred by
Created 2 years ago
Updated 5 days ago
metal-flash-attention
by
philipturner
0%
589
Metal port of FlashAttention for Apple silicon
Starred by
+2
Created 2 years ago
Updated 1 year ago
LLMs-from-scratch
by
rasbt
0.6%
88k
Educational resource for LLM construction in PyTorch
Starred by
+11
Created 2 years ago
Updated 5 days ago
mlx-examples
by
ml-explore
0.4%
8k
Examples using the MLX framework
Starred by
+7
Created 2 years ago
Updated 4 weeks ago
ai-codereviewer
by
villesau
0.1%
1k
GitHub Action for AI-powered code review
Starred by
Created 3 years ago
Updated 1 year ago
deita
by
hkust-nlp
0.2%
591
Data-efficient instruction tuning for LLM alignment (ICLR 2024)
Starred by
Created 2 years ago
Updated 1 year ago
AutoAWQ
by
casper-hansen
0.0%
2k
AutoAWQ is a tool for 4-bit quantized LLM inference
Starred by
+5
Created 2 years ago
Updated 10 months ago
ProxyAI
by
carlrobertoh
0.3%
2k
JetBrains IDE copilot for coding assistance
Starred by
Created 3 years ago
Updated 2 days ago
EAGLE
by
SafeAILab
0.5%
2k
Speculative decoding research paper for faster LLM inference
Starred by
+5
Created 2 years ago
Updated 3 weeks ago
HALOs
by
ContextualAI
0.1%
908
Library for aligning LLMs using human-aware loss functions
Starred by
Created 2 years ago
Updated 5 months ago
mamba
by
state-spaces
0.3%
17k
Mamba SSM architecture for sequence modeling
Starred by
+22
Created 2 years ago
Updated 3 days ago
modelz-llm
by
tensorchord
0%
276
Inference server for open-source LLMs, offering an OpenAI-compatible API
Created 2 years ago
Updated 2 years ago
unsloth
by
unslothai
0.8%
54k
Finetuning tool for LLMs, targeting speed and memory efficiency
Starred by
+38
Created 2 years ago
Updated 13 hours ago
gpt-researcher
by
assafelovic
0.5%
26k
Autonomous agent for web/local research, generating cited reports
Starred by
+9
Created 2 years ago
Updated 1 week ago
functionary
by
MeetKai
0.1%
2k
Chat language model for tool use and result interpretation
Starred by
+2
Created 2 years ago
Updated 3 months ago
Logic-LLM
by
teacherpeterpan
0.8%
387
Logic-LM: Framework for improved logical reasoning via LLMs and symbolic solvers
Created 2 years ago
Updated 1 year ago
LLMSurvey
by
RUCAIBox
0.1%
12k
Survey paper for large language models
Starred by
+2
Created 3 years ago
Updated 1 year ago
distilabel
by
argilla-io
0.2%
3k
Framework for synthetic data and AI feedback pipelines
Starred by
+12
Created 2 years ago
Updated 3 days ago
long-llms-learning
by
Strivin0311
0%
272
Literature repository for long-context LLM methodologies
Starred by
Created 2 years ago
Updated 1 year ago
MergeLM
by
yule-BUAA
0.1%
863
Codebase for merging language models via parameter averaging
Starred by
Created 2 years ago
Updated 1 year ago
Video-LLaVA
by
PKU-YuanGroup
0.2%
3k
Video-LLaVA: Multimodal model for video/image understanding via LLM
Starred by
Created 2 years ago
Updated 1 year ago
medAlpaca
by
kbressem
0%
556
LLM finetuned for medical question answering
Starred by
Created 3 years ago
Updated 2 years ago
intel-extension-for-transformers
by
intel
0.1%
2k
Transformer toolkit for GenAI/LLM acceleration on Intel platforms
Starred by
Created 3 years ago
Updated 1 year ago
representation-engineering
by
andyzoujm
0.8%
964
AI transparency via representation engineering
Starred by
Created 2 years ago
Updated 1 year ago
multimodal
by
facebookresearch
0.1%
2k
PyTorch library for multimodal multi-task model training
Starred by
+1
Created 4 years ago
Updated 4 days ago
S-LoRA
by
S-LoRA
0.2%
2k
System for scalable LoRA adapter serving
Starred by
+1
Created 2 years ago
Updated 2 years ago
DeepSpeed
by
deepspeedai
0.1%
42k
Deep learning optimization library for distributed training and inference
Starred by
+36
Created 6 years ago
Updated 21 hours ago
continue
by
continuedev
0.5%
32k
IDE extension for custom AI code assistants
Starred by
+16
Created 2 years ago
Updated 21 hours ago
llama-cookbook
by
meta-llama
0.1%
18k
Guide for building with Llama models
Starred by
+15
Created 2 years ago
Updated 1 week ago
finetuner
by
jina-ai
0%
2k
Cloud tool for task-oriented embedding finetuning of models like BERT and CLIP
Starred by
+3
Created 4 years ago
Updated 2 years ago
ludwig
by
ludwig-ai
0.0%
12k
Low-code framework for custom AI models (LLMs, neural networks)
Starred by
+17
Created 7 years ago
Updated 4 days ago
img2dataset
by
rom1504
0.2%
4k
CLI tool for creating large image datasets from URLs
Starred by
+12
Created 4 years ago
Updated 4 months ago
distilling-step-by-step
by
google-research
0.7%
585
Code for research paper on knowledge distillation
Starred by
Created 2 years ago
Updated 2 years ago
Cherry_LLM
by
tianyi-lab
0%
416
Research paper for LLM instruction tuning via self-guided data selection
Created 2 years ago
Updated 8 months ago
Reflection_Tuning
by
tianyi-lab
0%
367
Research paper for LLM instruction tuning via data recycling
Starred by
Created 2 years ago
Updated 1 year ago
instructor
by
567-labs
0.3%
13k
SDK for structured LLM outputs using Pydantic models
Starred by
+27
Created 2 years ago
Updated 2 days ago
YiVal
by
YiVal
0.1%
2k
Prompt engineering assistant for GenAI apps
Starred by
Created 2 years ago
Updated 1 year ago
LLM-Shearing
by
princeton-nlp
0%
642
Code for LLM pre-training acceleration via structured pruning (ICLR 2024)
Starred by
+1
Created 2 years ago
Updated 2 years ago
letta
by
letta-ai
0.6%
22k
Agent framework for stateful agents with memory, reasoning, and context management
Starred by
+19
Created 2 years ago
Updated 1 week ago
CogVLM
by
zai-org
0.0%
7k
VLM for image understanding and multi-turn dialogue
Starred by
+4
Created 2 years ago
Updated 1 year ago
ragas
by
vibrantlabsai
0.7%
13k
Toolkit for LLM application evaluation
Starred by
+12
Created 2 years ago
Updated 2 weeks ago
NEFTune
by
neelsjain
0.2%
410
Technique to improve instruction finetuning of LLMs
Starred by
Created 2 years ago
Updated 1 year ago
FireAct
by
anchen1011
0%
292
Language agent fine-tuning research paper
Starred by
Created 2 years ago
Updated 2 years ago
LLaVA
by
haotian-liu
0.2%
25k
Multimodal assistant with GPT-4 level capabilities
Starred by
+16
Created 2 years ago
Updated 1 year ago
alignment-handbook
by
huggingface
0.1%
6k
Handbook for aligning language models with human/AI preferences
Starred by
+11
Created 2 years ago
Updated 6 months ago
autolabel
by
refuel-ai
0.1%
2k
Python library to label text datasets using LLMs
Starred by
+4
Created 3 years ago
Updated 1 year ago
EmpatheticDialogues
by
facebookresearch
0.2%
542
PyTorch code for empathetic dialogue research
Starred by
Created 6 years ago
Updated 4 years ago
world-models
by
wesg52
0%
260
Research paper code for extracting spatial/temporal world models from LLMs
Starred by
Created 2 years ago
Updated 2 years ago
OpenGPT
by
CogStack
0%
361
Framework for grounded instruction datasets and domain-expert LLMs
Starred by
Created 2 years ago
Updated 2 years ago
Medusa
by
FasterDecoding
0.1%
3k
Framework for accelerating LLM generation using multiple decoding heads
Starred by
+6
Created 2 years ago
Updated 1 year ago
open_flamingo
by
mlfoundations
0.0%
4k
Open-source framework for training large multimodal models
Starred by
+7
Created 3 years ago
Updated 1 year ago
textbook_quality
by
VikParuchuri
0%
509
Synthetic data generator for LLM pretraining
Starred by
Created 2 years ago
Updated 2 years ago
tree-of-thought-llm
by
princeton-nlp
0.2%
6k
Research paper implementation for Tree of Thoughts (ToT) prompting
Starred by
+7
Created 2 years ago
Updated 1 year ago
LongLoRA
by
JIA-Lab-research
0.0%
3k
LongLoRA: Efficient fine-tuning for long-context LLMs
Starred by
+1
Created 2 years ago
Updated 1 year ago
kani
by
zhudotexe
0%
599
Microframework for chat-based language models with tool use/function calling
Starred by
Created 2 years ago
Updated 1 week ago
DoLa
by
voidism
0%
544
Decoding strategy research paper for improving factuality in LLMs
Starred by
Created 2 years ago
Updated 1 year ago
varuna
by
microsoft
0%
251
Tool for efficient large DNN model training on commodity hardware
Starred by
Created 4 years ago
Updated 1 year ago
BLoRA
by
sabetAI
0.3%
351
Inference optimization for batched LoRA adapters
Starred by
Created 2 years ago
Updated 2 years ago
TinyLlama
by
jzhang38
0.1%
9k
Tiny pretraining project for a 1.1B Llama model
Starred by
+18
Created 2 years ago
Updated 1 year ago
sparsegpt
by
IST-DASLab
0.3%
875
Code for massive language model one-shot pruning (ICML 2023 paper)
Starred by
Created 3 years ago
Updated 1 year ago
LLM-Pruner
by
horseee
0.2%
1k
LLM structural pruner for model compression
Created 2 years ago
Updated 1 year ago
graph-of-thoughts
by
spcl
0.1%
3k
Graph-of-Thoughts: LLM framework for complex problem-solving
Starred by
+1
Created 2 years ago
Updated 1 year ago
tensor_parallel
by
BlackSamorez
0.1%
656
PyTorch module for multi-GPU model parallelism
Starred by
Created 3 years ago
Updated 2 years ago
relora
by
Guitaricet
0.2%
474
PEFT pretraining code for ReLoRA research paper
Starred by
Created 2 years ago
Updated 1 year ago
wandbot
by
wandb
0%
309
Support bot for Weights & Biases' AI tools, running in Discord, Slack, ChatGPT, and Zendesk
Starred by
Created 3 years ago
Updated 2 weeks ago
LightLLM
by
ModelTC
0.3%
4k
Python framework for LLM inference and serving
Starred by
+6
Created 2 years ago
Updated 13 hours ago
lmdeploy
by
InternLM
0.3%
8k
Toolkit for LLM compression, deployment, and serving
Starred by
+8
Created 2 years ago
Updated 15 hours ago
llama-chat
by
replicate
0%
835
Next.js app for Llama 3 chat UI development
Created 2 years ago
Updated 2 months ago
llama2-chatbot
by
a16z-infra
0%
1k
Streamlit chatbot app for interacting with LLMs
Starred by
Created 2 years ago
Updated 2 years ago
IncognitoPilot
by
silvanmelchior
0%
440
AI code interpreter for local data processing, like ChatGPT Code Interpreter
Created 2 years ago
Updated 2 years ago
ai-town
by
a16z-infra
0.5%
10k
AI town starter kit for building a virtual world
Starred by
+12
Created 2 years ago
Updated 2 months ago
octopack
by
bigcode-project
0%
478
Code LLM instruction tuning research paper
Starred by
+2
Created 3 years ago
Updated 1 year ago
outlines
by
dottxt-ai
0.3%
14k
SDK for structured LLM text generation
Starred by
+34
Created 3 years ago
Updated 4 days ago
bubogpt
by
magic-research
0%
511
Multi-modal LLM for joint text, vision, and audio understanding
Created 2 years ago
Updated 2 years ago
MetaGPT
by
FoundationAgents
0.4%
65k
Multi-agent framework for collaborative AI software development
Starred by
+10
Created 2 years ago
Updated 1 month ago
pykoi
by
CambioML
0%
411
Python library for reinforcement learning with human feedback (RLHF)
Starred by
Created 2 years ago
Updated 5 months ago
ChainFury
by
NimbleBoxAI
0%
452
Open-source chaining engine for production AI apps
Starred by
Created 2 years ago
Updated 1 year ago
candle
by
huggingface
0.4%
20k
Minimalist ML framework for Rust, emphasizing performance and ease of use
Starred by
+23
Created 2 years ago
Updated 1 day ago
Megatron-LLM
by
epfLLM
0%
590
Distributed trainer for LLMs
Starred by
Created 2 years ago
Updated 1 year ago
ToolBench
by
OpenBMB
0.1%
6k
Open platform for LLM tool learning (ICLR'24 spotlight)
Starred by
+6
Created 2 years ago
Updated 9 months ago
gpt-engineer
by
AntonOsika
0.1%
55k
CLI platform for code generation experimentation
Starred by
+17
Created 2 years ago
Updated 10 months ago
RRHF
by
GanjinZero
0%
808
RRHF for aligning LLMs to human preferences
Starred by
Created 2 years ago
Updated 2 years ago
LlamaFactory
by
hiyouga
0.5%
68k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Starred by
+25
Created 2 years ago
Updated 3 days ago
exllama
by
turboderp
0%
3k
Llama implementation for memory-efficient quantized weights
Starred by
+6
Created 2 years ago
Updated 2 years ago
doremi
by
sangmichaelxie
0.3%
352
PyTorch for optimizing data mixtures in language model datasets
Starred by
Created 2 years ago
Updated 2 years ago
UltraChat
by
thunlp
0.3%
3k
Multi-round dialogue dataset and models for chat language model training
Starred by
Created 2 years ago
Updated 2 years ago
RealChar
by
Shaunwei
0.0%
6k
Real-time AI character/companion creation and interaction codebase
Starred by
+3
Created 2 years ago
Updated 1 month ago
serve
by
jina-ai
0.1%
22k
Framework for building cloud-native multimodal AI apps
Starred by
+17
Created 6 years ago
Updated 11 months ago
aider
by
Aider-AI
0.8%
42k
AI pair programming in your terminal
Starred by
+38
Created 2 years ago
Updated 4 days ago
LMFlow
by
OptimalScale
0.0%
8k
Toolkit for finetuning and inference of large foundation models
Starred by
+9
Created 3 years ago
Updated 3 weeks ago
baize-chatbot
by
project-baize
0%
3k
Chat model trained via LoRA, using ChatGPT-generated dialogs
Starred by
+3
Created 2 years ago
Updated 2 years ago
ToolQA
by
night-chen
0%
286
Dataset for evaluating LLMs using external tools
Created 2 years ago
Updated 2 years ago
SuperAGI
by
TransformerOptimus
0.2%
17k
Open-source framework for autonomous AI agent development
Starred by
+4
Created 2 years ago
Updated 1 year ago
audiocraft
by
facebookresearch
0.2%
23k
PyTorch library for audio processing and generation research
Starred by
+15
Created 2 years ago
Updated 1 week ago
guidance
by
guidance-ai
0.1%
21k
Guidance is a programming paradigm for steering LLMs
Starred by
+38
Created 3 years ago
Updated 3 weeks ago
open_llama
by
openlm-research
0.0%
8k
Open-source reproduction of LLaMA models
Starred by
+14
Created 2 years ago
Updated 2 years ago
RL4LMs
by
allenai
0.0%
2k
RL library to fine-tune language models to human preferences
Starred by
+3
Created 3 years ago
Updated 2 years ago
SwiftSage
by
SwiftSage
0%
324
Agent system for reasoning with LLMs via in-context reinforcement learning
Created 2 years ago
Updated 1 year ago
ctransformers
by
marella
0%
2k
Python bindings for fast Transformer model inference
Starred by
+8
Created 2 years ago
Updated 2 years ago
developer
by
smol-ai
0.1%
12k
Agent for embedding a developer in your app
Starred by
+27
Created 2 years ago
Updated 1 year ago
MeZO
by
princeton-nlp
0.2%
1k
Research paper implementation for memory-efficient LM fine-tuning
Starred by
Created 2 years ago
Updated 2 years ago
ImageBind
by
facebookresearch
0.1%
9k
PyTorch implementation for multimodal embeddings research paper
Starred by
+5
Created 3 years ago
Updated 3 months ago
xtreme1
by
xtreme1-io
0.1%
1k
Open-source platform for multimodal training data annotation
Starred by
Created 3 years ago
Updated 8 months ago
sudolang
by
paralleldrive
0.5%
1k
VS Code extension for LLM-based programming with SudoLang
Starred by
Created 2 years ago
Updated 1 month ago
poe-api
by
ading2210
0.1%
2k
Python API for Quora's Poe (unmaintained)
Created 3 years ago
Updated 2 years ago
Local-LLM-Comparison-Colab-UI
by
Troyanovsky
0.1%
1k
Local LLM comparison via Colab WebUI links
Starred by
Created 2 years ago
Updated 1 month ago
airoboros
by
jondurbin
0%
1k
Self-instruct tool for LLM finetuning
Starred by
+3
Created 2 years ago
Updated 2 years ago
PaLM
by
conceptofmind
0%
819
Open-source PaLM implementation for language model research
Starred by
Created 2 years ago
Updated 1 year ago
TruthfulQA
by
sylinrl
0.3%
891
Benchmark dataset for evaluating truthfulness of language models
Starred by
Created 4 years ago
Updated 1 year ago
private-gpt
by
zylon-ai
0.1%
57k
Private AI API for local document interaction using LLMs
Starred by
+13
Created 2 years ago
Updated 2 weeks ago
PMC-LLaMA
by
chaoyi-wu
0.1%
677
Medical LLM for instruction-following in the medical domain
Created 2 years ago
Updated 1 year ago
openlm
by
r2d4
0%
372
OpenAI-compatible Python client for calling LLMs
Starred by
+1
Created 2 years ago
Updated 2 years ago
FasterTransformer
by
NVIDIA
0.0%
6k
Optimized transformer library for inference
Starred by
+12
Created 5 years ago
Updated 1 year ago
unlimiformer
by
abertsch72
0%
1k
Research paper for long-range transformers with unlimited input
Starred by
+1
Created 2 years ago
Updated 2 years ago
gpt-neox
by
EleutherAI
0.0%
7k
Framework for training large-scale autoregressive language models
Starred by
+22
Created 5 years ago
Updated 1 month ago
toolformer
by
conceptofmind
0.3%
380
Open-source implementation of Toolformer research paper
Starred by
Created 3 years ago
Updated 3 years ago
bark
by
suno-ai
0.1%
39k
Generative audio model for realistic speech and sound effects
Starred by
+19
Created 2 years ago
Updated 1 year ago
chat-langchain
by
langchain-ai
0.1%
6k
Chatbot for question answering over LangChain documentation
Starred by
+3
Created 3 years ago
Updated 1 day ago
LaMini-LM
by
mbzuai-nlp
0%
822
Small, efficient language models distilled from ChatGPT for research
Starred by
Created 2 years ago
Updated 2 years ago
ChatRWKV
by
BlinkDL
0.0%
10k
Open-source chatbot powered by the RWKV RNN language model
Starred by
+4
Created 3 years ago
Updated 1 month ago
RWKV-LM
by
BlinkDL
0.1%
14k
RNN for LLM, transformer-level performance, parallelizable training
Starred by
+29
Created 4 years ago
Updated 1 week ago
LocalAI
by
mudler
0.4%
44k
Open-source OpenAI alternative for local AI inference
Starred by
+14
Created 3 years ago
Updated 15 hours ago
WizardLM
by
nlpxucan
0.0%
9k
LLMs built using Evol-Instruct for complex instruction following
Starred by
+15
Created 2 years ago
Updated 9 months ago
chameleon-llm
by
lupantech
0%
1k
Research paper code for plug-and-play compositional reasoning with LLMs
Starred by
Created 2 years ago
Updated 2 years ago
llama-lab
by
run-llama
0.1%
2k
LlamaIndex projects for LLM data augmentation
Starred by
Created 2 years ago
Updated 2 years ago
EdgeGPT
by
acheong08
0%
8k
Reverse-engineered API for Microsoft Bing Chat (archived)
Starred by
Created 3 years ago
Updated 2 years ago
gisting
by
jayelm
1.3%
311
Research paper implementation for prompt compression via learned "gist" tokens
Starred by
Created 2 years ago
Updated 1 year ago
gpt-llama.cpp
by
keldenl
0%
598
API wrapper for local LLM inference, emulating OpenAI's GPT endpoints
Starred by
Created 2 years ago
Updated 2 years ago
memit
by
kmeng01
0.2%
543
Transformer memory mass-editor (ICLR 2023 research paper)
Starred by
Created 3 years ago
Updated 2 years ago
dl4math
by
lupantech
0%
371
DL4MATH: Deep learning resources for mathematical reasoning
Created 3 years ago
Updated 2 years ago
MiniGPT-4
by
Vision-CAIR
0.0%
26k
Vision-language model for multi-task learning
Starred by
+15
Created 2 years ago
Updated 1 year ago
auto-cot
by
amazon-science
0%
2k
Research paper implementation for automatic chain-of-thought prompting
Starred by
Created 3 years ago
Updated 2 years ago
OpenChatKit
by
togethercomputer
0.0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 3 years ago
Updated 1 year ago
PythonProgrammingPuzzles
by
microsoft
0%
997
Python puzzle dataset for AI programming proficiency research
Created 4 years ago
Updated 1 year ago
RedPajama-Data
by
togethercomputer
0.1%
5k
Dataset pipeline for training large language models
Starred by
+8
Created 2 years ago
Updated 1 year ago
unstructured
by
Unstructured-IO
0.5%
14k
ETL solution for structuring unstructured data for language models
Starred by
+12
Created 3 years ago
Updated 1 week ago
whisper
by
openai
0.3%
96k
Speech recognition model for multilingual transcription/translation
Starred by
+41
Created 3 years ago
Updated 2 months ago
LLaMA_MPS
by
jankais3r
0%
585
LLM inference on Apple Silicon GPUs
Starred by
Created 3 years ago
Updated 3 years ago
dolly
by
databrickslabs
0.0%
11k
Instruction-following LLM trained on the Databricks Machine Learning Platform
Starred by
+15
Created 3 years ago
Updated 2 years ago
minimal-llama
by
zphang
0%
457
Code for running and fine-tuning LLaMA models
Starred by
Created 3 years ago
Updated 2 years ago
zero_shot_cot
by
kojima-takeshi188
0.2%
441
Reasoning framework for LLMs, based on a NeurIPS 2022 paper
Starred by
Created 3 years ago
Updated 2 years ago
safari
by
HazyResearch
0%
913
Research paper implementations for sequence modeling with convolutions
Starred by
+2
Created 3 years ago
Updated 1 year ago
EasyLM
by
young-geng
0.2%
3k
LLM training/finetuning framework in JAX/Flax
Starred by
+9
Created 3 years ago
Updated 1 year ago
AlpacaDataCleaned
by
gururise
0.1%
2k
Cleaned dataset for Alpaca LLM training
Starred by
+4
Created 3 years ago
Updated 6 days ago
trl
by
huggingface
0.5%
18k
Library for transformer RL
Starred by
+28
Created 6 years ago
Updated 14 hours ago
ThoughtSource
by
OpenBioLink
0.1%
1k
Framework for chain-of-thought reasoning data and tools
Starred by
Created 3 years ago
Updated 1 year ago
GPT-4-LLM
by
Instruction-Tuning-with-GPT-4
0%
4k
GPT-4 data for instruction-tuning LLMs via supervised/RL
Starred by
+5
Created 2 years ago
Updated 2 years ago
lit-llama
by
Lightning-AI
0.0%
6k
LLaMA implementation for pretraining, finetuning, and inference
Starred by
+5
Created 3 years ago
Updated 8 months ago
AutoGPT
by
Significant-Gravitas
0.1%
182k
AI agent platform for building, deploying, and running autonomous workflows
Starred by
+56
Created 3 years ago
Updated 13 hours ago
LLaMA-Adapter
by
OpenGVLab
0%
6k
Efficient fine-tuning for instruction-following LLaMA models
Starred by
+3
Created 3 years ago
Updated 2 years ago
pygpt4all
by
nomic-ai
0%
1k
Python bindings for local LLM inference (deprecated)
Starred by
Created 2 years ago
Updated 2 years ago
chatllama
by
henrywoo
0.1%
1k
Open-source implementation for LLaMA-based ChatGPT, runnable on a single GPU
Created 3 years ago
Updated 1 year ago
optimate
by
nebuly-ai
0%
8k
Collection of libraries to optimize AI model performances
Starred by
+3
Created 4 years ago
Updated 1 year ago
GPTeacher
by
teknium1
0.1%
2k
GPT-4 generated datasets for instruction tuning
Starred by
+1
Created 2 years ago
Updated 2 years ago
chatgpt-universe
by
cedrickchee
0%
381
Collection of ChatGPT, GPT, and LLM resources
Created 3 years ago
Updated 1 year ago
langchain
by
langchain-ai
0.7%
129k
Framework for building LLM-powered applications
Starred by
+83
Created 3 years ago
Updated 21 hours ago
xTuring
by
stochasticai
0.1%
3k
SDK for fine-tuning and customizing open-source LLMs
Starred by
+3
Created 3 years ago
Updated 1 week ago
ai-pdf-chatbot-langchain
by
mayooear
0.1%
16k
AI chatbot agent for PDF document Q&A using LangChain & LangGraph
Starred by
+3
Created 3 years ago
Updated 1 year ago
natbot
by
nat
0.1%
2k
Browser automation via GPT-3
Starred by
+7
Created 3 years ago
Updated 1 year ago
ReAct
by
ysymyth
0.7%
4k
GPT-3 prompting code for ReAct research paper
Starred by
+2
Created 3 years ago
Updated 2 years ago
ChatGLM-finetune-LoRA
by
lich99
0%
718
LoRA finetuning code for ChatGLM-6b
Starred by
Created 3 years ago
Updated 2 years ago
Llama-X
by
AetherCortex
0%
2k
Open academic research project improving LLaMA to SOTA LLM
Starred by
Created 2 years ago
Updated 2 years ago
flash-attention
by
Dao-AILab
0.8%
23k
Fast, memory-efficient attention implementation
Starred by
+31
Created 3 years ago
Updated 16 hours ago
FastChat
by
lm-sys
0.0%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 3 years ago
Updated 9 months ago
text-generation-inference
by
huggingface
0.1%
11k
Rust/Python/gRPC server for fast LLM text generation
Starred by
+35
Created 3 years ago
Updated 2 months ago
ChatDoctor
by
Kent0n-Li
0.0%
4k
Medical chat model fine-tuned on LLaMA for medical domain Q&A
Starred by
Created 3 years ago
Updated 1 year ago
gpt4all
by
nomic-ai
0.1%
77k
Desktop app for local LLM inference, no GPU/API needed
Starred by
+29
Created 3 years ago
Updated 9 months ago
toolformer-pytorch
by
lucidrains
0%
2k
Pytorch implementation of Toolformer for language models using external tools
Starred by
+2
Created 3 years ago
Updated 1 year ago
text-generation-webui
by
oobabooga
0.2%
46k
Web UI for LLM text generation
Starred by
+24
Created 3 years ago
Updated 17 hours ago
gptq
by
IST-DASLab
0.1%
2k
Code for GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers
Starred by
+3
Created 3 years ago
Updated 1 year ago
PaLM-rlhf-pytorch
by
lucidrains
0%
8k
RLHF implementation on PaLM
Starred by
+5
Created 3 years ago
Updated 5 months ago
trlx
by
CarperAI
0.0%
5k
Distributed RLHF for LLMs
Starred by
+16
Created 3 years ago
Updated 2 years ago
alpaca_lora_4bit
by
johnsmith0031
0%
535
Fine-tuning and inference tool for quantized LLaMA models
Starred by
Created 3 years ago
Updated 2 years ago
chatgpt-retrieval-plugin
by
openai
0.0%
21k
Retrieval plugin for custom GPTs, function calling, or assistants APIs
Starred by
+23
Created 3 years ago
Updated 1 year ago
GPTQ-for-LLaMa
by
qwopqwop200
0%
3k
4-bit quantization for LLaMA models using GPTQ
Starred by
+2
Created 3 years ago
Updated 1 year ago
dalai
by
cocktailpeanut
0%
13k
Local LLM inference via CLI tool and Node.js API
Starred by
+4
Created 3 years ago
Updated 1 year ago
alpaca-lora
by
tloen
0.0%
19k
LoRA fine-tuning for LLaMA
Starred by
+22
Created 3 years ago
Updated 1 year ago
stanford_alpaca
by
tatsu-lab
0.0%
30k
Instruction-following LLaMA model training and data generation
Starred by
+25
Created 3 years ago
Updated 1 year ago
ColossalAI
by
hpcaitech
0.0%
41k
AI system for large-scale parallel training
Starred by
+25
Created 4 years ago
Updated 4 days ago
agentic
by
transitive-bullshit
0.0%
18k
AI agent stdlib for LLM-based TypeScript tooling
Starred by
+7
Created 3 years ago
Updated 1 month ago
dagger
by
dagger
0.2%
16k
Open-source runtime for composable workflows, ideal for AI agents
Starred by
+8
Created 6 years ago
Updated 1 day ago
sdk-python
by
temporalio
0.4%
983
Python SDK for Temporal, a distributed orchestration engine
Starred by
Created 4 years ago
Updated 23 hours ago
docker-lambda
by
lambci
0%
6k
Deprecated: Docker images for replicating the AWS Lambda environment locally
Starred by
+5
Created 9 years ago
Updated 3 years ago
kong
by
Kong
0.1%
43k
Cloud-native API and AI gateway for microservice orchestration
Starred by
+18
Created 11 years ago
Updated 3 days ago
awesome-machine-learning
by
josephmisiti
0.1%
72k
Curated list of ML frameworks, libraries, and software
Starred by
+24
Created 11 years ago
Updated 1 month ago
hackathon-starter
by
sahat
0.0%
35k
Node.js boilerplate for web applications
Starred by
+11
Created 12 years ago
Updated 1 day ago
Feedback? Help us improve.