Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Wing Lian
Wing Lian
Founder of Axolotl AI
GitHub
X
Starred Projects (352)
mHC-manifold-constrained-hyper-connections
by
tokenbender
6.7%
269
Research implementation of manifold-constrained hyper-connections for deep learning models
Created 3 weeks ago
Updated 3 weeks ago
IQuest-Coder-V1
by
IQuestLab
2.2%
1k
Code LLMs for autonomous software engineering
Created 3 weeks ago
Updated 1 day ago
OpenTinker
by
open-tinker
3.1%
602
RL-as-a-Service infrastructure for foundation models
Starred by
+2
Created 1 month ago
Updated 2 days ago
punica
by
punica-ai
0%
1k
LoRA serving system (research paper) for multi-tenant LLM inference
Starred by
+3
Created 2 years ago
Updated 1 year ago
mLoRA
by
TUDB-Labs
0.3%
368
Framework for efficient LoRA fine-tuning of multiple LLMs
Created 2 years ago
Updated 11 months ago
miles
by
radixark
4.6%
770
Enterprise RL for large-scale MoE models
Starred by
+3
Created 3 months ago
Updated 1 day ago
ROLL
by
alibaba
1.6%
3k
RL library for large language models
Starred by
Created 8 months ago
Updated 1 day ago
Kimi-Linear
by
MoonshotAI
0.2%
1k
Efficient linear attention architecture accelerates long-context LLMs
Created 3 months ago
Updated 2 months ago
Fast-dLLM
by
NVlabs
0.9%
800
Diffusion LLM inference acceleration framework
Starred by
Created 8 months ago
Updated 2 months ago
auto-round
by
intel
1.7%
830
Quantization algorithm for LLMs and VLMs
Starred by
Created 2 years ago
Updated 1 day ago
luminal
by
luminal-ai
0.5%
3k
Deep learning library using composable compilers for high performance
Starred by
Created 2 years ago
Updated 1 day ago
MARS
by
AGI-Arena
0%
714
Optimization framework for training large models
Created 1 year ago
Updated 3 months ago
DeepResearch
by
Alibaba-NLP
0.4%
18k
Benchmark for LLMs in web traversal
Starred by
+1
Created 1 year ago
Updated 6 days ago
gemlite
by
dropbox
0%
423
Triton kernels for efficient low-bit matrix multiplication
Starred by
Created 1 year ago
Updated 1 month ago
AgentGym-RL
by
WooooDyy
1.4%
568
Train LLM agents for long-horizon, multi-turn decision-making
Starred by
Created 4 months ago
Updated 4 months ago
LlamaGym
by
KhoomeiK
0.1%
1k
SDK for fine-tuning LLM agents with online reinforcement learning
Starred by
Created 1 year ago
Updated 1 year ago
flame
by
fla-org
1.8%
341
Minimal, efficient framework for LLM training
Starred by
Created 1 year ago
Updated 2 months ago
Soft-Thinking
by
eric-ai-lab
2.0%
304
Enhancing LLM reasoning via continuous concept spaces
Created 8 months ago
Updated 1 day ago
DFT
by
yongliang-wu
0%
526
Improving SFT generalization with reward rectification
Starred by
Created 5 months ago
Updated 3 weeks ago
dion
by
microsoft
0.2%
420
Orthonormal updates for faster distributed ML training
Created 8 months ago
Updated 1 week ago
mixture_of_recursions
by
raymin0223
0.6%
538
Adaptive LLM computation with dynamic recursion
Created 7 months ago
Updated 4 months ago
gem
by
axon-rl
1.1%
437
Agentic LLM training environment for interactive reinforcement learning
Starred by
Created 8 months ago
Updated 6 days ago
cc
by
kn1026
0.4%
703
Starred by
Created 6 months ago
Updated 6 months ago
HRM
by
sapientinc
0.2%
12k
Hierarchical reasoning for complex tasks
Starred by
Created 6 months ago
Updated 4 months ago
RL2
by
ChenmienTan
2.4%
1k
Reinforcement learning for large language models
Starred by
+1
Created 9 months ago
Updated 1 week ago
matmulfreellm
by
ridgerchu
0%
3k
MatMul-free language models
Starred by
+2
Created 1 year ago
Updated 1 month ago
applied-ai
by
meta-pytorch
0%
314
Applied AI experiments and examples for PyTorch
Starred by
Created 2 years ago
Updated 5 months ago
COAT
by
NVlabs
0.4%
258
FP8 training framework for memory efficiency
Created 1 year ago
Updated 5 months ago
SkyRL
by
NovaSky-AI
1.9%
1k
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
Starred by
+13
Created 9 months ago
Updated 1 day ago
scattermoe
by
shawntan
0.4%
262
Triton-based Sparse Mixture-of-Experts for efficient deep learning
Starred by
Created 1 year ago
Updated 3 months ago
rStar
by
microsoft
0.1%
1k
Research paper repo for math reasoning in small LLMs via deep thinking
Starred by
Created 1 year ago
Updated 4 months ago
Skills
by
NVIDIA-NeMo
2.2%
788
LLM skill-improvement pipelines for synthetic data generation, training, and evaluation
Starred by
Created 1 year ago
Updated 1 day ago
Absolute-Zero-Reasoner
by
LeapLabTHU
0.2%
2k
Self-play reasoning framework needing zero data
Starred by
Created 9 months ago
Updated 5 months ago
DeepSeekRL-Extended
by
brendanhogan
0%
251
GRPO implementation for scaled RL research
Starred by
Created 11 months ago
Updated 5 months ago
open-webui
by
open-webui
0.6%
122k
Self-hosted AI platform for local LLM deployment
Starred by
+24
Created 2 years ago
Updated 3 days ago
TTRL
by
PRIME-RL
0.6%
961
RL technique for unlabeled data, especially test data
Created 9 months ago
Updated 4 months ago
axolotl
by
axolotl-ai-cloud
0.4%
11k
CLI tool for streamlined post-training of AI models
Starred by
+26
Created 2 years ago
Updated 4 days ago
agno
by
agno-agi
0.5%
37k
Lightweight library for building AI Agents with memory, knowledge, and reasoning
Starred by
+9
Created 3 years ago
Updated 1 day ago
github-mcp-server
by
github
0.9%
26k
MCP server for GitHub API automation and interaction
Starred by
Created 10 months ago
Updated 2 days ago
loong
by
camel-ai
0.6%
483
Synthetic data generation project using LLM agents
Created 10 months ago
Updated 4 days ago
SWE-Gym
by
SWE-Gym
0.6%
622
Environment for training software engineering agents
Starred by
+2
Created 1 year ago
Updated 6 months ago
GamingAgent
by
lmgame-org
0.2%
847
SDK for LLM/VLM gaming agents, enabling model evaluation via games
Starred by
Created 11 months ago
Updated 2 months ago
LLaDA
by
ML-GSAI
0.7%
4k
LLM research paper exploring masked diffusion language models
Starred by
Created 11 months ago
Updated 2 months ago
recurrent-pretraining
by
seal-rg
0%
859
Pretraining code for depth-recurrent language model research
Starred by
Created 11 months ago
Updated 4 weeks ago
TransMLA
by
MuLabPKU
0.5%
428
Post-training method converts GQA-based LLMs to MLA models
Created 1 year ago
Updated 4 months ago
MLGym
by
facebookresearch
0%
583
Gym environment for ML research agents
Starred by
Created 11 months ago
Updated 5 months ago
native-sparse-attention-triton
by
XunhaoLai
0.4%
261
Efficient sparse attention for LLMs
Created 11 months ago
Updated 8 months ago
coconut
by
facebookresearch
0.7%
1k
Research paper implementation for LLM reasoning in latent space
Starred by
Created 1 year ago
Updated 5 months ago
native-sparse-attention-pytorch
by
lucidrains
0%
793
Sparse attention implementation from Deepseek's research paper
Created 11 months ago
Updated 5 months ago
ReasonFlux
by
Gen-Verse
0.2%
515
LLM post-training algorithms for data selection, RL, and inference
Created 11 months ago
Updated 4 months ago
LIMO
by
GAIR-NLP
0%
1k
Reasoning model using less data
Starred by
Created 11 months ago
Updated 6 months ago
s1
by
simplescaling
0.1%
7k
Test-time scaling recipe for strong reasoning performance
Starred by
+8
Created 1 year ago
Updated 7 months ago
reasoning-gym
by
open-thought
1.1%
1k
Procedural dataset generator for reasoning models
Starred by
+5
Created 1 year ago
Updated 1 week ago
curator
by
bespokelabsai
0.5%
2k
Synthetic data curation tool for post-training and structured data extraction
Starred by
Created 1 year ago
Updated 3 days ago
RAGEN
by
mll-lab-nu
0.4%
2k
Train LLM agents with reinforcement learning in interactive environments
Starred by
Created 1 year ago
Updated 2 days ago
SkyThought
by
NovaSky-AI
0.1%
3k
Training recipes for Sky-T1 family of models
Starred by
+4
Created 1 year ago
Updated 6 months ago
search-and-learn
by
huggingface
0%
1k
Recipes to scale inference-time compute of open models
Starred by
+1
Created 1 year ago
Updated 8 months ago
buffer-of-thought-llm
by
YangLing0818
0.1%
677
Research paper implementation for thought-augmented LLM reasoning
Created 1 year ago
Updated 7 months ago
HuatuoGPT-o1
by
FreedomIntelligence
0.1%
1k
Medical LLM for advanced reasoning
Created 1 year ago
Updated 1 year ago
LayerSkip
by
facebookresearch
0.3%
355
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding" research paper
Created 1 year ago
Updated 3 days ago
markitdown
by
microsoft
0.5%
86k
Python tool for converting files to Markdown for LLM text analysis
Starred by
+17
Created 1 year ago
Updated 2 weeks ago
NeMo-Aligner
by
NVIDIA
0.1%
848
Toolkit for efficient model alignment
Starred by
+1
Created 2 years ago
Updated 3 months ago
agency-swarm
by
VRSEN
0.1%
4k
Agentic framework built on OpenAI Assistants API for automating AI workflows
Starred by
Created 2 years ago
Updated 1 day ago
instructlab
by
instructlab
0.1%
1k
CLI tool for LLM alignment tuning via synthetic data
Starred by
Created 1 year ago
Updated 1 week ago
flash-linear-attention
by
fla-org
0.8%
4k
Efficient Torch/Triton implementations for linear attention models
Starred by
+8
Created 2 years ago
Updated 4 days ago
TokenFormer
by
Haiyang-W
0.3%
584
Research paper on a fully attention-based neural network with tokenized model parameters
Created 1 year ago
Updated 11 months ago
evaluation-guidebook
by
huggingface
0.2%
2k
LLM evaluation guide for practitioners
Starred by
+3
Created 1 year ago
Updated 1 month ago
dynasaur
by
adobe-research
0.3%
353
LLM agent framework using dynamic action creation via Python code generation
Starred by
Created 1 year ago
Updated 1 year ago
Marco-o1
by
AIDC-AI
0%
2k
Open reasoning model for real-world problem solving
Created 1 year ago
Updated 8 months ago
SageAttention
by
thu-ml
1.1%
3k
Attention kernel for plug-and-play inference acceleration
Starred by
Created 1 year ago
Updated 1 week ago
metaflow
by
Netflix
0.2%
10k
Framework for building and managing AI/ML systems
Starred by
+10
Created 6 years ago
Updated 2 days ago
Muon
by
KellerJordan
1.0%
2k
Optimizer for neural network hidden layers
Starred by
Created 1 year ago
Updated 1 week ago
MathBlackBox
by
trotsky1997
0%
1k
Research paper for mathematical reasoning via LLMs
Starred by
+1
Created 1 year ago
Updated 1 year ago
BitNet
by
microsoft
0.2%
26k
Inference framework for 1-bit LLMs
Starred by
+8
Created 1 year ago
Updated 7 months ago
Aria
by
rhymes-ai
0%
1k
Multimodal MoE model for video, document understanding, and dialog
Starred by
Created 1 year ago
Updated 1 year ago
Hands-On-Large-Language-Models
by
HandsOnLLM
0.7%
20k
Code examples for "Hands-On Large Language Models" book
Starred by
Created 1 year ago
Updated 1 month ago
modded-nanogpt
by
KellerJordan
7.4%
4k
Language model training speedrun on 8x H100 GPUs
Starred by
+7
Created 1 year ago
Updated 1 week ago
llama-stack
by
llamastack
0.1%
8k
Composable building blocks for Llama apps
Starred by
+7
Created 1 year ago
Updated 1 day ago
Adam-mini
by
zyushun
0.4%
451
PyTorch implementation of Adam-mini optimizer from a research paper
Starred by
Created 1 year ago
Updated 8 months ago
optillm
by
algorithmicsuperintelligence
0.2%
3k
Optimizing inference proxy for LLMs
Starred by
+8
Created 1 year ago
Updated 1 month ago
LLM-Blender
by
yuchenlin
0.2%
975
LLM ensembling framework using pairwise ranking and generative fusion
Starred by
+3
Created 2 years ago
Updated 1 year ago
LLMs-Planning
by
karthikv792
0.9%
445
Benchmark for evaluating LLMs on planning tasks
Created 3 years ago
Updated 4 months ago
rStar
by
zhentingqi
0%
971
Research paper for improving small LLM reasoning via mutual reasoning
Starred by
Created 1 year ago
Updated 1 year ago
distributed-training-guide
by
LambdaLabsML
0.9%
571
PyTorch guide for distributed training of large language models
Starred by
Created 1 year ago
Updated 3 months ago
nyuntam
by
nyunAI
0%
677
CLI tool for LLM compression via pruning, quantization, and distillation
Created 1 year ago
Updated 1 year ago
MisguidedAttention
by
cpldcpu
0.2%
455
LLM reasoning benchmark for evaluating responses to misleading prompts
Starred by
Created 1 year ago
Updated 6 months ago
Open-Reasoning-Tasks
by
NousResearch
0.4%
457
Reasoning tasks collection for LLMs
Starred by
+3
Created 1 year ago
Updated 1 year ago
long-context-attention
by
feifeibear
0.8%
634
Unified sequence parallel attention for long context LLM training/inference
Starred by
Created 1 year ago
Updated 1 week ago
DistillKit
by
arcee-ai
1.6%
836
Open-source toolkit for LLM distillation research
Starred by
Created 1 year ago
Updated 1 month ago
do-not-answer
by
Libr-AI
1.0%
309
Dataset for evaluating LLM safety mechanisms
Starred by
Created 2 years ago
Updated 1 year ago
fms-fsdp
by
foundation-model-stack
0.7%
280
Efficiently train foundation models with PyTorch
Starred by
Created 2 years ago
Updated 2 months ago
OLMo
by
allenai
0.1%
6k
Open language model code for training, evaluation, and inference
Starred by
+4
Created 2 years ago
Updated 2 months ago
BAdam
by
Ledzy
0%
283
Memory-efficient optimizer for large language model finetuning
Starred by
Created 1 year ago
Updated 10 months ago
snowflake-arctic
by
Snowflake-Labs
0%
558
AI research project for efficient LLM training and inference
Starred by
Created 1 year ago
Updated 1 year ago
open-instruct
by
allenai
0.5%
4k
Training codebase for instruction-following language models
Starred by
+10
Created 2 years ago
Updated 1 day ago
mdistiller
by
megvii-research
0.2%
888
PyTorch library for knowledge distillation research
Created 3 years ago
Updated 2 years ago
augmentoolkit
by
e-p-armstrong
0.2%
2k
Data toolkit for custom LLM creation using open-source AI
Starred by
+3
Created 2 years ago
Updated 2 months ago
awesome-synthetic-datasets
by
davanstrien
0%
321
Curated list of synthetic text/vision datasets and generation tools
Created 1 year ago
Updated 2 weeks ago
calm
by
zeux
0.2%
625
Single-GPU inference engine for rapid LLM prototyping
Starred by
Created 2 years ago
Updated 8 months ago
MobileLLM
by
facebookresearch
0.1%
1k
Sub-billion parameter LLM training code for on-device use
Starred by
+2
Created 1 year ago
Updated 9 months ago
phoenix
by
Arize-ai
0.9%
8k
AI observability platform for experimentation, evaluation, and troubleshooting
Starred by
+6
Created 3 years ago
Updated 1 day ago
SPPO
by
uclaml
0%
583
Self-Play Preference Optimization (SPPO) aligns language models via self-play
Starred by
Created 1 year ago
Updated 1 year ago
AutoIF
by
QwenLM
0%
321
Research paper for improving LLM instruction-following via self-play with execution feedback
Starred by
Created 1 year ago
Updated 1 year ago
refusal_direction
by
andyrdt
0.9%
333
Research paper code for analyzing refusal in language models
Starred by
Created 1 year ago
Updated 7 months ago
YaFSDP
by
yandex
0.1%
984
Sharded data parallelism framework for transformer-like neural networks
Starred by
Created 1 year ago
Updated 1 month ago
chat_templates
by
chujiezheng
0%
713
Chat templates for HuggingFace LLMs
Starred by
Created 2 years ago
Updated 1 year ago
LESS
by
princeton-nlp
0.2%
512
Data selection research paper for targeted instruction tuning
Starred by
Created 2 years ago
Updated 1 year ago
MixEval
by
JinjieNi
0%
254
Dynamic LLM evaluation suite for accurate, cost-effective benchmarking
Starred by
Created 1 year ago
Updated 1 year ago
MoRA
by
kongds
0%
363
Parameter-efficient fine-tuning via high-rank updating (MoRA)
Starred by
Created 1 year ago
Updated 1 year ago
SimPO
by
princeton-nlp
0.2%
943
Preference optimization algorithm for LLMs (NeurIPS 2024 paper)
Starred by
Created 1 year ago
Updated 11 months ago
qodo-cover
by
qodo-ai
0.1%
5k
CLI tool for AI-powered test generation and code coverage enhancement
Starred by
Created 1 year ago
Updated 7 months ago
gemma-2B-10M
by
mustafaaljadery
0%
937
Gemma 2B with 10M context length using Infini-attention
Starred by
Created 1 year ago
Updated 1 year ago
xtuner
by
InternLM
0.1%
5k
LLM fine-tuning toolkit for research
Starred by
+2
Created 2 years ago
Updated 1 day ago
GLiNER
by
urchade
0.8%
3k
NER model for identifying any entity type using bidirectional transformer
Starred by
Created 2 years ago
Updated 1 week ago
contriever
by
facebookresearch
0.1%
768
Unsupervised dense information retrieval via contrastive learning
Starred by
Created 4 years ago
Updated 2 years ago
prometheus-eval
by
prometheus-eval
0.2%
1k
LLM evaluation framework using open LLMs
Starred by
Created 1 year ago
Updated 9 months ago
LLMTest_NeedleInAHaystack
by
gkamradt
0.4%
2k
LLM testing tool for evaluating in-context retrieval accuracy
Starred by
+3
Created 2 years ago
Updated 1 year ago
selfcodealign
by
bigcode-project
0%
323
Research paper for self-alignment in code generation
Starred by
Created 1 year ago
Updated 11 months ago
llm-datasets
by
mlabonne
0.5%
4k
Curated datasets/tools for LLM post-training
Starred by
+1
Created 1 year ago
Updated 2 months ago
rerope
by
bojone
0%
387
Position embeddings research paper
Starred by
Created 2 years ago
Updated 1 year ago
LaVague
by
lavague-ai
0.1%
6k
Web agent framework for automating web processes
Starred by
+7
Created 1 year ago
Updated 1 year ago
ring-flash-attention
by
zhuzilin
0.5%
969
FlashAttention extension for ring attention
Starred by
+2
Created 1 year ago
Updated 4 months ago
llamaduo
by
deep-diver
0%
314
LLMOps pipeline to fine-tune small LLMs for service LLM outage prep
Starred by
Created 1 year ago
Updated 6 months ago
cohere-toolkit
by
cohere-ai
0.1%
3k
RAG toolkit for LLM application development and deployment
Starred by
+4
Created 1 year ago
Updated 4 days ago
uptrain
by
uptrain-ai
0.1%
2k
Open-source platform to evaluate and improve GenAI apps
Starred by
+5
Created 3 years ago
Updated 1 year ago
BitBLAS
by
microsoft
0.3%
747
Library for mixed-precision matrix multiplications, targeting quantized LLM deployment
Created 1 year ago
Updated 5 months ago
arena-hard-auto
by
lmarena
0.2%
988
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 2 years ago
Updated 7 months ago
ChunkLlama
by
HKUNLP
0.2%
445
Training-free method for extending LLM context windows
Created 1 year ago
Updated 1 year ago
dstack
by
dstackai
0.2%
2k
Open-source tool for simplifying GPU allocation and AI workload orchestration
Starred by
+3
Created 4 years ago
Updated 1 day ago
rho
by
microsoft
0.4%
456
LLM pretraining research paper using selective language modeling (SLM)
Starred by
Created 1 year ago
Updated 1 year ago
dify
by
langgenius
0.6%
127k
Open-source LLM app development platform
Starred by
+17
Created 2 years ago
Updated 1 day ago
attorch
by
BobMcDear
0.2%
595
PyTorch nn module subset, implemented in Python using Triton
Starred by
+2
Created 2 years ago
Updated 5 months ago
mixtral-offloading
by
dvmazur
0%
2k
Inference optimization for Mixtral-8x7B models
Starred by
Created 2 years ago
Updated 1 year ago
auto-code-rover
by
AutoCodeRoverSG
0.1%
3k
Autonomous software engineer for program improvement
Starred by
+3
Created 1 year ago
Updated 9 months ago
BitNet-Transformers
by
Beomi
0%
310
HuggingFace Transformers implementation of BitNet scaling for LLMs
Created 2 years ago
Updated 1 year ago
EasyContext
by
jzhang38
0%
750
Recipes for language model context length extrapolation to 1M tokens
Starred by
+2
Created 1 year ago
Updated 1 year ago
pyreft
by
stanfordnlp
0.1%
2k
Python library for representation finetuning (ReFT) of language models
Starred by
Created 1 year ago
Updated 1 week ago
hlb-gpt
by
tysam-code
0%
355
Researcher's toolbench for GPT model exploration
Starred by
Created 2 years ago
Updated 1 year ago
aideml
by
WecoAI
0.3%
1k
ML engineering agent for automated AI R&D, surpassing human experts
Starred by
Created 1 year ago
Updated 2 months ago
BitNet
by
kyegomez
0.1%
2k
PyTorch implementation of BitNet research paper
Starred by
Created 2 years ago
Updated 1 week ago
horovod
by
horovod
0.1%
15k
Distributed training framework for TF, Keras, PyTorch, and MXNet
Starred by
+19
Created 8 years ago
Updated 1 month ago
dataverse
by
UpstageAI
0%
565
ETL pipeline for LLM data processing
Starred by
Created 2 years ago
Updated 1 year ago
hqq
by
dropbox
0.1%
907
Model quantizer for fast, accurate post-training quantization, skipping calibration
Starred by
Created 2 years ago
Updated 1 month ago
Triton-Puzzles
by
srush
1.0%
2k
Interactive puzzles for learning Triton
Starred by
Created 1 year ago
Updated 1 year ago
repeng
by
vgel
0%
676
Python library for representation engineering control vectors
Starred by
Created 2 years ago
Updated 4 months ago
cobra
by
h-zhao1997
0%
293
Multimodal LLM research paper extending Mamba for efficient inference
Created 1 year ago
Updated 1 year ago
hackathon
by
mistralai-sf24
0%
446
Minimal code for running and finetuning a 7B transformer model
Starred by
Created 1 year ago
Updated 1 year ago
raft
by
rapidsai
0.4%
974
CUDA-accelerated primitives for ML/data mining algorithms
Starred by
Created 6 years ago
Updated 6 days ago
maestro
by
Doriandarko
0.1%
4k
Framework for Claude Opus to orchestrate subagents
Starred by
Created 1 year ago
Updated 1 year ago
quiet-star
by
ezelikman
0%
741
Research code for self-teaching language models
Starred by
Created 1 year ago
Updated 1 year ago
ml-engineering
by
stas00
0.7%
17k
Open book for LLM/VLM training engineers
Starred by
+17
Created 5 years ago
Updated 3 days ago
chatbot-ui
by
mckaywrigley
0.1%
33k
Open-source AI chat app
Starred by
+14
Created 2 years ago
Updated 1 year ago
orpo
by
xfactlab
0.2%
470
Preference optimization without a reference model
Starred by
Created 1 year ago
Updated 1 year ago
SWE-bench
by
SWE-bench
1.1%
4k
Benchmark for evaluating LLMs on real-world GitHub issues
Starred by
+12
Created 2 years ago
Updated 4 days ago
OpenHands
by
OpenHands
0.5%
67k
AI platform for software development agents
Starred by
+36
Created 1 year ago
Updated 1 day ago
FastV
by
pkunlp-icler
0.4%
548
Inference acceleration for large vision-language models (research paper)
Created 1 year ago
Updated 1 year ago
airllm
by
lyogavin
20.4%
9k
Inference optimization for LLMs on low-resource hardware
Starred by
Created 2 years ago
Updated 4 months ago
daytona
by
daytonaio
4.8%
50k
Infrastructure for running AI-generated code
Starred by
+4
Created 2 years ago
Updated 1 day ago
VisionLLaMA
by
Meituan-AutoML
0%
390
Vision transformer research paper
Created 1 year ago
Updated 1 year ago
fsdp_qlora
by
AnswerDotAI
0%
2k
Training script for LLMs using QLoRA + FSDP
Starred by
+3
Created 2 years ago
Updated 1 year ago
h2o-llmstudio
by
h2oai
0.1%
5k
LLM Studio: framework for LLM fine-tuning via GUI or CLI
Starred by
+5
Created 2 years ago
Updated 3 days ago
ChatMusician
by
hf-lin
0%
295
LLM for music understanding and generation
Created 2 years ago
Updated 1 year ago
AnyGPT
by
OpenMOSS
0.2%
866
Multimodal LLM research paper for any-to-any modality conversion
Starred by
Created 1 year ago
Updated 1 year ago
FlagEmbedding
by
FlagOpen
0.3%
11k
Toolkit for retrieval and RAG applications
Starred by
+8
Created 2 years ago
Updated 1 month ago
self-rewarding-lm-pytorch
by
lucidrains
0.1%
1k
Training framework for self-rewarding language models
Starred by
+4
Created 2 years ago
Updated 1 year ago
crewAI
by
crewAIInc
0.6%
43k
Framework for autonomous AI agent orchestration via role-playing and collaboration
Starred by
+18
Created 2 years ago
Updated 1 day ago
resource-stream
by
gpu-mode
0.9%
2k
CUDA resource collection for GPU programming
Starred by
Created 2 years ago
Updated 4 months ago
metal-flash-attention
by
philipturner
0.3%
576
Metal port of FlashAttention for Apple silicon
Starred by
+2
Created 2 years ago
Updated 1 year ago
LLMs-from-scratch
by
rasbt
0.5%
84k
Educational resource for LLM construction in PyTorch
Starred by
+11
Created 2 years ago
Updated 1 week ago
mlx-examples
by
ml-explore
0.3%
8k
Examples using the MLX framework
Starred by
+7
Created 2 years ago
Updated 1 month ago
ai-codereviewer
by
villesau
0.2%
995
GitHub Action for AI-powered code review
Starred by
Created 2 years ago
Updated 1 year ago
deita
by
hkust-nlp
0.5%
584
Data-efficient instruction tuning for LLM alignment (ICLR 2024)
Starred by
Created 2 years ago
Updated 1 year ago
AutoAWQ
by
casper-hansen
0.1%
2k
AutoAWQ is a tool for 4-bit quantized LLM inference
Starred by
+5
Created 2 years ago
Updated 8 months ago
ProxyAI
by
carlrobertoh
0.2%
2k
JetBrains IDE copilot for coding assistance
Starred by
Created 2 years ago
Updated 1 week ago
EAGLE
by
SafeAILab
0.8%
2k
Speculative decoding research paper for faster LLM inference
Starred by
+5
Created 2 years ago
Updated 2 weeks ago
HALOs
by
ContextualAI
0%
899
Library for aligning LLMs using human-aware loss functions
Starred by
Created 2 years ago
Updated 3 months ago
mamba
by
state-spaces
0.3%
17k
Mamba SSM architecture for sequence modeling
Starred by
+22
Created 2 years ago
Updated 2 weeks ago
modelz-llm
by
tensorchord
0%
275
Inference server for open-source LLMs, offering an OpenAI-compatible API
Created 2 years ago
Updated 2 years ago
unsloth
by
unslothai
0.6%
51k
Finetuning tool for LLMs, targeting speed and memory efficiency
Starred by
+38
Created 2 years ago
Updated 2 days ago
gpt-researcher
by
assafelovic
0.5%
25k
Autonomous agent for web/local research, generating cited reports
Starred by
+9
Created 2 years ago
Updated 2 days ago
functionary
by
MeetKai
0%
2k
Chat language model for tool use and result interpretation
Starred by
+2
Created 2 years ago
Updated 1 month ago
Logic-LLM
by
teacherpeterpan
0.5%
375
Logic-LM: Framework for improved logical reasoning via LLMs and symbolic solvers
Created 2 years ago
Updated 1 year ago
LLMSurvey
by
RUCAIBox
0.1%
12k
Survey paper for large language models
Starred by
+2
Created 2 years ago
Updated 10 months ago
distilabel
by
argilla-io
0.4%
3k
Framework for synthetic data and AI feedback pipelines
Starred by
+12
Created 2 years ago
Updated 1 week ago
long-llms-learning
by
Strivin0311
0%
272
Literature repository for long-context LLM methodologies
Starred by
Created 2 years ago
Updated 1 year ago
MergeLM
by
yule-BUAA
0.1%
863
Codebase for merging language models via parameter averaging
Starred by
Created 2 years ago
Updated 1 year ago
Video-LLaVA
by
PKU-YuanGroup
0.3%
3k
Video-LLaVA: Multimodal model for video/image understanding via LLM
Starred by
Created 2 years ago
Updated 1 year ago
medAlpaca
by
kbressem
0%
549
LLM finetuned for medical question answering
Starred by
Created 2 years ago
Updated 2 years ago
intel-extension-for-transformers
by
intel
0%
2k
Transformer toolkit for GenAI/LLM acceleration on Intel platforms
Starred by
Created 3 years ago
Updated 1 year ago
representation-engineering
by
andyzoujm
0.5%
942
AI transparency via representation engineering
Starred by
Created 2 years ago
Updated 1 year ago
multimodal
by
facebookresearch
0.2%
2k
PyTorch library for multimodal multi-task model training
Starred by
+1
Created 4 years ago
Updated 1 week ago
S-LoRA
by
S-LoRA
0.1%
2k
System for scalable LoRA adapter serving
Starred by
+1
Created 2 years ago
Updated 2 years ago
DeepSpeed
by
deepspeedai
0.2%
41k
Deep learning optimization library for distributed training and inference
Starred by
+36
Created 6 years ago
Updated 1 day ago
continue
by
continuedev
0.5%
31k
IDE extension for custom AI code assistants
Starred by
+16
Created 2 years ago
Updated 1 day ago
llama-cookbook
by
meta-llama
0.1%
18k
Guide for building with Llama models
Starred by
+15
Created 2 years ago
Updated 2 months ago
finetuner
by
jina-ai
0%
2k
Cloud tool for task-oriented embedding finetuning of models like BERT and CLIP
Starred by
+3
Created 4 years ago
Updated 1 year ago
ludwig
by
ludwig-ai
0.0%
12k
Low-code framework for custom AI models (LLMs, neural networks)
Starred by
+17
Created 7 years ago
Updated 1 week ago
img2dataset
by
rom1504
0.0%
4k
CLI tool for creating large image datasets from URLs
Starred by
+12
Created 4 years ago
Updated 3 months ago
distilling-step-by-step
by
google-research
0.3%
579
Code for research paper on knowledge distillation
Starred by
Created 2 years ago
Updated 2 years ago
Cherry_LLM
by
tianyi-lab
0.2%
414
Research paper for LLM instruction tuning via self-guided data selection
Created 2 years ago
Updated 7 months ago
Reflection_Tuning
by
tianyi-lab
0%
366
Research paper for LLM instruction tuning via data recycling
Starred by
Created 2 years ago
Updated 1 year ago
instructor
by
567-labs
0.6%
12k
SDK for structured LLM outputs using Pydantic models
Starred by
+27
Created 2 years ago
Updated 1 day ago
YiVal
by
YiVal
0%
2k
Prompt engineering assistant for GenAI apps
Starred by
Created 2 years ago
Updated 1 year ago
LLM-Shearing
by
princeton-nlp
0.3%
637
Code for LLM pre-training acceleration via structured pruning (ICLR 2024)
Starred by
+1
Created 2 years ago
Updated 1 year ago
letta
by
letta-ai
0.5%
21k
Agent framework for stateful agents with memory, reasoning, and context management
Starred by
+19
Created 2 years ago
Updated 1 week ago
CogVLM
by
zai-org
0.0%
7k
VLM for image understanding and multi-turn dialogue
Starred by
+4
Created 2 years ago
Updated 1 year ago
ragas
by
vibrantlabsai
0.7%
12k
Toolkit for LLM application evaluation
Starred by
+12
Created 2 years ago
Updated 5 days ago
NEFTune
by
neelsjain
0%
410
Technique to improve instruction finetuning of LLMs
Starred by
Created 2 years ago
Updated 1 year ago
FireAct
by
anchen1011
0%
291
Language agent fine-tuning research paper
Starred by
Created 2 years ago
Updated 2 years ago
LLaVA
by
haotian-liu
0.2%
24k
Multimodal assistant with GPT-4 level capabilities
Starred by
+16
Created 2 years ago
Updated 1 year ago
alignment-handbook
by
huggingface
0.1%
5k
Handbook for aligning language models with human/AI preferences
Starred by
+11
Created 2 years ago
Updated 4 months ago
autolabel
by
refuel-ai
0%
2k
Python library to label text datasets using LLMs
Starred by
+4
Created 2 years ago
Updated 10 months ago
EmpatheticDialogues
by
facebookresearch
0.2%
536
PyTorch code for empathetic dialogue research
Starred by
Created 6 years ago
Updated 4 years ago
world-models
by
wesg52
0%
258
Research paper code for extracting spatial/temporal world models from LLMs
Starred by
Created 2 years ago
Updated 2 years ago
OpenGPT
by
CogStack
0%
361
Framework for grounded instruction datasets and domain-expert LLMs
Starred by
Created 2 years ago
Updated 2 years ago
Medusa
by
FasterDecoding
0.1%
3k
Framework for accelerating LLM generation using multiple decoding heads
Starred by
+6
Created 2 years ago
Updated 1 year ago
open_flamingo
by
mlfoundations
0.0%
4k
Open-source framework for training large multimodal models
Starred by
+7
Created 3 years ago
Updated 1 year ago
textbook_quality
by
VikParuchuri
0%
509
Synthetic data generator for LLM pretraining
Starred by
Created 2 years ago
Updated 2 years ago
tree-of-thought-llm
by
princeton-nlp
0.2%
6k
Research paper implementation for Tree of Thoughts (ToT) prompting
Starred by
+7
Created 2 years ago
Updated 1 year ago
LongLoRA
by
JIA-Lab-research
0.0%
3k
LongLoRA: Efficient fine-tuning for long-context LLMs
Starred by
+1
Created 2 years ago
Updated 1 year ago
kani
by
zhudotexe
0%
598
Microframework for chat-based language models with tool use/function calling
Starred by
Created 2 years ago
Updated 1 week ago
DoLa
by
voidism
0%
533
Decoding strategy research paper for improving factuality in LLMs
Starred by
Created 2 years ago
Updated 1 year ago
varuna
by
microsoft
0%
252
Tool for efficient large DNN model training on commodity hardware
Starred by
Created 4 years ago
Updated 1 year ago
BLoRA
by
sabetAI
0%
349
Inference optimization for batched LoRA adapters
Starred by
Created 2 years ago
Updated 2 years ago
TinyLlama
by
jzhang38
0.1%
9k
Tiny pretraining project for a 1.1B Llama model
Starred by
+18
Created 2 years ago
Updated 1 year ago
sparsegpt
by
IST-DASLab
0.1%
865
Code for massive language model one-shot pruning (ICML 2023 paper)
Starred by
Created 2 years ago
Updated 1 year ago
LLM-Pruner
by
horseee
0.2%
1k
LLM structural pruner for model compression
Created 2 years ago
Updated 1 year ago
graph-of-thoughts
by
spcl
0.3%
3k
Graph-of-Thoughts: LLM framework for complex problem-solving
Starred by
+1
Created 2 years ago
Updated 1 year ago
tensor_parallel
by
BlackSamorez
0.1%
656
PyTorch module for multi-GPU model parallelism
Starred by
Created 3 years ago
Updated 2 years ago
relora
by
Guitaricet
0%
474
PEFT pretraining code for ReLoRA research paper
Starred by
Created 2 years ago
Updated 1 year ago
wandbot
by
wandb
0%
309
Support bot for Weights & Biases' AI tools, running in Discord, Slack, ChatGPT, and Zendesk
Starred by
Created 2 years ago
Updated 3 months ago
LightLLM
by
ModelTC
0.2%
4k
Python framework for LLM inference and serving
Starred by
+6
Created 2 years ago
Updated 1 day ago
lmdeploy
by
InternLM
0.3%
8k
Toolkit for LLM compression, deployment, and serving
Starred by
+8
Created 2 years ago
Updated 4 days ago
llama-chat
by
replicate
0%
835
Next.js app for Llama 3 chat UI development
Created 2 years ago
Updated 1 month ago
llama2-chatbot
by
a16z-infra
0%
1k
Streamlit chatbot app for interacting with LLMs
Starred by
Created 2 years ago
Updated 2 years ago
IncognitoPilot
by
silvanmelchior
0.2%
440
AI code interpreter for local data processing, like ChatGPT Code Interpreter
Created 2 years ago
Updated 2 years ago
ai-town
by
a16z-infra
0.3%
9k
AI town starter kit for building a virtual world
Starred by
+12
Created 2 years ago
Updated 2 weeks ago
octopack
by
bigcode-project
0%
479
Code LLM instruction tuning research paper
Starred by
+2
Created 2 years ago
Updated 11 months ago
outlines
by
dottxt-ai
0.3%
13k
SDK for structured LLM text generation
Starred by
+34
Created 2 years ago
Updated 4 days ago
bubogpt
by
magic-research
0%
511
Multi-modal LLM for joint text, vision, and audio understanding
Created 2 years ago
Updated 2 years ago
MetaGPT
by
FoundationAgents
0.4%
63k
Multi-agent framework for collaborative AI software development
Starred by
+10
Created 2 years ago
Updated 6 days ago
pykoi
by
CambioML
0%
412
Python library for reinforcement learning with human feedback (RLHF)
Starred by
Created 2 years ago
Updated 4 months ago
ChainFury
by
NimbleBoxAI
0%
450
Open-source chaining engine for production AI apps
Starred by
Created 2 years ago
Updated 1 year ago
candle
by
huggingface
0.4%
19k
Minimalist ML framework for Rust, emphasizing performance and ease of use
Starred by
+23
Created 2 years ago
Updated 3 days ago
Megatron-LLM
by
epfLLM
0%
588
Distributed trainer for LLMs
Starred by
Created 2 years ago
Updated 1 year ago
ToolBench
by
OpenBMB
0.4%
5k
Open platform for LLM tool learning (ICLR'24 spotlight)
Starred by
+6
Created 2 years ago
Updated 8 months ago
gpt-engineer
by
AntonOsika
0.1%
55k
CLI platform for code generation experimentation
Starred by
+17
Created 2 years ago
Updated 8 months ago
RRHF
by
GanjinZero
0.1%
809
RRHF for aligning LLMs to human preferences
Starred by
Created 2 years ago
Updated 2 years ago
LlamaFactory
by
hiyouga
0.6%
66k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Starred by
+25
Created 2 years ago
Updated 2 days ago
exllama
by
turboderp
0.0%
3k
Llama implementation for memory-efficient quantized weights
Starred by
+6
Created 2 years ago
Updated 2 years ago
doremi
by
sangmichaelxie
0%
350
PyTorch for optimizing data mixtures in language model datasets
Starred by
Created 2 years ago
Updated 2 years ago
UltraChat
by
thunlp
0.2%
3k
Multi-round dialogue dataset and models for chat language model training
Starred by
Created 2 years ago
Updated 1 year ago
RealChar
by
Shaunwei
0.0%
6k
Real-time AI character/companion creation and interaction codebase
Starred by
+3
Created 2 years ago
Updated 1 week ago
serve
by
jina-ai
0.1%
22k
Framework for building cloud-native multimodal AI apps
Starred by
+17
Created 6 years ago
Updated 10 months ago
aider
by
Aider-AI
0.5%
40k
AI pair programming in your terminal
Starred by
+37
Created 2 years ago
Updated 1 week ago
LMFlow
by
OptimalScale
0.0%
9k
Toolkit for finetuning and inference of large foundation models
Starred by
+9
Created 2 years ago
Updated 2 weeks ago
baize-chatbot
by
project-baize
0%
3k
Chat model trained via LoRA, using ChatGPT-generated dialogs
Starred by
+3
Created 2 years ago
Updated 1 year ago
ToolQA
by
night-chen
0%
285
Dataset for evaluating LLMs using external tools
Created 2 years ago
Updated 2 years ago
SuperAGI
by
TransformerOptimus
0.2%
17k
Open-source framework for autonomous AI agent development
Starred by
+4
Created 2 years ago
Updated 1 year ago
audiocraft
by
facebookresearch
0.1%
23k
PyTorch library for audio processing and generation research
Starred by
+15
Created 2 years ago
Updated 10 months ago
guidance
by
guidance-ai
0.1%
21k
Guidance is a programming paradigm for steering LLMs
Starred by
+38
Created 3 years ago
Updated 5 days ago
open_llama
by
openlm-research
0.1%
8k
Open-source reproduction of LLaMA models
Starred by
+14
Created 2 years ago
Updated 2 years ago
RL4LMs
by
allenai
0.1%
2k
RL library to fine-tune language models to human preferences
Starred by
+3
Created 3 years ago
Updated 1 year ago
SwiftSage
by
SwiftSage
0.3%
324
Agent system for reasoning with LLMs via in-context reinforcement learning
Created 2 years ago
Updated 1 year ago
ctransformers
by
marella
0.2%
2k
Python bindings for fast Transformer model inference
Starred by
+8
Created 2 years ago
Updated 2 years ago
developer
by
smol-ai
0.0%
12k
Agent for embedding a developer in your app
Starred by
+27
Created 2 years ago
Updated 1 year ago
MeZO
by
princeton-nlp
0.2%
1k
Research paper implementation for memory-efficient LM fine-tuning
Starred by
Created 2 years ago
Updated 2 years ago
ImageBind
by
facebookresearch
0.1%
9k
PyTorch implementation for multimodal embeddings research paper
Starred by
+5
Created 2 years ago
Updated 2 months ago
xtreme1
by
xtreme1-io
0.3%
1k
Open-source platform for multimodal training data annotation
Starred by
Created 3 years ago
Updated 6 months ago
sudolang
by
paralleldrive
0.1%
1k
VS Code extension for LLM-based programming with SudoLang
Starred by
Created 2 years ago
Updated 1 week ago
poe-api
by
ading2210
0%
2k
Python API for Quora's Poe (unmaintained)
Created 2 years ago
Updated 2 years ago
Local-LLM-Comparison-Colab-UI
by
Troyanovsky
0.1%
1k
Local LLM comparison via Colab WebUI links
Starred by
Created 2 years ago
Updated 2 weeks ago
airoboros
by
jondurbin
0%
1k
Self-instruct tool for LLM finetuning
Starred by
+3
Created 2 years ago
Updated 1 year ago
PaLM
by
conceptofmind
0.1%
820
Open-source PaLM implementation for language model research
Starred by
Created 2 years ago
Updated 1 year ago
TruthfulQA
by
sylinrl
0.6%
875
Benchmark dataset for evaluating truthfulness of language models
Starred by
Created 4 years ago
Updated 1 year ago
private-gpt
by
zylon-ai
0.1%
57k
Private AI API for local document interaction using LLMs
Starred by
+13
Created 2 years ago
Updated 1 year ago
PMC-LLaMA
by
chaoyi-wu
0%
675
Medical LLM for instruction-following in the medical domain
Created 2 years ago
Updated 1 year ago
openlm
by
r2d4
0%
372
OpenAI-compatible Python client for calling LLMs
Starred by
+1
Created 2 years ago
Updated 2 years ago
FasterTransformer
by
NVIDIA
0.1%
6k
Optimized transformer library for inference
Starred by
+12
Created 4 years ago
Updated 1 year ago
unlimiformer
by
abertsch72
0.1%
1k
Research paper for long-range transformers with unlimited input
Starred by
+1
Created 2 years ago
Updated 1 year ago
gpt-neox
by
EleutherAI
0.1%
7k
Framework for training large-scale autoregressive language models
Starred by
+22
Created 5 years ago
Updated 1 month ago
toolformer
by
conceptofmind
0%
380
Open-source implementation of Toolformer research paper
Starred by
Created 2 years ago
Updated 2 years ago
bark
by
suno-ai
0.1%
39k
Generative audio model for realistic speech and sound effects
Starred by
+19
Created 2 years ago
Updated 1 year ago
chat-langchain
by
langchain-ai
0.1%
6k
Chatbot for question answering over LangChain documentation
Starred by
+3
Created 3 years ago
Updated 4 weeks ago
LaMini-LM
by
mbzuai-nlp
0%
823
Small, efficient language models distilled from ChatGPT for research
Starred by
Created 2 years ago
Updated 2 years ago
ChatRWKV
by
BlinkDL
0.0%
10k
Open-source chatbot powered by the RWKV RNN language model
Starred by
+4
Created 3 years ago
Updated 3 days ago
RWKV-LM
by
BlinkDL
0.1%
14k
RNN for LLM, transformer-level performance, parallelizable training
Starred by
+29
Created 4 years ago
Updated 3 days ago
LocalAI
by
mudler
0.5%
42k
Open-source OpenAI alternative for local AI inference
Starred by
+14
Created 2 years ago
Updated 1 day ago
WizardLM
by
nlpxucan
0.0%
9k
LLMs built using Evol-Instruct for complex instruction following
Starred by
+15
Created 2 years ago
Updated 7 months ago
chameleon-llm
by
lupantech
0%
1k
Research paper code for plug-and-play compositional reasoning with LLMs
Starred by
Created 2 years ago
Updated 2 years ago
llama-lab
by
run-llama
0.1%
2k
LlamaIndex projects for LLM data augmentation
Starred by
Created 2 years ago
Updated 2 years ago
EdgeGPT
by
acheong08
0.0%
8k
Reverse-engineered API for Microsoft Bing Chat (archived)
Starred by
Created 3 years ago
Updated 2 years ago
gisting
by
jayelm
0%
303
Research paper implementation for prompt compression via learned "gist" tokens
Starred by
Created 2 years ago
Updated 11 months ago
gpt-llama.cpp
by
keldenl
0.2%
597
API wrapper for local LLM inference, emulating OpenAI's GPT endpoints
Starred by
Created 2 years ago
Updated 2 years ago
memit
by
kmeng01
0%
535
Transformer memory mass-editor (ICLR 2023 research paper)
Starred by
Created 3 years ago
Updated 2 years ago
dl4math
by
lupantech
0%
370
DL4MATH: Deep learning resources for mathematical reasoning
Created 3 years ago
Updated 2 years ago
MiniGPT-4
by
Vision-CAIR
0.0%
26k
Vision-language model for multi-task learning
Starred by
+15
Created 2 years ago
Updated 1 year ago
auto-cot
by
amazon-science
0.1%
2k
Research paper implementation for automatic chain-of-thought prompting
Starred by
Created 3 years ago
Updated 1 year ago
OpenChatKit
by
togethercomputer
0.0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 2 years ago
Updated 1 year ago
PythonProgrammingPuzzles
by
microsoft
0.2%
995
Python puzzle dataset for AI programming proficiency research
Created 4 years ago
Updated 1 year ago
RedPajama-Data
by
togethercomputer
0.1%
5k
Dataset pipeline for training large language models
Starred by
+8
Created 2 years ago
Updated 1 year ago
unstructured
by
Unstructured-IO
0.4%
14k
ETL solution for structuring unstructured data for language models
Starred by
+12
Created 3 years ago
Updated 2 days ago
whisper
by
openai
0.3%
94k
Speech recognition model for multilingual transcription/translation
Starred by
+40
Created 3 years ago
Updated 1 month ago
LLaMA_MPS
by
jankais3r
0%
585
LLM inference on Apple Silicon GPUs
Starred by
Created 2 years ago
Updated 2 years ago
dolly
by
databrickslabs
0.0%
11k
Instruction-following LLM trained on the Databricks Machine Learning Platform
Starred by
+15
Created 2 years ago
Updated 2 years ago
minimal-llama
by
zphang
0%
457
Code for running and fine-tuning LLaMA models
Starred by
Created 2 years ago
Updated 2 years ago
zero_shot_cot
by
kojima-takeshi188
0.2%
437
Reasoning framework for LLMs, based on a NeurIPS 2022 paper
Starred by
Created 3 years ago
Updated 2 years ago
safari
by
HazyResearch
0%
909
Research paper implementations for sequence modeling with convolutions
Starred by
+2
Created 2 years ago
Updated 1 year ago
EasyLM
by
young-geng
0.1%
3k
LLM training/finetuning framework in JAX/Flax
Starred by
+9
Created 3 years ago
Updated 1 year ago
AlpacaDataCleaned
by
gururise
0%
2k
Cleaned dataset for Alpaca LLM training
Starred by
+4
Created 2 years ago
Updated 2 years ago
trl
by
huggingface
0.6%
17k
Library for transformer RL
Starred by
+28
Created 5 years ago
Updated 2 days ago
ThoughtSource
by
OpenBioLink
0%
1k
Framework for chain-of-thought reasoning data and tools
Starred by
Created 3 years ago
Updated 1 year ago
GPT-4-LLM
by
Instruction-Tuning-with-GPT-4
0%
4k
GPT-4 data for instruction-tuning LLMs via supervised/RL
Starred by
+5
Created 2 years ago
Updated 2 years ago
lit-llama
by
Lightning-AI
0%
6k
LLaMA implementation for pretraining, finetuning, and inference
Starred by
+5
Created 2 years ago
Updated 7 months ago
AutoGPT
by
Significant-Gravitas
0.1%
181k
AI agent platform for building, deploying, and running autonomous workflows
Starred by
+56
Created 2 years ago
Updated 1 day ago
LLaMA-Adapter
by
OpenGVLab
0%
6k
Efficient fine-tuning for instruction-following LLaMA models
Starred by
+3
Created 2 years ago
Updated 1 year ago
pygpt4all
by
nomic-ai
0%
1k
Python bindings for local LLM inference (deprecated)
Starred by
Created 2 years ago
Updated 2 years ago
chatllama
by
henrywoo
0%
1k
Open-source implementation for LLaMA-based ChatGPT, runnable on a single GPU
Created 2 years ago
Updated 1 year ago
optimate
by
nebuly-ai
0%
8k
Collection of libraries to optimize AI model performances
Starred by
+3
Created 4 years ago
Updated 1 year ago
GPTeacher
by
teknium1
0%
2k
GPT-4 generated datasets for instruction tuning
Starred by
+1
Created 2 years ago
Updated 2 years ago
chatgpt-universe
by
cedrickchee
0.3%
380
Collection of ChatGPT, GPT, and LLM resources
Created 3 years ago
Updated 1 year ago
langchain
by
langchain-ai
0.5%
125k
Framework for building LLM-powered applications
Starred by
+83
Created 3 years ago
Updated 1 day ago
xTuring
by
stochasticai
0%
3k
SDK for fine-tuning and customizing open-source LLMs
Starred by
+3
Created 2 years ago
Updated 5 days ago
ai-pdf-chatbot-langchain
by
mayooear
0.1%
16k
AI chatbot agent for PDF document Q&A using LangChain & LangGraph
Starred by
+3
Created 2 years ago
Updated 11 months ago
natbot
by
nat
0%
2k
Browser automation via GPT-3
Starred by
+7
Created 3 years ago
Updated 1 year ago
ReAct
by
ysymyth
0.9%
3k
GPT-3 prompting code for ReAct research paper
Starred by
+2
Created 3 years ago
Updated 2 years ago
ChatGLM-finetune-LoRA
by
lich99
0%
719
LoRA finetuning code for ChatGLM-6b
Starred by
Created 2 years ago
Updated 2 years ago
Llama-X
by
AetherCortex
0%
2k
Open academic research project improving LLaMA to SOTA LLM
Starred by
Created 2 years ago
Updated 2 years ago
flash-attention
by
Dao-AILab
0.7%
22k
Fast, memory-efficient attention implementation
Starred by
+31
Created 3 years ago
Updated 2 days ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 2 years ago
Updated 7 months ago
text-generation-inference
by
huggingface
0.1%
11k
Rust/Python/gRPC server for fast LLM text generation
Starred by
+35
Created 3 years ago
Updated 2 weeks ago
ChatDoctor
by
Kent0n-Li
0%
4k
Medical chat model fine-tuned on LLaMA for medical domain Q&A
Starred by
Created 2 years ago
Updated 1 year ago
gpt4all
by
nomic-ai
0.0%
77k
Desktop app for local LLM inference, no GPU/API needed
Starred by
+29
Created 2 years ago
Updated 8 months ago
toolformer-pytorch
by
lucidrains
0%
2k
Pytorch implementation of Toolformer for language models using external tools
Starred by
+2
Created 3 years ago
Updated 1 year ago
text-generation-webui
by
oobabooga
0.1%
46k
Web UI for LLM text generation
Starred by
+24
Created 3 years ago
Updated 1 week ago
gptq
by
IST-DASLab
0.0%
2k
Code for GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers
Starred by
+3
Created 3 years ago
Updated 1 year ago
PaLM-rlhf-pytorch
by
lucidrains
0.0%
8k
RLHF implementation on PaLM
Starred by
+5
Created 3 years ago
Updated 3 months ago
trlx
by
CarperAI
0.1%
5k
Distributed RLHF for LLMs
Starred by
+16
Created 3 years ago
Updated 2 years ago
alpaca_lora_4bit
by
johnsmith0031
0%
535
Fine-tuning and inference tool for quantized LLaMA models
Starred by
Created 2 years ago
Updated 2 years ago
chatgpt-retrieval-plugin
by
openai
0.0%
21k
Retrieval plugin for custom GPTs, function calling, or assistants APIs
Starred by
+23
Created 2 years ago
Updated 1 year ago
GPTQ-for-LLaMa
by
qwopqwop200
0%
3k
4-bit quantization for LLaMA models using GPTQ
Starred by
+2
Created 2 years ago
Updated 1 year ago
dalai
by
cocktailpeanut
0%
13k
Local LLM inference via CLI tool and Node.js API
Starred by
+4
Created 2 years ago
Updated 1 year ago
alpaca-lora
by
tloen
0.0%
19k
LoRA fine-tuning for LLaMA
Starred by
+22
Created 2 years ago
Updated 1 year ago
stanford_alpaca
by
tatsu-lab
0.0%
30k
Instruction-following LLaMA model training and data generation
Starred by
+25
Created 2 years ago
Updated 1 year ago
ColossalAI
by
hpcaitech
0.1%
41k
AI system for large-scale parallel training
Starred by
+25
Created 4 years ago
Updated 1 week ago
agentic
by
transitive-bullshit
0.1%
18k
AI agent stdlib for LLM-based TypeScript tooling
Starred by
+7
Created 3 years ago
Updated 3 months ago
dagger
by
dagger
0.2%
15k
Open-source runtime for composable workflows, ideal for AI agents
Starred by
+8
Created 6 years ago
Updated 1 day ago
sdk-python
by
temporalio
0.6%
938
Python SDK for Temporal, a distributed orchestration engine
Starred by
Created 4 years ago
Updated 3 days ago
docker-lambda
by
lambci
0.0%
6k
Deprecated: Docker images for replicating the AWS Lambda environment locally
Starred by
+5
Created 9 years ago
Updated 3 years ago
kong
by
Kong
0.1%
43k
Cloud-native API and AI gateway for microservice orchestration
Starred by
+18
Created 11 years ago
Updated 1 week ago
awesome-machine-learning
by
josephmisiti
0.1%
71k
Curated list of ML frameworks, libraries, and software
Starred by
+24
Created 11 years ago
Updated 3 weeks ago
hackathon-starter
by
sahat
0.0%
35k
Node.js boilerplate for web applications
Starred by
+11
Created 12 years ago
Updated 2 days ago
Feedback? Help us improve.