Discover and explore top open-source AI tools and projects—updated daily.
Pawel Garbacki
Cofounder of Fireworks AI
Starred Projects (322)
DeepSeek-OCR
by
deepseek-ai
1.1%
21k
Context-aware OCR model for visual-text compression
Starred by
+3
Created 1 month ago
Updated 1 month ago
NeuralFlow
by
valine
0%
381
Python script for visualizing Mistral 7B intermediate layer outputs
Created 1 year ago
Updated 10 months ago
MagiAttention
by
SandAI-org
1.6%
570
Distributed attention mechanism research paper for ultra-long context, heterogeneous data training
Starred by
Created 7 months ago
Updated 2 days ago
tinker-cookbook
by
thinking-machines-lab
3.8%
2k
Advanced LLM fine-tuning SDK and example cookbook
Starred by
+2
Created 4 months ago
Updated 5 days ago
torchtitan
by
pytorch
0.5%
5k
PyTorch platform for generative AI model training research
Starred by
+11
Created 1 year ago
Updated 1 day ago
checkpoint-engine
by
MoonshotAI
0.8%
849
Middleware for efficient LLM weight updates during inference
Starred by
+3
Created 2 months ago
Updated 6 days ago
SWE-bench
by
SWE-bench
0.8%
4k
Benchmark for evaluating LLMs on real-world GitHub issues
Starred by
+11
Created 2 years ago
Updated 2 weeks ago
slime
by
THUDM
2.3%
3k
LLM post-training framework for RL scaling
Starred by
+4
Created 5 months ago
Updated 1 day ago
Step-Audio2
by
stepfun-ai
1.0%
1k
End-to-end audio understanding and speech conversation model
Created 4 months ago
Updated 2 months ago
inspect_ai
by
UKGovernmentBEIS
1.6%
2k
Framework for large language model evaluations
Starred by
+5
Created 2 years ago
Updated 1 day ago
openbench
by
groq
0.9%
667
Provider-agnostic LLM evaluation infrastructure
Starred by
+3
Created 4 months ago
Updated 2 days ago
LaCT
by
a1600012888
1.3%
324
Test-Time Training framework for adaptable models
Starred by
Created 6 months ago
Updated 1 week ago
SkyRL
by
NovaSky-AI
4.0%
1k
RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks
Starred by
+12
Created 7 months ago
Updated 4 days ago
ART
by
OpenPipe
0.7%
8k
RL library for training LLM agents via GRPO
Starred by
+8
Created 8 months ago
Updated 3 days ago
torch-profiling-tutorial
by
Quentin-Anthony
0.4%
532
PyTorch model profiling tutorial
Created 4 months ago
Updated 3 months ago
OpenHands
by
OpenHands
0.2%
65k
AI platform for software development agents
Starred by
+36
Created 1 year ago
Updated 23 hours ago
ERNIE
by
PaddlePaddle
0.1%
8k
PaddlePaddle implementations for ERNIE family pre-training models
Starred by
Created 6 years ago
Updated 2 days ago
codex
by
openai
0.7%
51k
Coding agent CLI tool for terminal-based chat-driven development
Starred by
+33
Created 7 months ago
Updated 1 day ago
claude-code
by
anthropics
1.8%
44k
Agentic coding assistant for your terminal
Starred by
+15
Created 9 months ago
Updated 3 days ago
opencode
by
opencode-ai
0.2%
10k
CLI tool for terminal-based AI coding assistance
Starred by
+11
Created 8 months ago
Updated 2 months ago
gemini-cli
by
google-gemini
1.3%
85k
AI agent for terminal workflows
Starred by
+27
Created 7 months ago
Updated 1 day ago
MiniMax-M1
by
MiniMax-AI
0.2%
3k
Open-weight reasoning model with hybrid attention
Starred by
Created 5 months ago
Updated 4 months ago
RAGEN
by
mll-lab-nu
0.7%
2k
Train LLM agents with reinforcement learning in interactive environments
Starred by
Created 10 months ago
Updated 2 days ago
VoRA
by
Hon-Wong
1.4%
353
MLLM with visual capabilities
Created 8 months ago
Updated 5 months ago
awesome-instruction-datasets
by
jianzhnie
0%
710
Curated list of instruction datasets for training ChatLLMs
Created 2 years ago
Updated 1 year ago
xLAM
by
SalesforceAIResearch
0.2%
584
xLAM is a family of large action models for AI agent systems
Starred by
Created 1 year ago
Updated 3 months ago
deer-flow
by
bytedance
1.0%
18k
Deep research framework combining language models with specialized tools
Starred by
+3
Created 6 months ago
Updated 1 day ago
arena-hard-auto
by
lmarena
0.3%
963
Automatic LLM benchmark for instruction-tuned models, correlating with human preference
Starred by
+6
Created 2 years ago
Updated 5 months ago
atropos
by
NousResearch
0.3%
756
RL environment framework for LLM trajectory collection/evaluation
Starred by
+1
Created 7 months ago
Updated 4 days ago
12-factor-agents
by
humanlayer
0.7%
16k
Principles for reliable LLM application development
Starred by
+4
Created 8 months ago
Updated 2 months ago
DAPO
by
BytedTsinghua-SIA
0.8%
2k
Open-source RL system for large-scale LLM training
Starred by
Created 8 months ago
Updated 6 months ago
system-prompts-and-models-of-ai-tools
by
x1xhlol
1.5%
98k
AI tool system prompts and models
Starred by
+9
Created 9 months ago
Updated 1 day ago
rllm
by
rllm-org
0.6%
5k
Framework for post-training language agents via reinforcement learning
Starred by
+2
Created 10 months ago
Updated 4 days ago
ring-flash-attention
by
zhuzilin
0.2%
923
FlashAttention extension for ring attention
Starred by
+1
Created 1 year ago
Updated 2 months ago
understand-r1-zero
by
sail-sg
0.4%
1k
Research paper analyzing R1-Zero-like training for LLMs
Starred by
Created 8 months ago
Updated 3 months ago
dynamo
by
ai-dynamo
0.8%
6k
Inference framework for distributed generative AI model serving
Starred by
+7
Created 9 months ago
Updated 21 hours ago
openai-agents-python
by
openai
0.7%
18k
Python SDK for multi-agent workflows
Starred by
+10
Created 8 months ago
Updated 3 days ago
OpenManus
by
FoundationAgents
0.2%
51k
Open-source framework for building general AI agents
Starred by
+2
Created 8 months ago
Updated 1 week ago
verl
by
volcengine
3.1%
17k
RL training library for LLMs
Starred by
+14
Created 1 year ago
Updated 1 day ago
Search-R1
by
PeterGriffinJin
0.8%
4k
RL framework for training LLMs to use search engines
Starred by
+2
Created 9 months ago
Updated 2 weeks ago
s1
by
simplescaling
0.1%
7k
Test-time scaling recipe for strong reasoning performance
Starred by
+8
Created 10 months ago
Updated 5 months ago
open-r1
by
huggingface
0.1%
26k
SDK for reproducing DeepSeek-R1
Starred by
+17
Created 10 months ago
Updated 6 days ago
openllmetry
by
traceloop
0.3%
7k
Open-source observability SDK for LLM applications
Starred by
+11
Created 2 years ago
Updated 3 days ago
Kimi-k1.5
by
MoonshotAI
0.0%
3k
Research paper on scaling reinforcement learning with LLMs
Starred by
+3
Created 10 months ago
Updated 8 months ago
UI-TARS
by
bytedance
0.5%
8k
Multimodal agent for GUI interaction in virtual worlds (research paper)
Starred by
+3
Created 10 months ago
Updated 2 weeks ago
DeepSeek-R1
by
deepseek-ai
0.1%
92k
Reasoning models research paper
Starred by
+16
Created 10 months ago
Updated 5 months ago
unsloth
by
unslothai
0.5%
49k
Finetuning tool for LLMs, targeting speed and memory efficiency
Starred by
+38
Created 2 years ago
Updated 23 hours ago
ml-cross-entropy
by
apple
0.7%
555
PyTorch module for memory-efficient cross-entropy in LLMs
Starred by
+1
Created 1 year ago
Updated 2 months ago
Liger-Kernel
by
linkedin
0.4%
6k
Triton kernels for efficient LLM training
Starred by
+8
Created 1 year ago
Updated 2 days ago
MiniMax-01
by
MiniMax-AI
0.1%
3k
Large language & vision-language models based on linear attention
Starred by
+2
Created 10 months ago
Updated 4 months ago
tabby
by
TabbyML
0.1%
33k
Self-hosted AI coding assistant for on-prem code completion
Starred by
+17
Created 2 years ago
Updated 4 days ago
continue
by
continuedev
0.4%
30k
IDE extension for custom AI code assistants
Starred by
+16
Created 2 years ago
Updated 1 day ago
SkyThought
by
NovaSky-AI
0.0%
3k
Training recipes for Sky-T1 family of models
Starred by
+4
Created 10 months ago
Updated 4 months ago
dspy
by
stanfordnlp
0.5%
30k
Framework for programming, rather than prompting, language models
Starred by
+49
Created 2 years ago
Updated 4 days ago
storm
by
stanford-oval
0.1%
28k
LLM system for automated knowledge curation and article generation
Starred by
+5
Created 1 year ago
Updated 2 months ago
open-computer-use
by
e2b-dev
0.6%
2k
AI agent for computer control via LLMs
Starred by
+1
Created 1 year ago
Updated 5 months ago
PaLM-rlhf-pytorch
by
lucidrains
0.1%
8k
RLHF implementation on PaLM
Starred by
+5
Created 3 years ago
Updated 1 month ago
picotron
by
huggingface
0.6%
2k
Minimalist distributed training framework for educational use
Starred by
+3
Created 1 year ago
Updated 3 months ago
PRIME
by
PRIME-RL
0.4%
2k
Scalable RL solution for advanced reasoning of language models
Starred by
+3
Created 11 months ago
Updated 8 months ago
sglang
by
sgl-project
0.9%
20k
Fast serving framework for LLMs and vision language models
Starred by
+34
Created 1 year ago
Updated 21 hours ago
DeepSeek-V3
by
deepseek-ai
0.1%
100k
MoE language model research paper with 671B total parameters
Starred by
+13
Created 11 months ago
Updated 3 months ago
gitingest
by
coderamp-labs
0.4%
13k
CLI tool for LLM-friendly code ingestion from Git repos
Starred by
Created 1 year ago
Updated 6 days ago
tau-bench
by
sierra-research
2.4%
971
Benchmark for tool-agent-user interaction research
Starred by
Created 1 year ago
Updated 3 months ago
Genesis
by
Genesis-Embodied-AI
0.2%
28k
Physics platform for robotics & embodied AI learning
Starred by
+12
Created 2 years ago
Updated 1 day ago
search-and-learn
by
huggingface
0%
1k
Recipes to scale inference-time compute of open models
Starred by
+1
Created 11 months ago
Updated 6 months ago
open-instruct
by
allenai
1.3%
3k
Training codebase for instruction-following language models
Starred by
+9
Created 2 years ago
Updated 1 day ago
desktop
by
e2b-dev
0.8%
1k
SDK for virtual desktop sandboxes for LLM-powered computer use
Starred by
Created 1 year ago
Updated 3 weeks ago
VLMEvalKit
by
open-compass
1.1%
3k
Evaluation toolkit for large multi-modality models (LMMs)
Created 2 years ago
Updated 3 days ago
Qwen3-VL
by
QwenLM
1.3%
17k
Multimodal LLM for vision-language tasks, document parsing, and agent functionality
Starred by
+7
Created 1 year ago
Updated 2 days ago
DocBank
by
doc-analysis
0.2%
629
Layout analysis dataset for document understanding tasks
Created 5 years ago
Updated 1 year ago
Qwen-VL
by
QwenLM
0.1%
6k
Vision-language model for multimodal understanding, localization, and text reading
Starred by
+1
Created 2 years ago
Updated 1 year ago
OSWorld
by
xlang-ai
1.0%
2k
Multimodal agent benchmark for open-ended tasks in realistic computer environments
Starred by
Created 2 years ago
Updated 1 week ago
SoM
by
microsoft
0.5%
1k
Visual prompting method for GPT-4V and LMMs
Starred by
Created 2 years ago
Updated 1 year ago
dynasaur
by
adobe-research
0%
349
LLM agent framework using dynamic action creation via Python code generation
Starred by
Created 1 year ago
Updated 11 months ago
browser-use
by
browser-use
0.4%
73k
SDK for AI agent browser control
Starred by
+28
Created 1 year ago
Updated 1 day ago
ShowUI
by
showlab
0.4%
2k
Vision-language-action model for GUI agent & computer use (CVPR 2025 paper)
Starred by
Created 1 year ago
Updated 6 months ago
WilmerAI
by
SomeOddCodeGuy
0.1%
789
AI inference router for specialized workflows
Starred by
Created 1 year ago
Updated 1 month ago
aisuite
by
andrewyng
0.2%
13k
Unified interface for multiple generative AI providers
Starred by
+5
Created 1 year ago
Updated 2 weeks ago
servers
by
modelcontextprotocol
0.6%
74k
Reference implementations for the Model Context Protocol (MCP) servers
Starred by
+21
Created 1 year ago
Updated 3 days ago
python-sdk
by
modelcontextprotocol
0.6%
20k
Python SDK for Model Context Protocol (MCP) servers/clients
Starred by
+4
Created 1 year ago
Updated 2 days ago
every-chatgpt-gui
by
billmei
0.3%
4k
Curated list of ChatGPT, Claude, and other LLM front-end GUI clients
Created 2 years ago
Updated 3 weeks ago
MinerU
by
opendatalab
0.7%
50k
PDF extraction tool for converting PDFs to Markdown and JSON
Starred by
Created 1 year ago
Updated 3 days ago
Qwen3-Coder
by
QwenLM
0.5%
14k
Code LLM for code completion, generation, and assistant use cases
Starred by
+8
Created 1 year ago
Updated 4 months ago
LLMxMapReduce
by
thunlp
0.2%
839
Framework for LLM long-sequence processing via MapReduce-inspired divide-and-conquer
Created 1 year ago
Updated 3 weeks ago
llm-app
by
pathwaycom
0.4%
48k
LLM app templates for RAG, AI pipelines, and enterprise search
Starred by
Created 2 years ago
Updated 1 month ago
docling
by
docling-project
1.4%
45k
Prepare documents for generative AI
Starred by
+12
Created 1 year ago
Updated 3 days ago
Qwen3
by
QwenLM
0.3%
26k
Large language model series by Qwen team, Alibaba Cloud
Starred by
+11
Created 1 year ago
Updated 1 month ago
meditron
by
epfLLM
0.1%
2k
Open-source medical LLMs adapted from Llama-2
Starred by
Created 2 years ago
Updated 1 year ago
OmniParser
by
microsoft
0.1%
24k
Screen parsing tool for vision-based GUI agents
Starred by
+6
Created 1 year ago
Updated 2 months ago
fast-apply
by
kortix-ai
0%
372
Pipeline for data generation and fine-tuning Qwen2.5 Coder models
Starred by
Created 1 year ago
Updated 2 months ago
Emu3
by
baaivision
0.2%
2k
Multimodal model for vision-language understanding and generation
Starred by
Created 1 year ago
Updated 1 week ago
together-cookbook
by
togethercomputer
0.1%
1k
Cookbook for open-source models via Together AI
Created 1 year ago
Updated 3 days ago
zerox
by
getomni-ai
0.1%
12k
OCR SDK for AI ingestion of documents with complex layouts
Starred by
Created 1 year ago
Updated 6 months ago
Janus
by
deepseek-ai
0.1%
18k
Unified multimodal model research paper for understanding and generation
Starred by
+4
Created 1 year ago
Updated 10 months ago
TransformerEngine
by
NVIDIA
0.6%
3k
Library for Transformer model acceleration on NVIDIA GPUs
Starred by
+4
Created 3 years ago
Updated 4 days ago
O1-Journey
by
GAIR-NLP
0.1%
2k
Research paper on replicating O1 via "journey learning"
Starred by
+1
Created 1 year ago
Updated 10 months ago
chunkr
by
lumina-ai-inc
0.1%
3k
Document intelligence API for RAG/LLM workflows
Starred by
Created 1 year ago
Updated 2 months ago
zep
by
getzep
0.6%
4k
Memory foundation for AI stacks, enabling continuous learning
Starred by
Created 2 years ago
Updated 1 week ago
ColBERT
by
stanford-futuredata
0.1%
4k
Neural search for fast, accurate retrieval over large text collections
Starred by
+8
Created 5 years ago
Updated 1 month ago
optillm
by
algorithmicsuperintelligence
0.9%
3k
Optimizing inference proxy for LLMs
Starred by
+7
Created 1 year ago
Updated 23 hours ago
LiveCodeBench
by
LiveCodeBench
0.7%
723
Benchmark for holistic LLM code evaluation
Starred by
Created 1 year ago
Updated 4 months ago
zml
by
zml
0.6%
3k
AI inference stack for production
Starred by
Created 1 year ago
Updated 2 days ago
Awesome-LLM-Strawberry
by
hijkzzz
0.1%
7k
Collection of LLM papers, blogs, and projects focused on reasoning techniques
Starred by
Created 1 year ago
Updated 1 month ago
GOT-OCR2.0
by
Ucas-HaoranWei
0.2%
8k
OCR research paper for unified end-to-end model
Created 1 year ago
Updated 9 months ago
colpali
by
illuin-tech
0.8%
2k
Vision-language model code for document retrieval research
Starred by
Created 1 year ago
Updated 2 days ago
SuperPrompt
by
NeoVertex1
0.1%
6k
Prompt engineering research for AI agent understanding
Starred by
Created 1 year ago
Updated 2 months ago
rStar
by
zhentingqi
0.2%
966
Research paper for improving small LLM reasoning via mutual reasoning
Starred by
Created 1 year ago
Updated 10 months ago
llama-stack-apps
by
llamastack
0.0%
4k
Agentic app examples built on Llama Stack
Starred by
+1
Created 1 year ago
Updated 3 months ago
ms-swift
by
modelscope
1.3%
11k
SDK for fine-tuning and deploying LLMs/MLLMs
Starred by
Created 2 years ago
Updated 23 hours ago
InternLM-XComposer
by
InternLM
0.1%
3k
Multimodal model for long-context video/audio interactions, image understanding, and composition
Starred by
Created 2 years ago
Updated 6 months ago
DisTrO
by
NousResearch
0%
966
Distributed optimizers research paper
Starred by
+1
Created 1 year ago
Updated 1 month ago
llama_cloud_services
by
run-llama
0.1%
4k
SDK for LlamaCloud GenAI services
Starred by
Created 1 year ago
Updated 3 days ago
DistillKit
by
arcee-ai
0.5%
785
Open-source toolkit for LLM distillation research
Starred by
Created 1 year ago
Updated 4 months ago
flux
by
black-forest-labs
0.3%
25k
Inference code for FLUX image generation & editing models
Starred by
+10
Created 1 year ago
Updated 4 months ago
llama-stack
by
llamastack
0.2%
8k
Composable building blocks for Llama apps
Starred by
+6
Created 1 year ago
Updated 2 days ago
MindSearch
by
InternLM
0.1%
7k
LLM multi-agent framework for web search (similar to Perplexity AI and SearchGPT)
Starred by
Created 1 year ago
Updated 4 months ago
unstructured
by
Unstructured-IO
0.4%
13k
ETL solution for structuring unstructured data for language models
Starred by
+12
Created 3 years ago
Updated 6 days ago
VIINA
by
zhukovyuri
0.3%
325
Event data system for the 2022 Russian Invasion of Ukraine
Created 3 years ago
Updated 1 day ago
MInference
by
microsoft
0.4%
1k
Framework for long-context LLM inference speedup via sparse attention
Starred by
Created 1 year ago
Updated 2 months ago
ultravox
by
fixie-ai
0.1%
4k
Multimodal LLM for real-time voice interactions
Starred by
+2
Created 1 year ago
Updated 2 months ago
octo
by
octo-models
0.1%
1k
Robot policy for generalist manipulation, trained on 800k trajectories
Starred by
Created 1 year ago
Updated 1 year ago
DeepSeek-Coder-V2
by
deepseek-ai
0.3%
6k
Open-source code language model comparable to GPT4-Turbo
Starred by
Created 1 year ago
Updated 2 weeks ago
MathBlackBox
by
trotsky1997
0.1%
1k
Research paper for mathematical reasoning via LLMs
Starred by
+1
Created 1 year ago
Updated 11 months ago
EAGLE
by
SafeAILab
0.9%
2k
Speculative decoding research paper for faster LLM inference
Starred by
+5
Created 2 years ago
Updated 1 week ago
tianshou
by
thu-ml
0.2%
9k
PyTorch RL library for algorithm development and application
Starred by
+3
Created 7 years ago
Updated 1 week ago
gemma-2B-10M
by
mustafaaljadery
0.1%
941
Gemma 2B with 10M context length using Infini-attention
Starred by
Created 1 year ago
Updated 1 year ago
RULER
by
NVIDIA
0.5%
1k
Evaluation suite for long-context language models (research paper)
Starred by
Created 1 year ago
Updated 2 weeks ago
VILA
by
NVlabs
0.3%
4k
Open-source VLMs for efficient video/multi-image understanding
Starred by
+1
Created 1 year ago
Updated 3 days ago
ThunderKittens
by
HazyResearch
0.5%
3k
CUDA kernel framework for fast deep learning primitives
Starred by
+14
Created 1 year ago
Updated 2 days ago
selfcodealign
by
bigcode-project
0%
321
Research paper for self-alignment in code generation
Starred by
Created 1 year ago
Updated 9 months ago
VAR
by
FoundationVision
0.3%
9k
Image generation research paper using visual autoregressive modeling
Starred by
Created 1 year ago
Updated 2 weeks ago
FlagEmbedding
by
FlagOpen
0.4%
11k
Toolkit for retrieval and RAG applications
Starred by
+8
Created 2 years ago
Updated 1 month ago
FILM
by
microsoft
0.4%
261
LLM for enhanced context utilization
Created 1 year ago
Updated 1 year ago
PLLaVA
by
magic-research
0.1%
673
Research paper for parameter-free LLaVA extension to videos
Created 1 year ago
Updated 1 year ago
cohere-toolkit
by
cohere-ai
0.2%
3k
RAG toolkit for LLM application development and deployment
Starred by
+4
Created 1 year ago
Updated 1 week ago
EasyContext
by
jzhang38
0%
750
Recipes for language model context length extrapolation to 1M tokens
Starred by
+2
Created 1 year ago
Updated 1 year ago
Spec-Bench
by
hemingkx
0.3%
338
Benchmark for speculative decoding methods (ACL 2024 paper)
Created 1 year ago
Updated 7 months ago
openai-node
by
openai
0.2%
10k
TypeScript/JavaScript SDK for the OpenAI API
Starred by
+4
Created 4 years ago
Updated 1 week ago
llama3
by
meta-llama
0.1%
29k
*Deprecated* minimal example for loading and running Llama 3 models
Starred by
+13
Created 1 year ago
Updated 10 months ago
distilabel
by
argilla-io
0.6%
3k
Framework for synthetic data and AI feedback pipelines
Starred by
+12
Created 2 years ago
Updated 6 days ago
Open-Sora-Plan
by
PKU-YuanGroup
0.1%
12k
Open-source project aiming to reproduce Sora-like T2V model
Starred by
+2
Created 1 year ago
Updated 1 month ago
LLaVA-UHD
by
thunlp
1.0%
397
Efficient native-resolution encoding for multimodal LLMs
Created 1 year ago
Updated 3 days ago
higgsfield
by
higgsfield-ai
0.0%
3k
ML framework for large model training and GPU orchestration
Starred by
+10
Created 7 years ago
Updated 1 year ago
SWE-agent
by
SWE-agent
0.3%
18k
Agent for automated software engineering (NeurIPS 2024)
Starred by
+23
Created 1 year ago
Updated 6 days ago
VoiceCraft
by
jasonppy
0.0%
8k
Zero-shot speech editing and TTS research paper
Starred by
Created 1 year ago
Updated 8 months ago
Open-Sora
by
hpcaitech
0.2%
28k
Video generation initiative for efficient, high-quality video production
Starred by
+4
Created 1 year ago
Updated 7 months ago
bark
by
suno-ai
0.1%
39k
Generative audio model for realistic speech and sound effects
Starred by
+19
Created 2 years ago
Updated 1 year ago
pal
by
reasoning-machines
0.2%
517
Program-aided language model for reasoning tasks
Starred by
Created 3 years ago
Updated 2 years ago
torchtune
by
meta-pytorch
0.1%
6k
PyTorch library for LLM post-training and experimentation
Starred by
+12
Created 2 years ago
Updated 6 days ago
self-rag
by
AkariAsai
0.1%
2k
Self-RAG implementation for learning retrieval, generation, and critique via self-reflection
Starred by
+1
Created 2 years ago
Updated 1 year ago
OpenPipe
by
OpenPipe
0.1%
3k
Fine-tuning platform for cheaper models
Starred by
+5
Created 2 years ago
Updated 1 year ago
LLM-Blender
by
yuchenlin
0.1%
971
LLM ensembling framework using pairwise ranking and generative fusion
Starred by
+3
Created 2 years ago
Updated 1 year ago
grok-1
by
xai-org
0.1%
51k
JAX example code for loading and running Grok-1 open-weights model
Starred by
+22
Created 1 year ago
Updated 1 year ago
aici
by
microsoft
0.1%
2k
AICI constrains LLM output using (Wasm) programs
Starred by
+7
Created 2 years ago
Updated 10 months ago
Yi
by
01-ai
0.0%
8k
Open-source bilingual LLMs trained from scratch
Starred by
+7
Created 2 years ago
Updated 1 year ago
anthropic-tools
by
anthropics
0.3%
329
SDK for tool/function calling with Anthropic models (research preview)
Starred by
Created 2 years ago
Updated 1 year ago
self-rewarding-lm-pytorch
by
lucidrains
0.1%
1k
Training framework for self-rewarding language models
Starred by
+4
Created 1 year ago
Updated 1 year ago
OpenCodeInterpreter
by
OpenCodeInterpreter
0%
2k
Open-source code generation system for bridging LLMs and code interpreters
Starred by
Created 1 year ago
Updated 1 year ago
LWM
by
LargeWorldModel
0.1%
7k
Multimodal autoregressive model for long-context video/text
Starred by
+6
Created 1 year ago
Updated 1 year ago
ai
by
vercel
1.0%
20k
AI SDK for building AI-powered applications and agents
Starred by
+15
Created 2 years ago
Updated 23 hours ago
SPIN
by
uclaml
0.2%
1k
Self-Play Fine-Tuning (SPIN) research paper implementation
Starred by
Created 1 year ago
Updated 1 year ago
trigger.dev
by
triggerdotdev
0.3%
13k
Open-source platform for background jobs and AI workflows
Starred by
+7
Created 3 years ago
Updated 2 days ago
AgentBoard
by
hkust-nlp
0.3%
366
Analytical evaluation board for multi-turn LLM agents
Starred by
Created 1 year ago
Updated 1 year ago
sparrow
by
katanaml
0.2%
5k
Data processing & instruction calling tool using ML, LLM, and Vision LLM
Starred by
Created 3 years ago
Updated 6 days ago
search_with_lepton
by
leptonai
0.0%
8k
Conversational search engine demo
Starred by
+9
Created 1 year ago
Updated 2 weeks ago
autogen
by
microsoft
0.3%
52k
Agentic framework for multi-agent AI applications
Starred by
+19
Created 2 years ago
Updated 1 month ago
m2
by
HazyResearch
0%
561
Sub-quadratic architecture research paper
Starred by
+1
Created 2 years ago
Updated 11 months ago
mergekit
by
arcee-ai
0.3%
6k
CLI tool for merging pretrained language models, combining strengths without retraining
Starred by
+15
Created 2 years ago
Updated 3 days ago
ToolAlpaca
by
tangqiaoyu
0.2%
886
Tool-learning framework for language models (research paper)
Starred by
Created 2 years ago
Updated 1 year ago
ToolBench
by
OpenBMB
0.2%
5k
Open platform for LLM tool learning (ICLR'24 spotlight)
Starred by
+6
Created 2 years ago
Updated 6 months ago
NexusRaven
by
nexusflowai
0%
318
Evaluation framework for function-calling LLM, NexusRaven-13B
Starred by
Created 2 years ago
Updated 2 years ago
gpt4free
by
xtekky
0.1%
66k
API package for multi-provider LLM requests (GPT-4.1, Gemini 2.5, Deepseek R1)
Starred by
Created 2 years ago
Updated 1 day ago
promptbench
by
microsoft
0.1%
3k
LLM evaluation framework
Starred by
Created 2 years ago
Updated 1 month ago
spinningup
by
openai
0.1%
11k
Educational resource for learning deep reinforcement learning
Starred by
+10
Created 7 years ago
Updated 1 year ago
MiniGPT-4
by
Vision-CAIR
0.0%
26k
Vision-language model for multi-task learning
Starred by
+15
Created 2 years ago
Updated 1 year ago
LLMCompiler
by
SqueezeAILab
0.4%
2k
LLM compiler for parallel function calling
Starred by
+2
Created 2 years ago
Updated 1 year ago
NexusRaven-V2
by
nexusflowai
0%
415
Open-source LLM for function calling, outperforming GPT-4 in some cases
Starred by
Created 2 years ago
Updated 1 year ago
mamba
by
state-spaces
0.4%
17k
Mamba SSM architecture for sequence modeling
Starred by
+22
Created 2 years ago
Updated 2 weeks ago
gpt-fast
by
meta-pytorch
0.1%
6k
PyTorch text generation for efficient transformer inference
Starred by
+20
Created 2 years ago
Updated 3 months ago
generative-models
by
Stability-AI
0.1%
27k
Generative models SDK for video, image, and 3D synthesis research
Starred by
+5
Created 2 years ago
Updated 3 weeks ago
LLM-Shearing
by
princeton-nlp
0%
632
Code for LLM pre-training acceleration via structured pruning (ICLR 2024)
Starred by
+1
Created 2 years ago
Updated 1 year ago
TensorRT-LLM
by
NVIDIA
0.4%
12k
LLM inference optimization SDK for NVIDIA GPUs
Starred by
+18
Created 2 years ago
Updated 21 hours ago
gpt-crawler
by
BuilderIO
0.1%
22k
CLI tool for site crawling to generate custom GPT knowledge files
Starred by
Created 2 years ago
Updated 4 months ago
draw-a-ui
by
SawyerHood
0.1%
14k
Web app that generates HTML from UI wireframes
Starred by
Created 2 years ago
Updated 4 months ago
open-interpreter
by
openinterpreter
0.1%
61k
Natural language interface for computers
Starred by
+33
Created 2 years ago
Updated 4 days ago
LLaVA-Plus-Codebase
by
LLaVA-VL
0.1%
762
Multimodal agent for vision tasks using external tools
Starred by
Created 2 years ago
Updated 1 year ago
opengpts
by
langchain-ai
0.1%
7k
Open-source platform for building custom GPT assistants
Starred by
+6
Created 2 years ago
Updated 5 months ago
openchat
by
imoneoi
0.1%
5k
Open-source LLM fine-tuned with C-RLFT, inspired by offline reinforcement learning
Starred by
+4
Created 2 years ago
Updated 1 year ago
CLIP
by
openai
0.3%
32k
Image-text matching model for zero-shot prediction
Starred by
+28
Created 5 years ago
Updated 1 year ago
LLaVA
by
haotian-liu
0.3%
24k
Multimodal assistant with GPT-4 level capabilities
Starred by
+16
Created 2 years ago
Updated 1 year ago
llmperf
by
ray-project
0.6%
1k
Validation and benchmarking library for LLM APIs
Starred by
+2
Created 2 years ago
Updated 11 months ago
LightLLM
by
ModelTC
0.5%
4k
Python framework for LLM inference and serving
Starred by
+6
Created 2 years ago
Updated 2 days ago
examples
by
graphcore
0%
332
ML examples for Graphcore IPUs, training and inference
Starred by
Created 7 years ago
Updated 1 year ago
streaming-llm
by
mit-han-lab
0.1%
7k
Framework for efficient LLM streaming
Starred by
+2
Created 2 years ago
Updated 1 year ago
mistral-inference
by
mistralai
0.1%
11k
Inference library for Mistral models
Starred by
+8
Created 2 years ago
Updated 1 week ago
can-ai-code
by
the-crypt-keeper
0%
598
AI coding model evaluation framework
Starred by
Created 2 years ago
Updated 5 months ago
LongLoRA
by
dvlab-research
0.0%
3k
LongLoRA: Efficient fine-tuning for long-context LLMs
Starred by
+1
Created 2 years ago
Updated 1 year ago
Qwen
by
QwenLM
0.3%
20k
Chat & pretrained LLM by Alibaba Cloud
Starred by
+12
Created 2 years ago
Updated 4 days ago
ollama
by
ollama
0.3%
157k
CLI tool for running LLMs locally
Starred by
+45
Created 2 years ago
Updated 1 day ago
alpaca_farm
by
tatsu-lab
0%
837
RLHF simulation framework for accessible instruction-following/alignment research
Starred by
+1
Created 2 years ago
Updated 1 year ago
Medusa
by
FasterDecoding
0.1%
3k
Framework for accelerating LLM generation using multiple decoding heads
Starred by
+6
Created 2 years ago
Updated 1 year ago
adept-inference
by
persimmon-ai-labs
0%
412
Inference code for the Persimmon-8B LLM
Starred by
Created 2 years ago
Updated 2 years ago
butterfish
by
bakks
0.4%
464
CLI tool for adding AI to your shell
Starred by
Created 2 years ago
Updated 7 months ago
shell_gpt
by
TheR1D
0.1%
12k
CLI tool for shell command generation and task automation using LLMs
Starred by
Created 2 years ago
Updated 1 month ago
LocalAI
by
mudler
1.0%
39k
Open-source OpenAI alternative for local AI inference
Starred by
+13
Created 2 years ago
Updated 21 hours ago
legalbench
by
HazyResearch
0.4%
514
Legal reasoning benchmark for evaluating LLMs
Created 3 years ago
Updated 1 year ago
yarn
by
jquesnelle
0.3%
2k
Context window extension method for LLMs (research paper, models)
Starred by
+4
Created 2 years ago
Updated 1 year ago
shell-ai
by
ricklamers
0.1%
1k
CLI tool for natural language to shell command translation
Created 2 years ago
Updated 2 months ago
llama-cookbook
by
meta-llama
0.1%
18k
Guide for building with Llama models
Starred by
+15
Created 2 years ago
Updated 3 weeks ago
codellama
by
meta-llama
0.0%
16k
Inference code for CodeLlama models
Starred by
+12
Created 2 years ago
Updated 1 year ago
prm800k
by
openai
0%
2k
Dataset of LLM solutions to math problems with step-level correctness labels
Starred by
+4
Created 2 years ago
Updated 2 years ago
FLAML
by
microsoft
0.1%
4k
AutoML library for efficient machine learning and AI operations
Starred by
+1
Created 5 years ago
Updated 1 month ago
engshell
by
emcf
0%
2k
English-language shell for OS, powered by LLMs
Starred by
Created 2 years ago
Updated 1 year ago
WizardLM
by
nlpxucan
0.1%
9k
LLMs built using Evol-Instruct for complex instruction following
Starred by
+15
Created 2 years ago
Updated 5 months ago
sqlcoder
by
defog-ai
0.1%
4k
LLM for natural language to SQL conversion
Starred by
Created 2 years ago
Updated 1 year ago
QuIP
by
Cornell-RelaxML
0.3%
390
Code for LLM quantization research
Created 2 years ago
Updated 1 year ago
lmdeploy
by
InternLM
0.5%
7k
Toolkit for LLM compression, deployment, and serving
Starred by
+8
Created 2 years ago
Updated 1 day ago
flexflow-train
by
flexflow
0.1%
2k
Accelerating distributed deep learning training
Starred by
+8
Created 7 years ago
Updated 1 week ago
Platypus
by
arielnlee
0%
630
Code for fine-tuning LLMs using LoRA
Starred by
Created 2 years ago
Updated 1 year ago
MetaGPT
by
FoundationAgents
0.2%
60k
Multi-agent framework for collaborative AI software development
Starred by
+9
Created 2 years ago
Updated 1 month ago
outlines
by
dottxt-ai
0.6%
13k
SDK for structured LLM text generation
Starred by
+34
Created 2 years ago
Updated 2 days ago
Chinese-Llama-2-7b
by
LinkSoul-AI
0.0%
2k
Chinese Llama 2 model for chat, fully open-source and commercially available
Created 2 years ago
Updated 2 years ago
sentence-transformers
by
huggingface
0.2%
18k
Framework for text embeddings, retrieval, and reranking
Starred by
+22
Created 6 years ago
Updated 1 week ago
private-gpt
by
zylon-ai
0.1%
57k
Private AI API for local document interaction using LLMs
Starred by
+13
Created 2 years ago
Updated 1 year ago
swiss_army_llama
by
Dicklesworthstone
0%
1k
FastAPI service for semantic text search using precomputed embeddings
Starred by
Created 2 years ago
Updated 9 months ago
transformers-tutorials
by
abhimishra91
0.1%
856
Tutorials for fine-tuning transformer models on NLP tasks
Starred by
Created 5 years ago
Updated 1 year ago
bert
by
google-research
0.1%
40k
TensorFlow code and pre-trained models for BERT
Starred by
+26
Created 7 years ago
Updated 1 year ago
ALCE
by
princeton-nlp
0.2%
501
Benchmark for evaluating LLMs' citation abilities
Starred by
Created 2 years ago
Updated 1 year ago
lorahub
by
sail-sg
0.1%
659
Framework for efficient cross-task generalization via dynamic LoRA composition
Starred by
Created 2 years ago
Updated 1 year ago
text-to-text-transfer-transformer
by
google-research
0.1%
6k
Unified text-to-text transformer for NLP research
Starred by
+13
Created 6 years ago
Updated 3 weeks ago
ai-chatbot
by
vercel
0.4%
19k
Next.js chatbot template for building AI-powered chat applications
Starred by
+9
Created 2 years ago
Updated 1 day ago
OpenChatKit
by
togethercomputer
0.0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 2 years ago
Updated 1 year ago
clownfish
by
newhouseb
0%
329
Constrained decoding for LLMs against JSON schema
Starred by
Created 2 years ago
Updated 2 years ago
stanford_alpaca
by
tatsu-lab
0.1%
30k
Instruction-following LLaMA model training and data generation
Starred by
+25
Created 2 years ago
Updated 1 year ago
deep-learning-pytorch-huggingface
by
philschmid
0.2%
1k
Tutorials for deep learning with PyTorch and Hugging Face libraries
Starred by
Created 3 years ago
Updated 9 months ago
xTuring
by
stochasticai
0%
3k
SDK for fine-tuning and customizing open-source LLMs
Starred by
+3
Created 2 years ago
Updated 4 days ago
LMOps
by
microsoft
0.4%
4k
AI research initiative for building AI products with foundation models
Starred by
+7
Created 3 years ago
Updated 1 week ago
DialogStudio
by
salesforce
0%
517
Unified dataset for conversational AI research
Created 2 years ago
Updated 10 months ago
ggml
by
ggml-org
0.2%
14k
Tensor library for machine learning
Starred by
+16
Created 3 years ago
Updated 6 days ago
H2O
by
FMInference
0%
488
KV cache eviction research paper for efficient LLM inference
Starred by
Created 2 years ago
Updated 1 year ago
h2ogpt
by
h2oai
0.0%
12k
Private chat with a local GPT over documents, images, video, and more
Starred by
+3
Created 2 years ago
Updated 1 month ago
qlora
by
artidoro
0.1%
11k
Finetuning tool for quantized LLMs
Starred by
+19
Created 2 years ago
Updated 1 year ago
LLaMA-Factory
by
hiyouga
0.6%
63k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Starred by
+25
Created 2 years ago
Updated 1 day ago
long_llama
by
CStanKonrad
0%
1k
LLM for long context handling, fine-tuned with Focused Transformer
Starred by
Created 2 years ago
Updated 2 years ago
self-instruct
by
yizhongw
0.1%
5k
Self-Instruct: Research paper for aligning language models with self-generated instructions
Starred by
+3
Created 2 years ago
Updated 2 years ago
llama-cpp-python
by
abetlen
0.3%
10k
Python bindings for llama.cpp, enabling local LLM inference
Starred by
+11
Created 2 years ago
Updated 3 months ago
helm
by
stanford-crfm
0.3%
3k
Open-source Python framework for holistic evaluation of foundation models
Starred by
+10
Created 4 years ago
Updated 1 week ago
text-generation-inference
by
huggingface
0.2%
11k
Rust/Python/gRPC server for fast LLM text generation
Starred by
+35
Created 3 years ago
Updated 1 week ago
alpaca_lora_4bit
by
johnsmith0031
0%
535
Fine-tuning and inference tool for quantized LLaMA models
Starred by
Created 2 years ago
Updated 2 years ago
minimal-llama
by
zphang
0%
457
Code for running and fine-tuning LLaMA models
Starred by
Created 2 years ago
Updated 2 years ago
LongChat
by
DachengLi1
0%
532
Long-context LLM chatbot training and evaluation framework
Starred by
+2
Created 2 years ago
Updated 1 year ago
exllama
by
turboderp
0.0%
3k
Llama implementation for memory-efficient quantized weights
Starred by
+6
Created 2 years ago
Updated 2 years ago
LMFlow
by
OptimalScale
0.0%
8k
Toolkit for finetuning and inference of large foundation models
Starred by
+9
Created 2 years ago
Updated 2 days ago
vllm
by
vllm-project
0.8%
64k
LLM serving engine for high-throughput, memory-efficient inference
Starred by
+57
Created 2 years ago
Updated 22 hours ago
openai-python
by
openai
0.2%
29k
Python SDK for the OpenAI API
Starred by
+16
Created 5 years ago
Updated 1 day ago
starcoder
by
bigcode-project
0.0%
7k
Code LM for code generation and instruction fine-tuning
Starred by
+12
Created 2 years ago
Updated 1 year ago
axolotl
by
axolotl-ai-cloud
0.3%
11k
CLI tool for streamlined post-training of AI models
Starred by
+25
Created 2 years ago
Updated 1 day ago
hnswlib
by
nmslib
0.1%
5k
Header-only C++ library for fast approximate nearest neighbors
Starred by
+13
Created 8 years ago
Updated 2 months ago
open_clip
by
mlfoundations
0.3%
13k
OpenCLIP: open-source CLIP implementation for vision-language representation learning
Starred by
+14
Created 4 years ago
Updated 3 weeks ago
accelerate
by
huggingface
0.2%
9k
PyTorch training helper for distributed execution
Starred by
+17
Created 5 years ago
Updated 2 days ago
SpQR
by
Vahe1994
0%
550
Weight compression research paper for near-lossless LLM quantization
Starred by
Created 2 years ago
Updated 11 months ago
AutoGPTQ
by
AutoGPTQ
0.1%
5k
LLM quantization package using GPTQ algorithm
Starred by
+12
Created 2 years ago
Updated 7 months ago
llm-awq
by
mit-han-lab
0.3%
3k
Weight quantization research paper for LLM compression/acceleration
Starred by
+4
Created 2 years ago
Updated 4 months ago
Alpaca-CoT
by
PhoebusSi
0.1%
3k
Instruction fine-tuning (IFT) platform for instruction collection, parameter-efficient methods, and LLMs
Starred by
Created 2 years ago
Updated 1 year ago
baize-chatbot
by
project-baize
0%
3k
Chat model trained via LoRA, using ChatGPT-generated dialogs
Starred by
+3
Created 2 years ago
Updated 1 year ago
falcontune
by
rmihaylov
0%
465
CLI tool for finetuning Falcon LLMs
Starred by
Created 2 years ago
Updated 2 years ago
gorilla
by
ShishirPatil
0.1%
13k
LLM tool-use framework for API invocation and function calling
Starred by
+15
Created 2 years ago
Updated 2 days ago
MeZO
by
princeton-nlp
0.1%
1k
Research paper implementation for memory-efficient LM fine-tuning
Starred by
Created 2 years ago
Updated 1 year ago
lm-evaluation-harness
by
EleutherAI
0.6%
11k
Framework for few-shot language model evaluation
Starred by
+18
Created 5 years ago
Updated 3 days ago
llama.cpp
by
ggml-org
0.4%
91k
C/C++ library for local LLM inference
Starred by
+51
Created 2 years ago
Updated 1 day ago
GPTCache
by
zilliztech
0.2%
8k
Semantic cache for LLM queries, integrated with LangChain and LlamaIndex
Starred by
+8
Created 2 years ago
Updated 4 months ago
tree-of-thoughts
by
kyegomez
0.0%
5k
Plug-and-play implementation of Tree of Thoughts for LLM reasoning
Starred by
+8
Created 2 years ago
Updated 4 months ago
gpt-neox
by
EleutherAI
0.1%
7k
Framework for training large-scale autoregressive language models
Starred by
+22
Created 5 years ago
Updated 2 months ago
pythia
by
EleutherAI
0.2%
3k
LLM suite for interpretability, learning dynamics, ethics, and transparency research
Starred by
+5
Created 3 years ago
Updated 2 weeks ago
llm-foundry
by
mosaicml
0.1%
4k
LLM training code for Databricks foundation models
Starred by
+14
Created 2 years ago
Updated 1 month ago
sd-webui-controlnet
by
Mikubill
0.1%
18k
WebUI extension for ControlNet, an image-generation plugin
Starred by
+2
Created 2 years ago
Updated 1 year ago
ControlNet
by
lllyasviel
0.1%
33k
Neural network structure for adding conditional control to diffusion models
Starred by
+26
Created 2 years ago
Updated 1 year ago
RWKV-LM
by
BlinkDL
0.1%
14k
RNN-based LLM with transformer-level performance and parallelizable training
Starred by
+29
Created 4 years ago
Updated 2 weeks ago
basaran
by
hyperonym
0%
1k
Open-source API server for text completion
Starred by
Created 2 years ago
Updated 1 year ago
pandas-ai
by
sinaptik-ai
0.2%
23k
Python SDK for conversational data analysis using LLMs and RAG
Starred by
+3
Created 2 years ago
Updated 1 month ago
guidance
by
guidance-ai
0.1%
21k
Programming paradigm for steering LLMs
Starred by
+38
Created 3 years ago
Updated 1 week ago
raft
by
rapidsai
0.1%
955
CUDA-accelerated primitives for ML/data mining algorithms
Starred by
Created 6 years ago
Updated 5 days ago
lit-llama
by
Lightning-AI
0.1%
6k
LLaMA implementation for pretraining, finetuning, and inference
Starred by
+5
Created 2 years ago
Updated 5 months ago
civitai
by
civitai
0.2%
7k
Platform for sharing AI models
Starred by
Created 3 years ago
Updated 2 days ago
LyCORIS
by
KohakuBlueleaf
0.1%
2k
Parameter-efficient fine-tuning algorithms for Stable Diffusion
Created 2 years ago
Updated 2 weeks ago
stable-diffusion-webui
by
AUTOMATIC1111
0.1%
159k
Web UI for Stable Diffusion
Starred by
+31
Created 3 years ago
Updated 3 weeks ago
flash-attention
by
Dao-AILab
0.6%
21k
Fast, memory-efficient attention implementation
Starred by
+31
Created 3 years ago
Updated 5 days ago
LoRA
by
microsoft
0.3%
13k
PyTorch library for low-rank adaptation (LoRA) of LLMs
Starred by
+12
Created 4 years ago
Updated 11 months ago
EditAnything
by
sail-sg
0.0%
3k
Image editing research paper using segmentation and diffusion
Starred by
+2
Created 2 years ago
Updated 9 months ago
ImageBind
by
facebookresearch
0.1%
9k
PyTorch implementation for multimodal embeddings research paper
Starred by
+5
Created 2 years ago
Updated 1 week ago
langchain
by
langchain-ai
0.4%
121k
Framework for building LLM-powered applications
Starred by
+83
Created 3 years ago
Updated 2 days ago
open-llms
by
eugeneyan
0.1%
13k
Curated list of commercially-usable open LLMs
Starred by
+7
Created 2 years ago
Updated 9 months ago
IF
by
deep-floyd
0.0%
8k
Text-to-image model for photorealistic synthesis and language understanding
Starred by
+9
Created 2 years ago
Updated 1 year ago
EasyLM
by
young-geng
0.0%
3k
LLM training/finetuning framework in JAX/Flax
Starred by
+9
Created 3 years ago
Updated 1 year ago
open_llama
by
openlm-research
0.0%
8k
Open-source reproduction of LLaMA models
Starred by
+14
Created 2 years ago
Updated 2 years ago
LLaMA-Adapter
by
OpenGVLab
0.1%
6k
Efficient fine-tuning for instruction-following LLaMA models
Starred by
+3
Created 2 years ago
Updated 1 year ago
bitsandbytes
by
bitsandbytes-foundation
0.3%
8k
PyTorch library for k-bit quantization, enabling accessible LLMs
Starred by
+26
Created 4 years ago
Updated 4 days ago
composer
by
mosaicml
0.1%
5k
DL framework for training at scale, optimized for large-scale clusters
Starred by
+17
Created 4 years ago
Updated 2 weeks ago
fairseq
by
facebookresearch
0.1%
32k
Sequence modeling toolkit for translation, language modeling, and text generation research
Starred by
+42
Created 8 years ago
Updated 2 months ago
alpaca-lora
by
tloen
0.0%
19k
LoRA fine-tuning for LLaMA
Starred by
+22
Created 2 years ago
Updated 1 year ago
web-llm
by
mlc-ai
0.2%
17k
In-browser LLM inference engine using WebGPU for hardware acceleration
Starred by
+20
Created 2 years ago
Updated 6 days ago
transformers
by
huggingface
0.2%
153k
ML library for pretrained model inference and training
Starred by
+96
Created 7 years ago
Updated 1 day ago
Awesome-LLM
by
Hannibal046
0.3%
26k
Curated list of Large Language Model resources
Starred by
+8
Created 2 years ago
Updated 4 months ago
agent-ci
by
pegasi-ai
0.3%
354
AI testing framework for LLM output validation
Created 2 years ago
Updated 3 weeks ago
trl
by
huggingface
0.6%
16k
Library for training transformer language models with reinforcement learning
Starred by
+28
Created 5 years ago
Updated 2 days ago
peft
by
huggingface
0.3%
20k
Parameter-efficient fine-tuning (PEFT) library
Starred by
+16
Created 3 years ago
Updated 1 week ago
annotated_deep_learning_paper_implementations
by
labmlai
0.2%
65k
PyTorch implementations/tutorials of deep learning papers with side-by-side notes
Starred by
+4
Created 5 years ago
Updated 2 weeks ago
Megatron-LM
by
NVIDIA
0.5%
14k
Framework for training transformer models at scale
Starred by
+19
Created 6 years ago
Updated 21 hours ago
minGPT
by
karpathy
0.3%
23k
Minimal PyTorch re-implementation for GPT training and inference
Starred by
+30
Created 5 years ago
Updated 1 year ago
nanoGPT
by
karpathy
0.7%
50k
Minimalist repo for training/finetuning GPT models
Starred by
+43
Created 2 years ago
Updated 2 weeks ago
llama
by
meta-llama
0.0%
59k
Inference code for Llama 2 models (deprecated)
Starred by
+38
Created 2 years ago
Updated 10 months ago
RedPajama-Data
by
togethercomputer
0.1%
5k
Dataset pipeline for training large language models
Starred by
+8
Created 2 years ago
Updated 11 months ago
FasterTransformer
by
NVIDIA
0.1%
6k
Optimized transformer library for inference
Starred by
+12
Created 4 years ago
Updated 1 year ago
dolly
by
databrickslabs
0%
11k
Instruction-following LLM trained on the Databricks Machine Learning Platform
Starred by
+15
Created 2 years ago
Updated 2 years ago
StableLM
by
Stability-AI
0.0%
16k
Language models by Stability AI
Starred by
+25
Created 2 years ago
Updated 1 year ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 2 years ago
Updated 6 months ago
web-stable-diffusion
by
mlc-ai
0%
4k
Browser-based Stable Diffusion demo with no server support
Starred by
+2
Created 2 years ago
Updated 1 year ago
DeepSpeed
by
deepspeedai
0.2%
41k
Deep learning optimization library for distributed training and inference
Starred by
+36
Created 5 years ago
Updated 4 days ago
llama_index
by
run-llama
0.4%
46k
Data framework for building LLM-powered agents
Starred by
+44
Created 3 years ago
Updated 2 days ago
ColossalAI
by
hpcaitech
0.1%
41k
AI system for large-scale parallel training
Starred by
+25
Created 4 years ago
Updated 6 days ago
DeepLearningExamples
by
NVIDIA
0.1%
15k
Deep learning examples for training and deployment
Starred by
+8
Created 7 years ago
Updated 1 year ago
tensorflow
by
tensorflow
0.1%
193k
Open-source ML framework
Starred by
+97
Created 10 years ago
Updated 1 day ago