X
2,142
Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Casper Hansen
Casper Hansen
Author of AutoAWQ
GitHub
X
Authored Projects (1)
Starred
by
Vincent Weisser
(Cofounder of Prime Intellect)
,
Woosuk Kwon
(Author of vLLM)
,
Wing Lian
(Founder of Axolotl AI)
,
Junyang Lin
(Core Maintainer at Alibaba Qwen),
and
4 more.
AutoAWQ
by
casper-hansen
0.2%
2k
AutoAWQ is a tool for 4-bit quantized LLM inference
Implements Activation-aware Weight Quantization (AWQ) algorithm.
Speeds up models 3x, reduces memory 3x vs FP16.
Supports GEMM/GEMV for speed/context optimization.
Includes fused modules for further speedup.
Created 2 years ago
Updated 3 months ago
Starred Projects (140)
slime
by
THUDM
5.3%
1k
LLM post-training framework for RL scaling
Starred by
Created 2 months ago
Updated 1 day ago
crawl4ai
by
unclecode
0.7%
52k
Open-source web crawler/scraper for LLMs, AI agents, and data pipelines
Starred by
+7
Created 1 year ago
Updated 2 days ago
Biomni
by
snap-stanford
0.7%
2k
Biomedical AI agent for autonomous research tasks
Starred by
+1
Created 5 months ago
Updated 3 days ago
AReaL
by
inclusionAI
5.1%
2k
Distributed RL system for LLM reasoning
Starred by
Created 6 months ago
Updated 1 day ago
gemini-fullstack-langgraph-quickstart
by
google-gemini
0.6%
17k
Full-stack agent quickstart
Starred by
+1
Created 3 months ago
Updated 2 months ago
ZeroSearch
by
Alibaba-NLP
0.4%
1k
Research paper on incentivizing LLM search without real search engines
Created 3 months ago
Updated 1 week ago
WebThinker
by
sunnynexus
0.8%
1k
Research framework for autonomous web search and report drafting
Created 5 months ago
Updated 4 weeks ago
OpenDeepSearch
by
sentient-agi
0.4%
4k
OpenDeepSearch: search tool for AI agents
Starred by
Created 5 months ago
Updated 4 months ago
ReCall
by
Agent-RL
0.9%
1k
RL framework for LLM tool use
Starred by
Created 5 months ago
Updated 3 months ago
ii-researcher
by
Intelligent-Internet
0.4%
468
Open-source framework for building search/research agents
Starred by
Created 5 months ago
Updated 3 weeks ago
veScale
by
volcengine
0%
861
PyTorch-native framework for LLM training
Starred by
+1
Created 1 year ago
Updated 1 month ago
verifiers
by
willccbb
51.7%
3k
RL for LLMs in verifiable environments
Starred by
+9
Created 7 months ago
Updated 1 day ago
deep-research
by
dzhng
0.4%
18k
AI research assistant for iterative, in-depth topic exploration
Starred by
+4
Created 6 months ago
Updated 2 months ago
evalchemy
by
mlfoundations
0.8%
522
LLM evaluation toolkit for post-trained language models
Starred by
Created 9 months ago
Updated 2 months ago
verl
by
volcengine
1.8%
13k
RL training library for LLMs
Starred by
+13
Created 10 months ago
Updated 1 day ago
open-thoughts
by
open-thoughts
0.7%
2k
Open dataset for training reasoning models
Starred by
+1
Created 7 months ago
Updated 1 month ago
open-deep-research
by
btahir
0.2%
2k
Open-source app for AI-powered research reports from web search
Created 8 months ago
Updated 5 months ago
storm
by
stanford-oval
0.1%
27k
LLM system for automated knowledge curation and article generation
Starred by
+5
Created 1 year ago
Updated 2 months ago
cline
by
cline
0.6%
50k
VS Code extension for autonomous coding agent
Starred by
+24
Created 1 year ago
Updated 1 day ago
bolt.new
by
stackblitz
0.3%
16k
AI-powered web development agent for full-stack apps
Starred by
+2
Created 11 months ago
Updated 8 months ago
mle-bench
by
openai
4.4%
909
Benchmark for evaluating AI agents on machine learning engineering tasks
Starred by
+1
Created 10 months ago
Updated 1 day ago
aideml
by
WecoAI
0.9%
1k
ML engineering agent for automated AI R&D, surpassing human experts
Starred by
Created 1 year ago
Updated 2 weeks ago
OpenCoder-llm
by
OpenCoder-llm
0.2%
2k
Open code LLM family (1.5B/8B) for English and Chinese
Created 10 months ago
Updated 8 months ago
optillm
by
codelion
1.0%
3k
Optimizing inference proxy for LLMs
Starred by
+6
Created 1 year ago
Updated 1 day ago
nanotron
by
huggingface
0.7%
2k
Minimalistic library for large language model pretraining
Starred by
+11
Created 1 year ago
Updated 3 days ago
MinerU
by
opendatalab
0.8%
43k
PDF extraction tool for converting PDFs to Markdown and JSON
Starred by
Created 1 year ago
Updated 1 day ago
torchtitan
by
pytorch
0.9%
4k
PyTorch platform for generative AI model training research
Starred by
+9
Created 1 year ago
Updated 1 day ago
rStar
by
zhentingqi
0.1%
956
Research paper for improving small LLM reasoning via mutual reasoning
Starred by
Created 1 year ago
Updated 7 months ago
ml-engineering
by
stas00
0.5%
15k
Open book for LLM/VLM training engineers
Starred by
+16
Created 5 years ago
Updated 1 day ago
Jobs_Applier_AI_Agent_AIHawk
by
feder-cr
0.1%
29k
AI agent for automating job applications
Starred by
Created 1 year ago
Updated 3 months ago
MindSearch
by
InternLM
0.2%
7k
LLM multi-agent framework for web search (Perplexity AI, SearchGPT)
Starred by
Created 1 year ago
Updated 1 month ago
aider
by
Aider-AI
0.5%
37k
AI pair programming in your terminal
Starred by
+35
Created 2 years ago
Updated 2 weeks ago
SWE-agent
by
SWE-agent
0.5%
17k
Agent for automated software engineering (NeurIPS 2024)
Starred by
+23
Created 1 year ago
Updated 3 days ago
openui
by
wandb
0.1%
22k
UI prototyping tool using LLMs
Starred by
+8
Created 1 year ago
Updated 1 month ago
megablocks
by
databricks
1.1%
1k
Lightweight library for mixture-of-experts (MoE) training
Starred by
+15
Created 2 years ago
Updated 2 months ago
OpenHands
by
All-Hands-AI
0.4%
63k
AI platform for software development agents
Starred by
+36
Created 1 year ago
Updated 1 day ago
devika
by
stitionai
0.1%
19k
Agentic AI software engineer for high-level instruction to code generation
Starred by
+7
Created 1 year ago
Updated 11 months ago
torchtune
by
pytorch
0.4%
5k
PyTorch library for LLM post-training and experimentation
Starred by
+12
Created 1 year ago
Updated 1 day ago
OpenCodeInterpreter
by
OpenCodeInterpreter
0.1%
2k
Open-source code generation system for bridging LLMs and code interpreters
Starred by
Created 1 year ago
Updated 1 year ago
bonito
by
BatsResearch
0.1%
788
Synthetic data generator for instruction tuning datasets
Starred by
Created 1 year ago
Updated 1 month ago
VILA
by
NVlabs
0.4%
4k
Open-source VLMs for efficient video/multi-image understanding
Starred by
+1
Created 1 year ago
Updated 3 weeks ago
augmentoolkit
by
e-p-armstrong
0.5%
2k
Data toolkit for custom LLM creation using open-source AI
Starred by
+2
Created 1 year ago
Updated 1 week ago
TensorRT-LLM
by
NVIDIA
0.6%
11k
LLM inference optimization SDK for NVIDIA GPUs
Starred by
+17
Created 2 years ago
Updated 1 day ago
AQLM
by
Vahe1994
0%
1k
PyTorch code for LLM compression via Additive Quantization (AQLM)
Starred by
+1
Created 1 year ago
Updated 2 weeks ago
lilac
by
databricks
0%
1k
Data exploration tool for LLM dataset curation and quality control
Starred by
+6
Created 2 years ago
Updated 1 year ago
MiniCPM
by
OpenBMB
0.4%
8k
Ultra-efficient LLMs for end devices, achieving 5x+ speedup
Starred by
Created 1 year ago
Updated 1 week ago
rawdog
by
AbanteAI
0.1%
2k
CLI tool for auto-executing Python scripts
Starred by
Created 1 year ago
Updated 1 week ago
worker-vllm
by
runpod-workers
0.8%
358
RunPod worker template for blazing-fast LLM endpoints
Starred by
Created 2 years ago
Updated 1 day ago
infinity
by
michaelfeil
0.5%
2k
REST API for high-throughput, low-latency embedding and reranking
Starred by
+7
Created 1 year ago
Updated 1 day ago
datatrove
by
huggingface
0.3%
3k
Data processing library for large-scale text data
Starred by
+8
Created 2 years ago
Updated 2 days ago
llm-decontaminator
by
lm-sys
0.3%
309
LLM contamination detector for quantifying rephrased samples
Starred by
Created 1 year ago
Updated 1 year ago
sglang
by
sgl-project
1.6%
17k
Fast serving framework for LLMs and vision language models
Starred by
+32
Created 1 year ago
Updated 1 day ago
bagel
by
jondurbin
0%
324
Fine-tuning pipeline for language models, "with everything."
Starred by
Created 1 year ago
Updated 1 year ago
llama-moe
by
pjlab-sys4nlp
0.1%
983
MoE model from LLaMA with continual pre-training
Starred by
Created 2 years ago
Updated 8 months ago
rank_llm
by
castorini
1.1%
525
Python toolkit for reproducible information retrieval research
Starred by
Created 2 years ago
Updated 2 days ago
InfiniteBench
by
OpenBMB
0%
344
Benchmark for evaluating language models on super-long contexts (100k+ tokens)
Starred by
Created 1 year ago
Updated 11 months ago
airoboros
by
jondurbin
0.2%
1k
Self-instruct tool for LLM finetuning
Starred by
+2
Created 2 years ago
Updated 1 year ago
magicoder
by
ise-uiuc
0.1%
2k
Code generation model family for instruction following
Starred by
+2
Created 1 year ago
Updated 10 months ago
gpt-fast
by
meta-pytorch
0.2%
6k
PyTorch text generation for efficient transformer inference
Starred by
+20
Created 1 year ago
Updated 6 days ago
prompt-lookup-decoding
by
apoorvumang
0.2%
561
Decoding method for faster LLM generation
Starred by
Created 1 year ago
Updated 1 year ago
LongChat
by
DachengLi1
0%
528
Long-context LLM chatbot training and evaluation framework
Starred by
+2
Created 2 years ago
Updated 1 year ago
LLMTest_NeedleInAHaystack
by
gkamradt
0.6%
2k
LLM testing tool for evaluating in-context retrieval accuracy
Starred by
+3
Created 1 year ago
Updated 1 year ago
UltraFeedback
by
OpenBMB
0%
347
Preference dataset for training reward/critique models
Starred by
Created 2 years ago
Updated 1 year ago
LLM-Shearing
by
princeton-nlp
0%
629
Code for LLM pre-training acceleration via structured pruning (ICLR 2024)
Starred by
+1
Created 1 year ago
Updated 1 year ago
giskard-oss
by
Giskard-AI
0.3%
5k
Open-source testing framework for AI & LLM systems
Starred by
+3
Created 3 years ago
Updated 1 week ago
flashinfer
by
flashinfer-ai
1.2%
4k
Kernel library for LLM serving
Starred by
+10
Created 2 years ago
Updated 1 day ago
DeepSpeed-MII
by
deepspeedai
0.1%
2k
Python library for high-throughput, low-latency, and cost-effective model inference
Starred by
+5
Created 3 years ago
Updated 1 month ago
sec-insights
by
run-llama
0.0%
3k
Full-stack RAG app for SEC filings Q&A
Starred by
Created 2 years ago
Updated 5 months ago
synthesizer
by
SciPhi-AI
0%
628
LLM framework for RAG and data creation
Starred by
+3
Created 1 year ago
Updated 1 year ago
llama_index
by
run-llama
0.3%
44k
Data framework for building LLM-powered agents
Starred by
+39
Created 2 years ago
Updated 1 day ago
self-rag
by
AkariAsai
0.5%
2k
Self-RAG implementation for learning retrieval, generation, and critique via self-reflection
Starred by
+1
Created 1 year ago
Updated 1 year ago
fine-tune-mistral
by
abacaj
0%
718
Fine-tuning script for Mistral-7B
Starred by
Created 1 year ago
Updated 1 year ago
chatgpt-ui
by
WongSaang
0.1%
2k
Web client for ChatGPT with multi-user and i18n support
Created 2 years ago
Updated 1 year ago
ray-llm
by
ray-project
0%
1k
LLM deployment framework on Ray (now upstreamed to Ray)
Starred by
+2
Created 2 years ago
Updated 5 months ago
TinyChatEngine
by
mit-han-lab
0.3%
888
On-device LLM/VLM inference library for edge deployment
Created 2 years ago
Updated 1 year ago
axolotl
by
axolotl-ai-cloud
0.5%
10k
CLI tool for streamlined post-training of AI models
Starred by
+23
Created 2 years ago
Updated 1 day ago
LLaMA-Factory
by
hiyouga
0.6%
57k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Starred by
+21
Created 2 years ago
Updated 1 day ago
TinyLlama
by
jzhang38
0.2%
9k
Tiny pretraining project for a 1.1B Llama model
Starred by
+17
Created 2 years ago
Updated 1 year ago
yarn
by
jquesnelle
0.4%
2k
Context window extension method for LLMs (research paper, models)
Starred by
+4
Created 2 years ago
Updated 1 year ago
chat-ui
by
huggingface
0.2%
9k
Chat UI: open-source interface for LLMs
Starred by
+8
Created 2 years ago
Updated 3 weeks ago
mlc-llm
by
mlc-ai
0.1%
21k
Universal LLM deployment engine with ML compilation
Starred by
+21
Created 2 years ago
Updated 3 days ago
LightLLM
by
ModelTC
0.7%
4k
Python framework for LLM inference and serving
Starred by
+6
Created 2 years ago
Updated 1 day ago
lmdeploy
by
InternLM
0.7%
7k
Toolkit for LLM compression, deployment, and serving
Starred by
+8
Created 2 years ago
Updated 1 day ago
CTranslate2
by
OpenNMT
0.3%
4k
Fast inference engine for Transformer models
Starred by
+6
Created 6 years ago
Updated 4 months ago
llm-awq
by
mit-han-lab
0.4%
3k
Weight quantization research paper for LLM compression/acceleration
Starred by
+4
Created 2 years ago
Updated 1 month ago
skypilot
by
skypilot-org
0.4%
9k
Framework for cloud AI/batch jobs, unifying execution across diverse infrastructure
Starred by
+24
Created 4 years ago
Updated 1 day ago
bitsandbytes
by
bitsandbytes-foundation
0.3%
8k
PyTorch library for k-bit quantization, enabling accessible LLMs
Starred by
+24
Created 4 years ago
Updated 3 days ago
transformer-deploy
by
ELS-RD
0.1%
2k
CLI tool for optimized Hugging Face Transformer deployment
Starred by
+5
Created 3 years ago
Updated 10 months ago
kernl
by
ELS-RD
0%
2k
PyTorch transformer inference engine for GPU speedup
Starred by
+3
Created 3 years ago
Updated 1 year ago
vllm
by
vllm-project
1.1%
56k
LLM serving engine for high-throughput, memory-efficient inference
Starred by
+55
Created 2 years ago
Updated 1 day ago
gpt-engineer
by
AntonOsika
0.1%
55k
CLI platform for code generation experimentation
Starred by
+17
Created 2 years ago
Updated 3 months ago
SqueezeLLM
by
SqueezeAILab
0.1%
703
Quantization framework for efficient LLM serving (ICML 2024 paper)
Starred by
Created 2 years ago
Updated 1 year ago
rebuff
by
protectai
0.6%
1k
SDK for LLM prompt injection detection
Starred by
Created 2 years ago
Updated 1 year ago
exllama
by
turboderp
0.1%
3k
Llama implementation for memory-efficient quantized weights
Starred by
+6
Created 2 years ago
Updated 1 year ago
llama-cpp-python
by
abetlen
0.3%
10k
Python bindings for llama.cpp, enabling local LLM inference
Starred by
+11
Created 2 years ago
Updated 2 weeks ago
ctransformers
by
marella
0.1%
2k
Python bindings for fast Transformer model inference
Starred by
+7
Created 2 years ago
Updated 1 year ago
faiss
by
facebookresearch
0.3%
37k
Similarity search library for dense vectors
Starred by
+51
Created 8 years ago
Updated 1 day ago
netron
by
lutzroeder
0.2%
31k
Model visualizer for neural networks, deep learning, and ML
Starred by
+23
Created 14 years ago
Updated 2 days ago
hnswlib
by
nmslib
0.3%
5k
Header-only C++ library for fast approximate nearest neighbors
Starred by
+11
Created 8 years ago
Updated 2 months ago
deepsparse
by
neuralmagic
0.0%
3k
CPU inference runtime for sparse deep learning models
Starred by
Created 4 years ago
Updated 2 months ago
qlora
by
artidoro
0.1%
11k
Finetuning tool for quantized LLMs
Starred by
+19
Created 2 years ago
Updated 1 year ago
ChatRWKV
by
BlinkDL
0.0%
10k
Open-source chatbot powered by the RWKV RNN language model
Starred by
+3
Created 2 years ago
Updated 3 days ago
rwkv.cpp
by
RWKV
0%
2k
CPU inference lib for RWKV language model
Starred by
Created 2 years ago
Updated 5 months ago
RWKV-LM
by
BlinkDL
0.1%
14k
RNN for LLM, transformer-level performance, parallelizable training
Starred by
+27
Created 4 years ago
Updated 6 days ago
scikit-llm
by
BeastByteAI
0.0%
3k
SDK for integrating LLMs into scikit-learn pipelines
Starred by
+1
Created 2 years ago
Updated 4 weeks ago
h2ogpt
by
h2oai
0.1%
12k
Private chat with local GPT with document, images, video, etc
Starred by
+3
Created 2 years ago
Updated 3 months ago
ggml
by
ggml-org
0.3%
13k
Tensor library for machine learning
Starred by
+15
Created 2 years ago
Updated 1 day ago
DB-GPT
by
eosphoros-ai
0.3%
17k
AI-native data app development framework with agentic workflow
Starred by
Created 2 years ago
Updated 3 days ago
Plan-and-Solve-Prompting
by
AGI-Edgerunners
0.1%
686
Research paper code for improved zero-shot chain-of-thought reasoning
Created 2 years ago
Updated 2 years ago
AutoGPTQ
by
AutoGPTQ
0.1%
5k
LLM quantization package using GPTQ algorithm
Starred by
+12
Created 2 years ago
Updated 4 months ago
llm-foundry
by
mosaicml
0.2%
4k
LLM training code for Databricks foundation models
Starred by
+14
Created 2 years ago
Updated 1 week ago
private-gpt
by
zylon-ai
0.1%
57k
Private AI API for local document interaction using LLMs
Starred by
+13
Created 2 years ago
Updated 9 months ago
StableLM
by
Stability-AI
0.0%
16k
Language models by Stability AI
Starred by
+24
Created 2 years ago
Updated 1 year ago
ai-pdf-chatbot-langchain
by
mayooear
0.1%
16k
AI chatbot agent for PDF document Q&A using LangChain & LangGraph
Starred by
+3
Created 2 years ago
Updated 6 months ago
RedPajama-Data
by
togethercomputer
0.1%
5k
Dataset pipeline for training large language models
Starred by
+8
Created 2 years ago
Updated 8 months ago
OpenChatKit
by
togethercomputer
0.0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 2 years ago
Updated 1 year ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+34
Created 2 years ago
Updated 2 months ago
chatbot-ui
by
mckaywrigley
0.2%
32k
Open-source AI chat app
Starred by
+14
Created 2 years ago
Updated 1 year ago
text-generation-webui
by
oobabooga
0.1%
45k
Web UI for LLM text generation
Starred by
+24
Created 2 years ago
Updated 1 day ago
gpt4all
by
nomic-ai
0.1%
77k
Desktop app for local LLM inference, no GPU/API needed
Starred by
+29
Created 2 years ago
Updated 3 months ago
pathos
by
uqfoundation
0.1%
1k
Framework for parallel graph management and execution in heterogeneous computing
Starred by
Created 12 years ago
Updated 2 months ago
alpha-rptr
by
TheFourGreatErrors
0.7%
584
Trading bot for automated algorithmic trading
Starred by
Created 5 years ago
Updated 6 months ago
dalle-flow
by
jina-ai
0.1%
3k
Text-to-image generation with human-in-the-loop refinement
Starred by
Created 3 years ago
Updated 2 years ago
dalle-mini
by
borisdayma
0.0%
15k
Text-to-image model for generating images from text prompts
Starred by
+13
Created 4 years ago
Updated 1 year ago
AI-Chip
by
basicmi
0.1%
2k
AI chip resource list
Starred by
Created 8 years ago
Updated 1 year ago
interpret
by
interpretml
0.1%
7k
ML interpretability Python package for glassbox models and blackbox explanations
Starred by
+2
Created 6 years ago
Updated 4 days ago
jetson-containers
by
dusty-nv
0.8%
4k
Container build system for NVIDIA Jetson AI/ML development
Starred by
Created 5 years ago
Updated 1 day ago
applied-ml
by
eugeneyan
0.1%
28k
ML resource collection: papers/blogs sharing data science & ML production work
Starred by
+8
Created 5 years ago
Updated 1 year ago
streamlit
by
streamlit
0.3%
41k
SDK for rapidly building interactive data apps
Starred by
+18
Created 6 years ago
Updated 1 day ago
TextAttack
by
QData
0.2%
3k
Python framework for NLP adversarial attacks, data augmentation, and model training
Starred by
+4
Created 5 years ago
Updated 1 month ago
caliban
by
google
0%
501
CLI tool for reproducible research workflows, locally or in the cloud
Starred by
+1
Created 5 years ago
Updated 1 year ago
yolov5
by
ultralytics
0.2%
55k
YOLOv5 in PyTorch for object detection, segmentation, and classification
Starred by
+5
Created 5 years ago
Updated 3 days ago
ParallelWaveGAN
by
kan-bayashi
0%
2k
Pytorch vocoder for real-time speech synthesis, based on Parallel WaveGAN
Starred by
Created 5 years ago
Updated 1 year ago
ALAE
by
podgorskiy
0%
4k
Adversarial latent autoencoder for combining generative/representational properties
Starred by
+1
Created 6 years ago
Updated 4 years ago
pytorch-lightning
by
Lightning-AI
0.1%
30k
Deep learning framework for pretraining, finetuning, and deploying AI models
Starred by
+28
Created 6 years ago
Updated 1 day ago
melgan
by
seungwonpark
0%
646
PyTorch implementation of MelGAN vocoder
Created 5 years ago
Updated 4 years ago
awesome-production-machine-learning
by
EthicalML
0.1%
19k
Curated list of open-source libraries for production ML
Starred by
+14
Created 7 years ago
Updated 5 days ago
transformers
by
huggingface
0.2%
149k
ML library for pretrained model inference and training
Starred by
+92
Created 6 years ago
Updated 1 day ago
GPT2
by
ConnorJL
0%
1k
GPT2 training implementation, supporting TPUs and GPUs
Starred by
Created 6 years ago
Updated 2 years ago
Learn-Natural-Language-Processing-Curriculum
by
llSourcell
0.2%
1k
NLP curriculum for video course
Created 6 years ago
Updated 4 years ago
Feedback? Help us improve.