Home
Browse all repos
/
Discover and explore top open-source AI tools and projects—updated daily.
Home
Browse all repos
Home
>
Users
>
Casper Hansen
Casper Hansen
Author of AutoAWQ
GitHub
X
Authored Projects (1)
Starred
by
Vincent Weisser
(Cofounder of Prime Intellect)
,
Ji Yichao
(Cofounder of Manus)
,
Woosuk Kwon
(Coauthor of vLLM)
,
Wing Lian
(Founder of Axolotl AI),
and
6 more.
AutoAWQ
by
casper-hansen
0.2%
2k
AutoAWQ is a tool for 4-bit quantized LLM inference
Implements Activation-aware Weight Quantization (AWQ) algorithm.
Speeds up models 3x, reduces memory 3x vs FP16.
Supports GEMM/GEMV for speed/context optimization.
Includes fused modules for further speedup.
Created 2 years ago
Updated 5 months ago
Starred Projects (144)
xtuner
by
InternLM
0.3%
5k
LLM fine-tuning toolkit for research
Starred by
+2
Created 2 years ago
Updated 6 hours ago
slime
by
THUDM
3.9%
2k
LLM post-training framework for RL scaling
Starred by
+2
Created 3 months ago
Updated 48 minutes ago
crawl4ai
by
unclecode
0.6%
55k
Open-source web crawler/scraper for LLMs, AI agents, and data pipelines
Starred by
+7
Created 1 year ago
Updated 1 day ago
Biomni
by
snap-stanford
1.3%
2k
Biomedical AI agent for autonomous research tasks
Starred by
+1
Created 6 months ago
Updated 1 day ago
RL2
by
ChenmienTan
2.2%
890
Reinforcement learning for large language models
Starred by
+1
Created 6 months ago
Updated 2 hours ago
AReaL
by
inclusionAI
2.6%
3k
Distributed RL system for LLM reasoning
Starred by
Created 7 months ago
Updated 1 hour ago
gemini-fullstack-langgraph-quickstart
by
google-gemini
0.4%
17k
Full-stack agent quickstart
Starred by
+1
Created 4 months ago
Updated 1 month ago
ZeroSearch
by
Alibaba-NLP
0.3%
1k
Research paper on incentivizing LLM search without real search engines
Created 5 months ago
Updated 1 month ago
WebThinker
by
RUC-NLPIR
1.0%
1k
Research framework for autonomous web search and report drafting
Created 6 months ago
Updated 3 weeks ago
OpenDeepSearch
by
sentient-agi
0.5%
4k
OpenDeepSearch: search tool for AI agents
Starred by
Created 6 months ago
Updated 6 months ago
ReCall
by
Agent-RL
0.6%
1k
RL framework for LLM tool use
Starred by
Created 7 months ago
Updated 5 months ago
ii-researcher
by
Intelligent-Internet
0.2%
475
Open-source framework for building search/research agents
Starred by
Created 6 months ago
Updated 2 months ago
veScale
by
volcengine
0.2%
874
PyTorch-native framework for LLM training
Starred by
+1
Created 1 year ago
Updated 1 month ago
verifiers
by
PrimeIntellect-ai
1.5%
3k
RL for LLMs in verifiable environments
Starred by
+11
Created 8 months ago
Updated 2 days ago
deep-research
by
dzhng
0.3%
18k
AI research assistant for iterative, in-depth topic exploration
Starred by
+5
Created 8 months ago
Updated 1 month ago
evalchemy
by
mlfoundations
0.9%
546
LLM evaluation toolkit for post-trained language models
Starred by
Created 11 months ago
Updated 3 months ago
verl
by
volcengine
1.7%
14k
RL training library for LLMs
Starred by
+13
Created 11 months ago
Updated 3 hours ago
open-thoughts
by
open-thoughts
0.2%
2k
Open dataset for training reasoning models
Starred by
+1
Created 8 months ago
Updated 1 month ago
open-deep-research
by
btahir
0.3%
2k
Open-source app for AI-powered research reports from web search
Created 9 months ago
Updated 7 months ago
storm
by
stanford-oval
0.1%
28k
LLM system for automated knowledge curation and article generation
Starred by
+5
Created 1 year ago
Updated 2 weeks ago
cline
by
cline
0.4%
51k
VS Code extension for autonomous coding agent
Starred by
+26
Created 1 year ago
Updated 55 minutes ago
bolt.new
by
stackblitz
0.3%
16k
AI-powered web development agent for full-stack apps
Starred by
+2
Created 1 year ago
Updated 10 months ago
mle-bench
by
openai
2.4%
1k
Benchmark for evaluating AI agents on machine learning engineering tasks
Starred by
+1
Created 1 year ago
Updated 20 hours ago
aideml
by
WecoAI
0.6%
1k
ML engineering agent for automated AI R&D, surpassing human experts
Starred by
Created 1 year ago
Updated 3 weeks ago
OpenCoder-llm
by
OpenCoder-llm
0.4%
2k
Open code LLM family (1.5B/8B) for English and Chinese
Starred by
Created 11 months ago
Updated 10 months ago
optillm
by
codelion
1.3%
3k
Optimizing inference proxy for LLMs
Starred by
+6
Created 1 year ago
Updated 1 week ago
docling
by
docling-project
1.5%
42k
Prepare documents for generative AI
Starred by
+12
Created 1 year ago
Updated 18 hours ago
nanotron
by
huggingface
0.4%
2k
Minimalistic library for large language model pretraining
Starred by
+11
Created 2 years ago
Updated 1 month ago
MinerU
by
opendatalab
2.3%
46k
PDF extraction tool for converting PDFs to Markdown and JSON
Starred by
Created 1 year ago
Updated 22 hours ago
torchtitan
by
pytorch
0.7%
5k
PyTorch platform for generative AI model training research
Starred by
+11
Created 1 year ago
Updated 2 hours ago
rStar
by
zhentingqi
0.1%
962
Research paper for improving small LLM reasoning via mutual reasoning
Starred by
Created 1 year ago
Updated 8 months ago
ml-engineering
by
stas00
0.3%
15k
Open book for LLM/VLM training engineers
Starred by
+16
Created 5 years ago
Updated 16 hours ago
Jobs_Applier_AI_Agent_AIHawk
by
feder-cr
0.2%
29k
AI agent for automating job applications
Starred by
Created 1 year ago
Updated 4 months ago
MindSearch
by
InternLM
0.2%
7k
LLM multi-agent framework for web search (Perplexity AI, SearchGPT)
Starred by
Created 1 year ago
Updated 3 months ago
aider
by
Aider-AI
0.3%
38k
AI pair programming in your terminal
Starred by
+36
Created 2 years ago
Updated 1 week ago
SWE-agent
by
SWE-agent
0.3%
18k
Agent for automated software engineering (NeurIPS 2024)
Starred by
+23
Created 1 year ago
Updated 1 day ago
openui
by
wandb
0.1%
22k
UI prototyping tool using LLMs
Starred by
+8
Created 1 year ago
Updated 2 weeks ago
megablocks
by
databricks
0.2%
1k
Lightweight library for mixture-of-experts (MoE) training
Starred by
+15
Created 2 years ago
Updated 3 months ago
OpenHands
by
All-Hands-AI
0.3%
64k
AI platform for software development agents
Starred by
+36
Created 1 year ago
Updated 1 hour ago
devika
by
stitionai
0.0%
19k
Agentic AI software engineer for high-level instruction to code generation
Starred by
+7
Created 1 year ago
Updated 2 weeks ago
torchtune
by
meta-pytorch
0.2%
6k
PyTorch library for LLM post-training and experimentation
Starred by
+12
Created 2 years ago
Updated 21 hours ago
OpenCodeInterpreter
by
OpenCodeInterpreter
0.1%
2k
Open-source code generation system for bridging LLMs and code interpreters
Starred by
Created 1 year ago
Updated 1 year ago
bonito
by
BatsResearch
0.3%
796
Synthetic data generator for instruction tuning datasets
Starred by
Created 1 year ago
Updated 3 months ago
VILA
by
NVlabs
0.5%
4k
Open-source VLMs for efficient video/multi-image understanding
Starred by
+1
Created 1 year ago
Updated 2 months ago
augmentoolkit
by
e-p-armstrong
0.1%
2k
Data toolkit for custom LLM creation using open-source AI
Starred by
+3
Created 1 year ago
Updated 3 weeks ago
TensorRT-LLM
by
NVIDIA
0.5%
12k
LLM inference optimization SDK for NVIDIA GPUs
Starred by
+17
Created 2 years ago
Updated 53 minutes ago
AQLM
by
Vahe1994
0%
1k
PyTorch code for LLM compression via Additive Quantization (AQLM)
Starred by
+2
Created 1 year ago
Updated 2 months ago
lilac
by
databricks
0%
1k
Data exploration tool for LLM dataset curation and quality control
Starred by
+6
Created 2 years ago
Updated 1 year ago
MiniCPM
by
OpenBMB
0.2%
8k
Ultra-efficient LLMs for end devices, achieving 5x+ speedup
Starred by
Created 1 year ago
Updated 6 days ago
rawdog
by
AbanteAI
0.2%
2k
CLI tool for auto-executing Python scripts
Starred by
Created 1 year ago
Updated 1 month ago
worker-vllm
by
runpod-workers
0.5%
371
RunPod worker template for blazing-fast LLM endpoints
Starred by
Created 2 years ago
Updated 3 weeks ago
infinity
by
michaelfeil
0.6%
2k
REST API for high-throughput, low-latency embedding and reranking
Starred by
+8
Created 2 years ago
Updated 1 week ago
datatrove
by
huggingface
0.5%
3k
Data processing library for large-scale text data
Starred by
+9
Created 2 years ago
Updated 19 hours ago
llm-decontaminator
by
lm-sys
0%
311
LLM contamination detector for quantifying rephrased samples
Starred by
Created 2 years ago
Updated 1 year ago
sglang
by
sgl-project
1.2%
19k
Fast serving framework for LLMs and vision language models
Starred by
+32
Created 1 year ago
Updated 1 hour ago
bagel
by
jondurbin
0%
324
Fine-tuning pipeline for language models, "with everything."
Starred by
Created 1 year ago
Updated 1 year ago
llama-moe
by
pjlab-sys4nlp
0.3%
993
MoE model from LLaMA with continual pre-training
Starred by
Created 2 years ago
Updated 10 months ago
rank_llm
by
castorini
0.4%
543
Python toolkit for reproducible information retrieval research
Starred by
Created 2 years ago
Updated 1 week ago
UltraEval
by
OpenBMB
0.4%
250
An open-source framework for evaluating foundation models
Created 1 year ago
Updated 11 months ago
InfiniteBench
by
OpenBMB
0.8%
351
Benchmark for evaluating language models on super-long contexts (100k+ tokens)
Starred by
Created 1 year ago
Updated 1 year ago
airoboros
by
jondurbin
0.1%
1k
Self-instruct tool for LLM finetuning
Starred by
+3
Created 2 years ago
Updated 1 year ago
magicoder
by
ise-uiuc
0.1%
2k
Code generation model family for instruction following
Starred by
+2
Created 1 year ago
Updated 11 months ago
gpt-fast
by
meta-pytorch
0.2%
6k
PyTorch text generation for efficient transformer inference
Starred by
+20
Created 2 years ago
Updated 1 month ago
prompt-lookup-decoding
by
apoorvumang
0%
572
Decoding method for faster LLM generation
Starred by
Created 1 year ago
Updated 1 year ago
LongChat
by
DachengLi1
0%
532
Long-context LLM chatbot training and evaluation framework
Starred by
+2
Created 2 years ago
Updated 1 year ago
LLMTest_NeedleInAHaystack
by
gkamradt
0.4%
2k
LLM testing tool for evaluating in-context retrieval accuracy
Starred by
+3
Created 1 year ago
Updated 1 year ago
UltraFeedback
by
OpenBMB
0%
353
Preference dataset for training reward/critique models
Starred by
Created 2 years ago
Updated 1 year ago
LLM-Shearing
by
princeton-nlp
0.2%
631
Code for LLM pre-training acceleration via structured pruning (ICLR 2024)
Starred by
+1
Created 2 years ago
Updated 1 year ago
giskard-oss
by
Giskard-AI
0.4%
5k
Open-source testing framework for AI & LLM systems
Starred by
+3
Created 3 years ago
Updated 5 days ago
flashinfer
by
flashinfer-ai
1.3%
4k
Kernel library for LLM serving
Starred by
+10
Created 2 years ago
Updated 5 hours ago
DeepSpeed-MII
by
deepspeedai
0.1%
2k
Python library for high-throughput, low-latency, and cost-effective model inference
Starred by
+5
Created 3 years ago
Updated 3 months ago
sec-insights
by
run-llama
0.1%
3k
Full-stack RAG app for SEC filings Q&A
Starred by
Created 2 years ago
Updated 7 months ago
synthesizer
by
SciPhi-AI
0%
626
LLM framework for RAG and data creation
Starred by
+3
Created 2 years ago
Updated 1 year ago
llama_index
by
run-llama
0.3%
45k
Data framework for building LLM-powered agents
Starred by
+44
Created 2 years ago
Updated 11 hours ago
self-rag
by
AkariAsai
0.2%
2k
Self-RAG implementation for learning retrieval, generation, and critique via self-reflection
Starred by
+1
Created 2 years ago
Updated 1 year ago
fine-tune-mistral
by
abacaj
0%
714
Fine-tuning script for Mistral-7B
Starred by
Created 2 years ago
Updated 2 years ago
chatgpt-ui
by
WongSaang
0.1%
2k
Web client for ChatGPT with multi-user and i18n support
Created 2 years ago
Updated 1 year ago
ray-llm
by
ray-project
0%
1k
LLM deployment framework on Ray (now upstreamed to Ray)
Starred by
+2
Created 2 years ago
Updated 7 months ago
TinyChatEngine
by
mit-han-lab
0.3%
900
On-device LLM/VLM inference library for edge deployment
Created 2 years ago
Updated 1 year ago
axolotl
by
axolotl-ai-cloud
0.5%
11k
CLI tool for streamlined post-training of AI models
Starred by
+25
Created 2 years ago
Updated 13 hours ago
LLaMA-Factory
by
hiyouga
0.7%
60k
Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)
Starred by
+23
Created 2 years ago
Updated 2 hours ago
TinyLlama
by
jzhang38
0.1%
9k
Tiny pretraining project for a 1.1B Llama model
Starred by
+18
Created 2 years ago
Updated 1 year ago
yarn
by
jquesnelle
0.4%
2k
Context window extension method for LLMs (research paper, models)
Starred by
+4
Created 2 years ago
Updated 1 year ago
chat-ui
by
huggingface
0.1%
9k
Chat UI: open-source interface for LLMs
Starred by
+8
Created 2 years ago
Updated 51 minutes ago
mlc-llm
by
mlc-ai
0.2%
21k
Universal LLM deployment engine with ML compilation
Starred by
+21
Created 2 years ago
Updated 2 days ago
LightLLM
by
ModelTC
0.5%
4k
Python framework for LLM inference and serving
Starred by
+6
Created 2 years ago
Updated 1 hour ago
lmdeploy
by
InternLM
0.4%
7k
Toolkit for LLM compression, deployment, and serving
Starred by
+8
Created 2 years ago
Updated 1 day ago
CTranslate2
by
OpenNMT
0.2%
4k
Fast inference engine for Transformer models
Starred by
+6
Created 6 years ago
Updated 6 months ago
llm-awq
by
mit-han-lab
0.5%
3k
Weight quantization research paper for LLM compression/acceleration
Starred by
+4
Created 2 years ago
Updated 2 months ago
skypilot
by
skypilot-org
0.4%
9k
Framework for cloud AI/batch jobs, unifying execution across diverse infrastructure
Starred by
+24
Created 4 years ago
Updated 1 hour ago
bitsandbytes
by
bitsandbytes-foundation
0.3%
8k
PyTorch library for k-bit quantization, enabling accessible LLMs
Starred by
+26
Created 4 years ago
Updated 1 week ago
transformer-deploy
by
ELS-RD
0%
2k
CLI tool for optimized Hugging Face Transformer deployment
Starred by
+6
Created 4 years ago
Updated 11 months ago
kernl
by
ELS-RD
0.1%
2k
PyTorch transformer inference engine for GPU speedup
Starred by
+3
Created 3 years ago
Updated 1 year ago
vllm
by
vllm-project
0.8%
60k
LLM serving engine for high-throughput, memory-efficient inference
Starred by
+57
Created 2 years ago
Updated 52 minutes ago
gpt-engineer
by
AntonOsika
0.0%
55k
CLI platform for code generation experimentation
Starred by
+17
Created 2 years ago
Updated 5 months ago
SqueezeLLM
by
SqueezeAILab
0.1%
704
Quantization framework for efficient LLM serving (ICML 2024 paper)
Starred by
Created 2 years ago
Updated 1 year ago
rebuff
by
protectai
0%
1k
SDK for LLM prompt injection detection
Starred by
Created 2 years ago
Updated 1 year ago
exllama
by
turboderp
0%
3k
Llama implementation for memory-efficient quantized weights
Starred by
+6
Created 2 years ago
Updated 2 years ago
llama-cpp-python
by
abetlen
0.1%
10k
Python bindings for llama.cpp, enabling local LLM inference
Starred by
+11
Created 2 years ago
Updated 2 months ago
ctransformers
by
marella
0%
2k
Python bindings for fast Transformer model inference
Starred by
+8
Created 2 years ago
Updated 1 year ago
faiss
by
facebookresearch
0.3%
38k
Similarity search library for dense vectors
Starred by
+52
Created 8 years ago
Updated 8 hours ago
netron
by
lutzroeder
0.2%
32k
Model visualizer for neural networks, deep learning, and ML
Starred by
+23
Created 15 years ago
Updated 17 hours ago
hnswlib
by
nmslib
0.1%
5k
Header-only C++ library for fast approximate nearest neighbors
Starred by
+13
Created 8 years ago
Updated 1 month ago
deepsparse
by
neuralmagic
0.0%
3k
CPU inference runtime for sparse deep learning models
Starred by
Created 4 years ago
Updated 4 months ago
qlora
by
artidoro
0.1%
11k
Finetuning tool for quantized LLMs
Starred by
+19
Created 2 years ago
Updated 1 year ago
ChatRWKV
by
BlinkDL
0.0%
10k
Open-source chatbot powered by the RWKV RNN language model
Starred by
+4
Created 2 years ago
Updated 2 weeks ago
rwkv.cpp
by
RWKV
0.1%
2k
CPU inference lib for RWKV language model
Starred by
Created 2 years ago
Updated 6 months ago
RWKV-LM
by
BlinkDL
0.2%
14k
RNN for LLM, transformer-level performance, parallelizable training
Starred by
+29
Created 4 years ago
Updated 1 hour ago
scikit-llm
by
BeastByteAI
0%
3k
SDK for integrating LLMs into scikit-learn pipelines
Starred by
+1
Created 2 years ago
Updated 2 weeks ago
h2ogpt
by
h2oai
0.1%
12k
Private chat with local GPT with document, images, video, etc
Starred by
+3
Created 2 years ago
Updated 5 days ago
ggml
by
ggml-org
0.3%
13k
Tensor library for machine learning
Starred by
+16
Created 3 years ago
Updated 3 days ago
DB-GPT
by
eosphoros-ai
0.3%
17k
AI-native data app development framework with agentic workflow
Starred by
Created 2 years ago
Updated 5 hours ago
Plan-and-Solve-Prompting
by
AGI-Edgerunners
0%
692
Research paper code for improved zero-shot chain-of-thought reasoning
Created 2 years ago
Updated 2 years ago
AutoGPTQ
by
AutoGPTQ
0.2%
5k
LLM quantization package using GPTQ algorithm
Starred by
+12
Created 2 years ago
Updated 6 months ago
llm-foundry
by
mosaicml
0.1%
4k
LLM training code for Databricks foundation models
Starred by
+14
Created 2 years ago
Updated 2 days ago
private-gpt
by
zylon-ai
0.1%
57k
Private AI API for local document interaction using LLMs
Starred by
+13
Created 2 years ago
Updated 11 months ago
StableLM
by
Stability-AI
0.0%
16k
Language models by Stability AI
Starred by
+25
Created 2 years ago
Updated 1 year ago
ai-pdf-chatbot-langchain
by
mayooear
0.2%
16k
AI chatbot agent for PDF document Q&A using LangChain & LangGraph
Starred by
+3
Created 2 years ago
Updated 7 months ago
RedPajama-Data
by
togethercomputer
0.0%
5k
Dataset pipeline for training large language models
Starred by
+8
Created 2 years ago
Updated 10 months ago
OpenChatKit
by
togethercomputer
0.0%
9k
Open-source toolkit for building specialized/general-purpose chat models
Starred by
+13
Created 2 years ago
Updated 1 year ago
FastChat
by
lm-sys
0.1%
39k
Open platform for training, serving, and evaluating LLM-based chatbots
Starred by
+36
Created 2 years ago
Updated 4 months ago
chatbot-ui
by
mckaywrigley
0.1%
32k
Open-source AI chat app
Starred by
+14
Created 2 years ago
Updated 1 year ago
text-generation-webui
by
oobabooga
0.1%
45k
Web UI for LLM text generation
Starred by
+24
Created 2 years ago
Updated 5 hours ago
gpt4all
by
nomic-ai
0.1%
77k
Desktop app for local LLM inference, no GPU/API needed
Starred by
+29
Created 2 years ago
Updated 4 months ago
pathos
by
uqfoundation
0%
1k
Framework for parallel graph management and execution in heterogeneous computing
Starred by
Created 12 years ago
Updated 2 days ago
alpha-rptr
by
TheFourGreatErrors
0.7%
597
Trading bot for automated algorithmic trading
Starred by
Created 5 years ago
Updated 7 months ago
dalle-flow
by
jina-ai
0.1%
3k
Text-to-image generation with human-in-the-loop refinement
Starred by
Created 3 years ago
Updated 2 years ago
dalle-mini
by
borisdayma
0.1%
15k
Text-to-image model for generating images from text prompts
Starred by
+15
Created 4 years ago
Updated 1 year ago
AI-Chip
by
basicmi
0.1%
2k
AI chip resource list
Starred by
Created 8 years ago
Updated 1 year ago
interpret
by
interpretml
0.0%
7k
ML interpretability Python package for glassbox models and blackbox explanations
Starred by
+2
Created 6 years ago
Updated 3 days ago
jetson-containers
by
dusty-nv
0.4%
4k
Container build system for NVIDIA Jetson AI/ML development
Starred by
Created 5 years ago
Updated 6 hours ago
applied-ml
by
eugeneyan
0.1%
28k
ML resource collection: papers/blogs sharing data science & ML production work
Starred by
+8
Created 5 years ago
Updated 1 year ago
streamlit
by
streamlit
0.2%
42k
SDK for rapidly building interactive data apps
Starred by
+20
Created 6 years ago
Updated 2 hours ago
TextAttack
by
QData
0.1%
3k
Python framework for NLP adversarial attacks, data augmentation, and model training
Starred by
+4
Created 6 years ago
Updated 3 months ago
caliban
by
google
0%
500
CLI tool for reproducible research workflows, locally or in the cloud
Starred by
+1
Created 5 years ago
Updated 1 year ago
yolov5
by
ultralytics
0.2%
56k
YOLOv5 in PyTorch for object detection, segmentation, and classification
Starred by
+5
Created 5 years ago
Updated 5 days ago
ParallelWaveGAN
by
kan-bayashi
0.2%
2k
Pytorch vocoder for real-time speech synthesis, based on Parallel WaveGAN
Starred by
Created 6 years ago
Updated 1 year ago
ALAE
by
podgorskiy
0.0%
4k
Adversarial latent autoencoder for combining generative/representational properties
Starred by
+1
Created 6 years ago
Updated 4 years ago
pytorch-lightning
by
Lightning-AI
0.1%
30k
Deep learning framework for pretraining, finetuning, and deploying AI models
Starred by
+31
Created 6 years ago
Updated 5 hours ago
melgan
by
seungwonpark
0%
649
PyTorch implementation of MelGAN vocoder
Created 6 years ago
Updated 5 years ago
awesome-production-machine-learning
by
EthicalML
0.3%
19k
Curated list of open-source libraries for production ML
Starred by
+14
Created 7 years ago
Updated 1 week ago
transformers
by
huggingface
0.3%
151k
ML library for pretrained model inference and training
Starred by
+96
Created 7 years ago
Updated 1 hour ago
GPT2
by
ConnorJL
0%
1k
GPT2 training implementation, supporting TPUs and GPUs
Starred by
Created 6 years ago
Updated 2 years ago
Learn-Natural-Language-Processing-Curriculum
by
llSourcell
0%
1k
NLP curriculum for video course
Created 6 years ago
Updated 5 years ago
Feedback? Help us improve.