Discover and explore top open-source AI tools and projects—updated daily.
Andyyyy64Hardware-aware local LLM selection tool
Top 23.3% on SourcePulse
This project addresses the challenge of selecting the optimal local Large Language Model (LLM) for specific hardware, moving beyond simple size-based fitting. It targets engineers, researchers, and power users by providing an automated, evidence-based ranking system that considers real-world benchmarks and hardware capabilities, enabling faster and more informed LLM adoption.
How It Works
whichllm automatically detects a user's hardware (GPU, CPU, RAM) and queries the HuggingFace API for popular LLMs. It ranks models by a composite score derived from real-world benchmarks (LiveBench, Chatbot Arena ELO, Open LLM Leaderboard, etc.), confidence-weighted evidence, and hardware fit. The system accounts for quantization, VRAM usage (weights, KV cache, activations), speed, and recency, providing a more accurate performance prediction than size alone.
Quick Start & Requirements
uvx whichllm (recommended), uv tool install whichllm, brew install andyyyy64/whichllm/whichllm, or pip install whichllm.nvidia-ml-py. AMD/Apple Silicon detected automatically.uv.Highlighted Details
whichllm run) with automatic model download and environment setup.whichllm --gpu) and reverse lookup (whichllm plan).whichllm snippet).Maintenance & Community
Contributions are welcome; refer to CONTRIBUTING.md for guidelines. No specific community channels (e.g., Discord, Slack) or notable contributors/sponsorships are detailed in the provided text.
Licensing & Compatibility
Limitations & Caveats
The ranking score includes markers (~, ?) indicating when direct benchmark data is unavailable, relying instead on inherited or interpolated scores, which may affect accuracy for some models. Apple Silicon and CPU-only modes are restricted to GGUF formats for stability. The system actively rejects fabricated or misleading benchmark claims from model uploaders.
1 day ago
Inactive
ztxz16
AlexsJones