llm-leaderboard  by JonathanChavezTamales

LLM benchmark and pricing data repository

Created 1 year ago
309 stars

Top 86.9% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides a community-driven platform for comparing and exploring Large Language Models (LLMs), offering detailed benchmark scores, provider pricing, and model specifications. It serves researchers, developers, and users seeking to understand and select LLMs based on performance and cost.

How It Works

The project aggregates data on hundreds of LLMs, including parameters, context window sizes, licensing, capabilities, pricing, and performance metrics. Benchmark results are standardized and sourced with verifiable links, undergoing community review to ensure data quality and accuracy.

Quick Start & Requirements

The primary interface is the interactive dashboard available at llm-stats.com. No specific installation or local setup is required to view the data.

Highlighted Details

  • Comprehensive data on hundreds of LLMs.
  • Includes model parameters, context window sizes, licensing, and capabilities.
  • Features provider pricing and performance metrics (throughput, latency).
  • Standardized benchmark results with verifiable source links.

Maintenance & Community

The project is community-driven, with contributions welcomed via pull requests following contribution guidelines. Discussions and community engagement are facilitated through a Discord server.

Licensing & Compatibility

The repository's license is not explicitly stated in the provided README.

Limitations & Caveats

While the project strives for accuracy, there is no guarantee that the data is 100% accurate. Some benchmark scores are marked as "N/A," indicating missing data for specific models and tests.

Health Check
Last Commit

3 days ago

Responsiveness

1 week

Pull Requests (30d)
1
Issues (30d)
7
Star History
14 stars in the last 30 days

Explore Similar Projects

Starred by Nir Gazit Nir Gazit(Cofounder of Traceloop), Jared Palmer Jared Palmer(Ex-VP AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), and
3 more.

haven by redotvideo

0%
346
LLM fine-tuning and evaluation platform
Created 2 years ago
Updated 1 year ago
Starred by Morgan Funtowicz Morgan Funtowicz(Head of ML Optimizations at Hugging Face), Luis Capelo Luis Capelo(Cofounder of Lightning AI), and
7 more.

lighteval by huggingface

2.6%
2k
LLM evaluation toolkit for multiple backends
Created 1 year ago
Updated 1 day ago
Starred by Han Wang Han Wang(Cofounder of Mintlify), John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), and
6 more.

evidently by evidentlyai

0.3%
7k
Open-source framework for ML/LLM observability
Created 4 years ago
Updated 15 hours ago
Feedback? Help us improve.