pyllms by kagisearch

Python SDK for LLM access and benchmarking

created 2 years ago
785 stars

Top 45.5% on sourcepulse

Project Summary

PyLLMs is a Python library designed for seamless integration with a wide array of Large Language Models (LLMs), offering a unified interface for developers and researchers. It simplifies connecting to services like OpenAI, Anthropic, Google, and Hugging Face, while also providing a built-in benchmarking system to evaluate model performance across quality, speed, and cost.

How It Works

The library abstracts the complexities of interacting with different LLM providers through a consistent API. It handles request formatting, authentication, and response parsing, standardizing output to include crucial metadata like token counts, cost, and latency. This approach allows users to switch between models or query multiple models concurrently with minimal code changes, facilitating efficient A/B testing and performance analysis.

Quick Start & Requirements

  • Install via pip: pip install pyllms
  • Requires API keys for most providers, configurable via environment variables or directly in llms.init().
  • Official documentation and examples are available in the README.

Highlighted Details

  • Supports over 20 LLM providers, including OpenAI, Anthropic, Google (Vertex AI), Ollama (local), Groq, and Together.
  • Features built-in benchmarking to compare model quality, speed, and cost.
  • Offers asynchronous and streaming capabilities for compatible models.
  • Provides utilities for token counting and managing chat history/system messages.

Maintenance & Community

The project appears to be actively maintained by the kagisearch organization. Further community engagement details (e.g., Discord, Slack) are not explicitly mentioned in the README.

Licensing & Compatibility

Licensed under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The list of supported models may not be exhaustive or perfectly up-to-date, requiring users to verify compatibility for specific model versions. The benchmarking feature's effectiveness relies on the chosen evaluator model.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 19 stars in the last 90 days
