scikit-llm by BeastByteAI

SDK for integrating LLMs into scikit-learn pipelines

Created 2 years ago

3,490 stars

Top 13.8% on SourcePulse

View on GitHub

6 Experts Love This Project

Chip Huyen

Author of "AI Engineering", "Designing Machine Learning Systems"

Rodrigo Nader

Cofounder of Langflow

Shyamal Anadkat

Research Scientist at OpenAI

Gabriel Almeida

Cofounder of Langflow

and 2 more!

Project Summary

Scikit-LLM enables the integration of large language models (LLMs) into the scikit-learn ecosystem, targeting data scientists and ML engineers who want to leverage LLMs for text analysis within a familiar framework. It simplifies using LLMs for tasks like classification, offering a scikit-learn-compatible API.

How It Works

The library provides scikit-learn-compatible estimators that wrap various LLMs, abstracting away the complexities of API calls and prompt engineering. It allows users to treat LLMs as interchangeable components within scikit-learn pipelines, facilitating experimentation and deployment.

Quick Start & Requirements

Primary install: pip install scikit-llm
Prerequisites: OpenAI API key and organization ID.
Documentation: https://github.com/BeastByteAI/scikit-llm

Highlighted Details

Zero-shot text classification example provided using GPT-4.
Supports integration with scikit-learn pipelines.
Offers a consistent API for different LLMs.

Maintenance & Community

Project authors: Iryna Kondrashchenko and Oleh Kostromin.
Community engagement encouraged via GitHub issues and Discord.
Related projects: Dingo, Falcon.

Licensing & Compatibility

License: Not explicitly stated in the README.
Compatibility: Designed for use with scikit-learn, implying Python compatibility.

Limitations & Caveats

The library currently focuses on OpenAI models and requires users to manage their own API keys and costs. The README does not specify supported LLM providers beyond OpenAI or detail performance benchmarks.

Health Check

Last Commit

3 weeks ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

3 stars in the last 30 days