MLServer by SeldonIO

Inference server for machine learning models

created 5 years ago
831 stars

Top 43.7% on sourcepulse

Project Summary

MLServer is an open-source inference server designed for deploying machine learning models via REST and gRPC interfaces. It targets ML engineers and data scientists seeking a standardized, scalable, and framework-agnostic solution for model deployment, offering compatibility with the KFServing V2 Dataplane spec.
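
To make the REST interface concrete, here is a minimal sketch of a request body in the shape the V2 inference protocol describes: a list of named input tensors, each with a shape, a datatype, and flattened data. The model name, the input name, and the host and port are assumptions for illustration; nothing is sent over the network.

```python
import json


def build_v2_request(input_name, rows):
    """Wrap a rectangular list of numeric rows as a V2-style request body."""
    n_cols = len(rows[0]) if rows else 0
    if any(len(row) != n_cols for row in rows):
        raise ValueError("all rows must have the same length")
    return {
        "inputs": [
            {
                "name": input_name,
                "shape": [len(rows), n_cols],
                "datatype": "FP32",
                # Tensor contents flattened in row-major order.
                "data": [float(value) for row in rows for value in row],
            }
        ]
    }


# Hypothetical model name; host and port are assumed, not taken from the summary.
MODEL_NAME = "my-model"
INFER_URL = f"http://localhost:8080/v2/models/{MODEL_NAME}/infer"

body = build_v2_request("input-0", [[1.0, 2.0], [3.0, 4.0]])
payload = json.dumps(body)  # string that would be POSTed to INFER_URL
```

Any HTTP client could then POST `payload` to `INFER_URL` with a JSON content type.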

How It Works

MLServer is built around "inference runtimes", which act as a bridge between the server and individual ML frameworks. This modular design makes it straightforward to integrate new frameworks and custom model-serving logic. MLServer supports multi-model serving within a single process, parallel inference across worker pools, and adaptive batching of requests, all while adhering to the V2 Inference Protocol for broad compatibility.
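
The runtime idea can be illustrated with a toy sketch: each runtime knows how to load one kind of model and how to run predictions, and the server only talks to that common interface. The classes `ToyRuntime`, `SumRuntime`, `MaxRuntime`, and `ToyServer` below are hypothetical and are not MLServer's API; they only show how several models might sit behind one interface in a single process.

```python
from abc import ABC, abstractmethod


class ToyRuntime(ABC):
    """Hypothetical stand-in for a runtime: load a model, then predict."""

    def __init__(self, name):
        self.name = name
        self.ready = False

    @abstractmethod
    def load(self):
        """Prepare the underlying model and mark the runtime as ready."""

    @abstractmethod
    def predict(self, inputs):
        """Return one output per input row."""


class SumRuntime(ToyRuntime):
    def load(self):
        self._fn = sum  # pretend this is a framework-specific model
        self.ready = True

    def predict(self, inputs):
        return [self._fn(row) for row in inputs]


class MaxRuntime(ToyRuntime):
    def load(self):
        self._fn = max
        self.ready = True

    def predict(self, inputs):
        return [self._fn(row) for row in inputs]


class ToyServer:
    """Keeps several loaded runtimes in one process, keyed by model name."""

    def __init__(self):
        self._models = {}

    def register(self, runtime):
        runtime.load()
        self._models[runtime.name] = runtime

    def infer(self, model_name, inputs):
        return self._models[model_name].predict(inputs)


server = ToyServer()
server.register(SumRuntime("sum-model"))
server.register(MaxRuntime("max-model"))
sum_out = server.infer("sum-model", [[1, 2], [3, 4]])
max_out = server.infer("max-model", [[1, 2], [3, 4]])
```

Adding support for another framework in this sketch means writing one more `ToyRuntime` subclass; the server code stays unchanged.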

Quick Start & Requirements

Highlighted Details

  • Supports Scikit-Learn, XGBoost, Spark MLlib, LightGBM, CatBoost, Tempo, MLflow, Alibi-Detect, Alibi-Explain, and HuggingFace.
  • Compliant with KFServing V2 Dataplane spec for REST and gRPC.
  • Features multi-model serving, adaptive batching, and parallel inference workers.
  • Integrates with Kubernetes-native frameworks like Seldon Core and KServe.
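
Adaptive batching, mentioned above, can be pictured with a small simulation. This is a sketch of the general technique, not MLServer's implementation: requests are grouped until a batch is full or until a time window measured from the batch's first request has passed. The function name and the `max_batch_size` and `max_batch_time` parameters are illustrative.

```python
def adaptive_batches(arrivals, max_batch_size, max_batch_time):
    """Group (arrival_time, payload) pairs into batches.

    A batch closes when it holds max_batch_size items, or when the next
    request arrives more than max_batch_time seconds after the batch's
    first request.
    """
    batches = []
    current = []
    window_start = None
    for arrival_time, payload in sorted(arrivals, key=lambda pair: pair[0]):
        window_expired = bool(current) and (
            arrival_time - window_start > max_batch_time
        )
        if window_expired or len(current) >= max_batch_size:
            batches.append(current)
            current = []
        if not current:
            # The first request of a new batch opens the time window.
            window_start = arrival_time
        current.append(payload)
    if current:
        batches.append(current)
    return batches


# Simulated arrival times in seconds with placeholder payloads.
requests = [(0.00, "a"), (0.01, "b"), (0.02, "c"), (0.50, "d"), (0.51, "e")]
batches = adaptive_batches(requests, max_batch_size=2, max_batch_time=0.1)
```

With these inputs, "a" and "b" fill the first batch, "c" is flushed alone once its window expires, and "d" and "e" share the last batch.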

Maintenance & Community

  • Developed by SeldonIO.
  • Versioning managed via ./hack/update-version.sh.
  • Testing commands: make test, tox -e py3 -- <test_file>.

Licensing & Compatibility

  • Licensed under Apache License 2.0.
  • Note: Associated libraries (e.g., Alibi Detect/Explain) may have different licenses (e.g., Business Source License 1.1).

Limitations & Caveats

Python 3.13 is unsupported, as are versions 3.7 and earlier. The README notes that associated software may have different licensing terms, so review those terms carefully before commercial use.

Health Check

  • Last commit: 5 days ago
  • Responsiveness: 1 day
  • Pull requests (30d): 26
  • Issues (30d): 1
  • Star history: 33 stars in the last 90 days
