BentoML  by bentoml

Framework for serving AI apps and models

Created 6 years ago
8,084 stars

Top 6.4% on SourcePulse

GitHubView on GitHub
Project Summary

BentoML is a Python framework for building and serving AI applications, designed to simplify the creation of REST APIs for any machine learning model. It targets AI/ML engineers and developers who need to deploy models efficiently, offering features like automatic Docker containerization, dependency management, and optimized inference serving.

How It Works

BentoML abstracts the complexities of model serving by allowing users to define inference logic within Python classes and functions, decorated with @bentoml.service and @bentoml.api. It automatically handles dependency packaging, environment replication, and API server generation. Key optimizations include dynamic batching, model parallelism, and multi-model orchestration, aiming to maximize hardware utilization for high-performance inference.

Quick Start & Requirements

Highlighted Details

  • Supports a wide range of AI frameworks, modalities, and runtimes.
  • Offers advanced features like multi-stage pipelines, multi-model inference graphs, and adaptive batching.
  • Provides seamless local development and debugging with production-ready deployment options.
  • Includes BentoCloud for simplified cloud deployment and scaling.

Maintenance & Community

  • Active community with a Slack channel for support and discussion.
  • Open to contributions via GitHub Issues and pull requests.
  • Usage tracking is enabled by default but can be opted out.

Licensing & Compatibility

  • License: Apache License 2.0.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The framework collects anonymous usage data by default, which users can opt out of. While it supports many frameworks, specific model or runtime integrations might require custom configurations or additional dependencies.

Health Check
Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
35
Issues (30d)
6
Star History
105 stars in the last 30 days

Explore Similar Projects

Starred by Amanpreet Singh Amanpreet Singh(Cofounder of Contextual AI), Yineng Zhang Yineng Zhang(Inference Lead at SGLang; Research Scientist at Together AI), and
7 more.

truss by basetenlabs

0.2%
1k
Model deployment tool for productionizing AI/ML models
Created 3 years ago
Updated 1 day ago
Starred by Eugene Yan Eugene Yan(AI Scientist at AWS), Jared Palmer Jared Palmer(Ex-VP AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), and
4 more.

seldon-core by SeldonIO

0.2%
5k
MLOps framework for production model deployment on Kubernetes
Created 7 years ago
Updated 14 hours ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Travis Fischer Travis Fischer(Founder of Agentic), and
2 more.

modelscope by modelscope

0.2%
8k
Model-as-a-Service library for model inference, training, and evaluation
Created 3 years ago
Updated 1 day ago
Feedback? Help us improve.