BentoML by bentoml

Framework for serving AI apps and models

created 6 years ago
7,933 stars

Top 6.7% on sourcepulse

Project Summary

BentoML is a Python framework for building and serving AI applications, designed to simplify exposing any machine learning model as a REST API. It targets AI/ML engineers and developers who need to deploy models efficiently, and it provides automatic Docker containerization, dependency management, and optimized inference serving.

How It Works

BentoML abstracts the complexities of model serving: users define inference logic in Python classes decorated with @bentoml.service and expose methods as API endpoints with @bentoml.api. The framework automatically handles dependency packaging, environment replication, and API server generation. Key optimizations include dynamic batching, model parallelism, and multi-model orchestration, all aimed at maximizing hardware utilization for high-performance inference.
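
To make the dynamic batching idea concrete, here is a minimal sketch in plain Python using only the standard library. It is not BentoML's implementation and does not use any BentoML API: the `DynamicBatcher` class, its `max_batch_size` and `max_wait_s` parameters, and the `double_all` stand-in model are all hypothetical names chosen for illustration. The sketch collects individual requests until the batch is full or a short wait budget runs out, then runs the batch function once for the whole group.

```python
import queue
import threading
import time
from concurrent.futures import Future
from typing import Callable, List


class DynamicBatcher:
    """Groups single requests into batches (hypothetical helper, not a BentoML API)."""

    def __init__(self, batch_fn: Callable[[List[float]], List[float]],
                 max_batch_size: int = 8, max_wait_s: float = 0.05) -> None:
        self.batch_fn = batch_fn
        self.max_batch_size = max_batch_size
        self.max_wait_s = max_wait_s
        self.batch_sizes: List[int] = []  # recorded for inspection only
        self._queue: "queue.Queue" = queue.Queue()
        threading.Thread(target=self._worker, daemon=True).start()

    def submit(self, item: float) -> Future:
        """Enqueue one request and return a Future for its result."""
        fut: Future = Future()
        self._queue.put((item, fut))
        return fut

    def _worker(self) -> None:
        while True:
            # Block until at least one request arrives.
            pending = [self._queue.get()]
            deadline = time.monotonic() + self.max_wait_s
            # Keep collecting until the batch is full or the wait budget is spent.
            while len(pending) < self.max_batch_size:
                remaining = deadline - time.monotonic()
                if remaining <= 0:
                    break
                try:
                    pending.append(self._queue.get(timeout=remaining))
                except queue.Empty:
                    break
            items = [item for item, _ in pending]
            self.batch_sizes.append(len(items))
            try:
                # One call to the batch function serves every queued request.
                results = self.batch_fn(items)
                for (_, fut), result in zip(pending, results):
                    fut.set_result(result)
            except Exception as exc:
                for _, fut in pending:
                    fut.set_exception(exc)


def double_all(batch: List[float]) -> List[float]:
    """Stand-in for a model forward pass over a whole batch."""
    return [2 * x for x in batch]


batcher = DynamicBatcher(double_all, max_batch_size=4, max_wait_s=0.2)
futures = [batcher.submit(x) for x in [1.0, 2.0, 3.0, 4.0]]
outputs = [f.result(timeout=5) for f in futures]
print(outputs)              # [2.0, 4.0, 6.0, 8.0]
print(batcher.batch_sizes)  # how the four requests were grouped (timing dependent)
```

The two knobs trade latency for throughput: a larger `max_wait_s` gives the batch more time to fill, which improves hardware utilization, while a smaller value returns results sooner when traffic is light.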

Quick Start & Requirements

Highlighted Details

  • Supports a wide range of AI frameworks, modalities, and runtimes.
  • Offers advanced features like multi-stage pipelines, multi-model inference graphs, and adaptive batching.
  • Provides seamless local development and debugging with production-ready deployment options.
  • Includes BentoCloud for simplified cloud deployment and scaling.

Maintenance & Community

  • Active community with a Slack channel for support and discussion.
  • Open to contributions via GitHub Issues and pull requests.
  • Usage tracking is enabled by default, but users can opt out.

Licensing & Compatibility

  • License: Apache License 2.0.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The framework collects anonymous usage data by default, which users can opt out of. While it supports many frameworks, specific model or runtime integrations might require custom configurations or additional dependencies.
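
As a sketch of how opting out might look from Python: BentoML's documentation describes a `BENTOML_DO_NOT_TRACK` environment variable for disabling usage tracking. The variable name and the value `"True"` are taken from memory of that documentation and should be confirmed against the current release; the `tracking_disabled` name is only illustrative.

```python
import os

# Assumption: BentoML reads BENTOML_DO_NOT_TRACK from the environment.
# Set it before importing bentoml so the setting is in place when the library loads.
os.environ["BENTOML_DO_NOT_TRACK"] = "True"

# Confirm the variable is visible to this process (and to anything it launches).
tracking_disabled = os.environ.get("BENTOML_DO_NOT_TRACK") == "True"
print(tracking_disabled)  # True
```

Setting the variable in the shell or in the deployment environment has the same effect and avoids depending on import order.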

Health Check

  • Last commit: 3 days ago
  • Responsiveness: 1 day
  • Pull requests (30d): 16
  • Issues (30d): 5
  • Star history: 292 stars in the last 90 days

Explore Similar Projects

Starred by Aravind Srinivas (Cofounder of Perplexity), Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), and 8 more.

higgsfield by higgsfield-ai

0.3%
3k
ML framework for large model training and GPU orchestration
created 7 years ago
updated 1 year ago
Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Alexey Milovidov (Cofounder of ClickHouse), and 7 more.

OpenLLM by bentoml

0.2%
12k
SDK for running open-source LLMs as OpenAI-compatible APIs
created 2 years ago
updated 3 days ago