simple_tensorflow_serving by tobegit3hub

Serving service for machine learning models

Created 7 years ago
760 stars

Top 45.8% on SourcePulse

Project Summary

This project provides a generic, easy-to-use serving service for machine learning models, primarily targeting developers and researchers who need to deploy models via RESTful APIs. It aims to simplify the deployment process, offering broad framework support and client generation capabilities.

How It Works

The service uses Flask to run an HTTP server and loads TensorFlow models with tf.saved_model.loader. For each incoming JSON request it builds a feed_dict and runs inference via sess.run(). It supports multiple model versions through independent threads. Client code is generated by reading the user's model and rendering Jinja templates.
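
The request-to-inference path described above can be sketched in plain Python. This is a minimal illustration, not the project's actual code: the request shape ({"data": {...}}), the INPUT_TENSORS mapping, and both function names are assumptions, and fake_run is a stand-in for sess.run() so the example runs without TensorFlow installed.

```python
import json

# Hypothetical mapping from JSON input names to graph tensor names. In a real
# service this would be read from the loaded SavedModel's signature.
INPUT_TENSORS = {"keys": "keys:0", "features": "features:0"}


def build_feed_dict(request_body, input_tensors):
    """Map each named input under "data" to its tensor name."""
    data = json.loads(request_body)["data"]
    missing = sorted(set(input_tensors) - set(data))
    if missing:
        raise ValueError("missing inputs: " + ", ".join(missing))
    return {input_tensors[name]: data[name] for name in input_tensors}


def fake_run(feed_dict):
    """Stand-in for sess.run(): sums each feature row instead of running a graph."""
    return {"prediction": [sum(row) for row in feed_dict["features:0"]]}


body = '{"data": {"keys": [[1], [2]], "features": [[1, 2, 3], [4, 5, 6]]}}'
feed = build_feed_dict(body, INPUT_TENSORS)
result = fake_run(feed)
print(json.dumps(result))
```

Keeping the JSON-to-tensor mapping in one small function makes it easy to reject requests with missing inputs before any inference is attempted.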

Quick Start & Requirements

  • Install: pip install simple_tensorflow_serving
  • Prerequisites: Python, TensorFlow. GPU support requires CUDA.
  • Docker: Images are available for CPU, GPU, HDFS, and Python 3.4.
  • Docs: https://stfs.readthedocs.io/
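
Once a server is running, a prediction request might be built as follows. This is a sketch using only the Python standard library. The URL, the port, the payload keys (model_name, data), and the input names are assumptions for illustration, so check the project docs for the exact request format. The request is only constructed here, not sent.

```python
import json
import urllib.request


def build_predict_request(url, inputs, model_name="default"):
    """Build (but do not send) a JSON POST request for a served model."""
    # Assumed payload shape: a model name plus named inputs under "data".
    payload = {"model_name": model_name, "data": inputs}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical local endpoint and input names.
request = build_predict_request(
    "http://127.0.0.1:8500",
    {"keys": [[1.0]], "features": [[1.0, 2.0, 3.0]]},
)
print(request.get_method(), request.full_url)
```

To send it against a live server, pass the object to urllib.request.urlopen(request) and decode the JSON response body.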

Highlighted Details

  • Supports TensorFlow, MXNet, PyTorch, ONNX, Scikit-learn, XGBoost, PMML, and more.
  • Offers GPU acceleration and dynamic loading of custom TensorFlow ops.
  • Generates clients in various languages (Python, Go, JavaScript, etc.) from models.
  • Provides basic authentication and TLS/SSL support.
  • Handles image models via raw file uploads.
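
For the basic authentication feature, a client would typically attach a standard HTTP Basic Authorization header to each request. The sketch below shows how that header is formed. The helper name and the credentials are hypothetical, and how the server is configured to require authentication is not covered here.

```python
import base64


def basic_auth_header(username, password):
    """Return an HTTP Basic Authorization header for the given credentials."""
    raw = f"{username}:{password}".encode("utf-8")
    token = base64.b64encode(raw).decode("ascii")
    return {"Authorization": "Basic " + token}


# Placeholder credentials for illustration only.
headers = basic_auth_header("user", "secret")
print(headers["Authorization"])
```

Because Basic credentials are only base64-encoded, not encrypted, they should be sent over a TLS connection.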

Maintenance & Community

The project is hosted on GitHub and welcomes contributions via issues and pull requests.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Although the project claims performance similar to TensorFlow Serving, benchmarks indicate that TensorFlow Serving performs better, especially on GPUs. The project does not state its maintenance status or whether it is under active development.

Health Check

  • Last commit: 6 months ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 0 stars in the last 30 days
