truss  by basetenlabs

Model deployment tool for productionizing AI/ML models

Created 3 years ago
1,065 stars

Top 35.5% on SourcePulse

GitHubView on GitHub
Project Summary

Truss simplifies AI/ML model deployment for developers and ML engineers by providing a unified framework for packaging, testing, and serving models across diverse Python frameworks. It aims to streamline the productionization process, enabling faster iteration and reliable deployment with a consistent experience from development to production.

How It Works

Truss packages model code, weights, and dependencies into a self-contained "Truss" that includes a Python-based model server. This server exposes load() and predict() methods, abstracting away complex infrastructure concerns like Docker and Kubernetes. The config.yaml file specifies dependencies and server configurations, allowing Truss to manage the environment and dependencies automatically.

Quick Start & Requirements

  • Install Truss: pip install --upgrade truss
  • Create a Truss: truss init <your-truss-name>
  • Dependencies: Specified in config.yaml (e.g., torch, transformers).
  • Deployment: Requires a Baseten API key for deployment to Baseten's platform.
  • Example: text-classification

Highlighted Details

  • Supports all Python ML frameworks, including Hugging Face Transformers, PyTorch, TensorFlow, and NVIDIA Triton.
  • Provides a fast developer loop with live reload for rapid iteration.
  • Offers pre-built examples for popular models like Llama 2, Stable Diffusion XL, and Whisper.
  • Enables easy deployment to Baseten, with future support for AWS SageMaker planned.

Maintenance & Community

Truss is maintained by Baseten and actively welcomes community contributions. Further details on contributions and community engagement can be found in their contributor's guide and code of conduct.

Licensing & Compatibility

The project is licensed under the Apache-2.0 license, which permits commercial use and linking with closed-source projects.

Limitations & Caveats

Currently, Baseten is the primary deployment target, with other cloud providers like AWS SageMaker planned for future integration.

Health Check
Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
59
Issues (30d)
0
Star History
24 stars in the last 30 days

Explore Similar Projects

Starred by Eugene Yan Eugene Yan(AI Scientist at AWS), Jared Palmer Jared Palmer(Ex-VP AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), and
4 more.

seldon-core by SeldonIO

0.2%
5k
MLOps framework for production model deployment on Kubernetes
Created 7 years ago
Updated 13 hours ago
Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
3 more.

serve by pytorch

0.1%
4k
Serve, optimize, and scale PyTorch models in production
Created 6 years ago
Updated 1 month ago
Feedback? Help us improve.