truss by basetenlabs

Model deployment tool for productionizing AI/ML models

Created 3 years ago

1,104 stars

Top 34.6% on SourcePulse

View on GitHub

9 Experts Love This Project

Amanpreet Singh

Cofounder of Contextual AI

Yineng Zhang

Inference Lead at SGLang; Research Scientist at Together AI

Gabriel Almeida

Cofounder of Langflow

Jeff Hammerbacher

Cofounder of Cloudera

and 5 more!

Project Summary

Truss simplifies AI/ML model deployment for developers and ML engineers by providing a unified framework for packaging, testing, and serving models across diverse Python frameworks. It aims to streamline the productionization process, enabling faster iteration and reliable deployment with a consistent experience from development to production.

How It Works

Truss packages model code, weights, and dependencies into a self-contained "Truss" that includes a Python-based model server. This server exposes load() and predict() methods, abstracting away complex infrastructure concerns like Docker and Kubernetes. The config.yaml file specifies dependencies and server configurations, allowing Truss to manage the environment and dependencies automatically.

Quick Start & Requirements

Install Truss: pip install --upgrade truss
Create a Truss: truss init <your-truss-name>
Dependencies: Specified in config.yaml (e.g., torch, transformers).
Deployment: Requires a Baseten API key for deployment to Baseten's platform.
Example: text-classification

Highlighted Details

Supports all Python ML frameworks, including Hugging Face Transformers, PyTorch, TensorFlow, and NVIDIA Triton.
Provides a fast developer loop with live reload for rapid iteration.
Offers pre-built examples for popular models like Llama 2, Stable Diffusion XL, and Whisper.
Enables easy deployment to Baseten, with future support for AWS SageMaker planned.

Maintenance & Community

Truss is maintained by Baseten and actively welcomes community contributions. Further details on contributions and community engagement can be found in their contributor's guide and code of conduct.

Licensing & Compatibility

The project is licensed under the Apache-2.0 license, which permits commercial use and linking with closed-source projects.

Limitations & Caveats

Currently, Baseten is the primary deployment target, with other cloud providers like AWS SageMaker planned for future integration.

Health Check

Last Commit

2 days ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History