truss  by basetenlabs

Model deployment tool for productionizing AI/ML models

created 3 years ago
1,034 stars

Top 36.9% on sourcepulse

GitHubView on GitHub
Project Summary

Truss simplifies AI/ML model deployment for developers and ML engineers by providing a unified framework for packaging, testing, and serving models across diverse Python frameworks. It aims to streamline the productionization process, enabling faster iteration and reliable deployment with a consistent experience from development to production.

How It Works

Truss packages model code, weights, and dependencies into a self-contained "Truss" that includes a Python-based model server. This server exposes load() and predict() methods, abstracting away complex infrastructure concerns like Docker and Kubernetes. The config.yaml file specifies dependencies and server configurations, allowing Truss to manage the environment and dependencies automatically.

Quick Start & Requirements

  • Install Truss: pip install --upgrade truss
  • Create a Truss: truss init <your-truss-name>
  • Dependencies: Specified in config.yaml (e.g., torch, transformers).
  • Deployment: Requires a Baseten API key for deployment to Baseten's platform.
  • Example: text-classification

Highlighted Details

  • Supports all Python ML frameworks, including Hugging Face Transformers, PyTorch, TensorFlow, and NVIDIA Triton.
  • Provides a fast developer loop with live reload for rapid iteration.
  • Offers pre-built examples for popular models like Llama 2, Stable Diffusion XL, and Whisper.
  • Enables easy deployment to Baseten, with future support for AWS SageMaker planned.

Maintenance & Community

Truss is maintained by Baseten and actively welcomes community contributions. Further details on contributions and community engagement can be found in their contributor's guide and code of conduct.

Licensing & Compatibility

The project is licensed under the Apache-2.0 license, which permits commercial use and linking with closed-source projects.

Limitations & Caveats

Currently, Baseten is the primary deployment target, with other cloud providers like AWS SageMaker planned for future integration.

Health Check
Last commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
70
Issues (30d)
0
Star History
62 stars in the last 90 days

Explore Similar Projects

Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
8 more.

higgsfield by higgsfield-ai

0.3%
3k
ML framework for large model training and GPU orchestration
created 7 years ago
updated 1 year ago
Starred by Eugene Yan Eugene Yan(AI Scientist at AWS), Jared Palmer Jared Palmer(Ex-VP of AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), and
3 more.

seldon-core by SeldonIO

0.1%
5k
MLOps framework for production model deployment on Kubernetes
created 7 years ago
updated 1 day ago
Feedback? Help us improve.