tensorzero by tensorzero

LLMOps framework for optimizing LLM applications via production data feedback

created 1 year ago
9,162 stars

Top 5.6% on sourcepulse

Project Summary

TensorZero is an open-source framework designed to create a feedback loop for optimizing Large Language Model (LLM) applications. It targets engineers and researchers building production-grade LLM systems, enabling them to leverage production data for smarter, faster, and cheaper models.

How It Works

TensorZero unifies several key components of the LLMOps lifecycle: an LLM gateway for accessing diverse models, an observability layer to capture inference metrics and feedback, an optimization engine for prompts and models (including fine-tuning and RL), and an evaluation framework for comparing different strategies. This integrated approach aims to create a compounding data and learning flywheel, allowing systems to improve over time based on real-world usage. The core gateway is built in Rust for low-latency performance.
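The flywheel described above can be sketched in miniature. The snippet below is a conceptual illustration only, with hypothetical names rather than TensorZero's actual API: serve competing prompt variants, attach production feedback to each inference, and route future traffic toward whichever variant performs best.

```python
import random
from collections import defaultdict

# Conceptual sketch only -- variant names and functions are hypothetical,
# not TensorZero's API. It illustrates the flywheel the framework automates:
# serve variants, record production feedback, route toward what works.
VARIANTS = ["prompt_v1", "prompt_v2"]
feedback = defaultdict(list)  # variant -> recorded metric values

def record_feedback(variant, metric_value):
    """Attach a production metric (thumbs-up, task success, ...) to a variant."""
    feedback[variant].append(metric_value)

def best_variant():
    """Route future traffic to the variant with the best observed mean metric."""
    means = {v: sum(s) / len(s) for v, s in feedback.items() if s}
    return max(means, key=means.get) if means else random.choice(VARIANTS)

# Simulated A/B traffic: prompt_v2 succeeds more often than prompt_v1.
random.seed(0)
for i in range(200):
    v = VARIANTS[i % 2]
    record_feedback(v, 1.0 if random.random() < (0.8 if v == "prompt_v2" else 0.4) else 0.0)

print(best_variant())
```

In TensorZero itself, the "record feedback" and "optimize" steps are backed by the ClickHouse observability store and the optimization engine rather than an in-memory dictionary, but the loop has the same shape.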

Quick Start & Requirements

  • Install: pip install tensorzero
  • Prerequisites: A ClickHouse database is required for observability. The gateway integrates with any OpenAI-compatible API.
  • Setup: The Quick Start guide claims a 5-minute setup, going from a basic OpenAI wrapper to a production-ready application with observability and fine-tuning.
  • Links: Quick Start, Comprehensive Tutorial, Deployment Guide.

Highlighted Details

  • Unified gateway supports numerous LLM providers (Anthropic, AWS Bedrock, Azure OpenAI, Gemini, Mistral, vLLM, etc.) and OpenAI-compatible APIs.
  • Rust-based gateway boasts <1ms P99 latency overhead at 10k QPS.
  • Features include A/B testing, fallbacks, prompt templating, batch inference, multimodal support, and GitOps configuration.
  • Optimization capabilities extend to supervised fine-tuning (SFT), preference fine-tuning (DPO), and inference-time optimizations like Best-of-N sampling and Dynamic In-Context Learning (DICL).
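Of these, Best-of-N sampling is easy to illustrate in a few lines. The sketch below is conceptual, with hypothetical `generate` and `score` callables rather than TensorZero's implementation: generate several candidate completions and keep the one a scoring function ranks highest.

```python
# Minimal Best-of-N sampling sketch -- the generate/score functions are
# hypothetical stand-ins, not TensorZero's implementation. Generate N
# candidates, score each (judge model, heuristic, ...), return the best.
def best_of_n(generate, score, prompt, n=4):
    candidates = [generate(prompt) for _ in range(n)]
    return max(candidates, key=score)

# Toy example: "generation" cycles canned drafts; "scoring" prefers longer answers.
drafts = iter(["ok", "a fuller answer", "short", "the most complete answer here"])
result = best_of_n(lambda p: next(drafts), len, "What is TensorZero?", n=4)
print(result)  # -> "the most complete answer here"
```

In practice the scorer would be a judge model or task metric rather than string length, and the trade-off is straightforward: N inference calls buy one higher-quality response.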

Maintenance & Community

  • Backed by investors of prominent open-source projects and AI labs.
  • Team includes a former Rust compiler maintainer and researchers from top universities.
  • Community channels available via Slack and Discord.

Licensing & Compatibility

  • The project is 100% open-source and self-hosted, with no paid features. The README does not explicitly state the license; check the repository's LICENSE file before assuming commercial compatibility.

Limitations & Caveats

  • The gateway itself is written in Rust, but the client library and other integrations are Python-based. A ClickHouse database is a mandatory dependency for the observability features.

Health Check

  • Last commit: 23 hours ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 198
  • Issues (30d): 63
  • Star History: 5,511 stars in the last 90 days

Explore Similar Projects

Starred by Jeff Hammerbacher (Cofounder of Cloudera), Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), and 2 more.

serve by pytorch

Top 0.1% · 4k stars
Serve, optimize, and scale PyTorch models in production
created 5 years ago · updated 3 weeks ago
Starred by Patrick von Platen (Core Contributor to Hugging Face Transformers and Diffusers), Michael Han (Cofounder of Unsloth), and 1 more.

ktransformers by kvcache-ai

Top 0.4% · 15k stars
Framework for LLM inference optimization experimentation
created 1 year ago · updated 2 days ago
Starred by Andrej Karpathy (Founder of Eureka Labs; formerly at Tesla, OpenAI; author of CS 231n), Nat Friedman (Former CEO of GitHub), and 32 more.

llama.cpp by ggml-org

Top 0.4% · 84k stars
C/C++ library for local LLM inference
created 2 years ago · updated 10 hours ago