timber by kossisoroyce

Compiles classical ML models into fast, native C inference code

Created 2 weeks ago

632 stars

Top 52.4% on SourcePulse

Project Summary

Timber addresses the need for high-performance, portable inference for classical machine learning models by compiling them into optimized native C99 code. It targets teams in fraud detection, edge computing, and regulated industries requiring fast, predictable, and auditable model deployments. The primary benefit is a significant reduction in inference latency and runtime overhead compared to traditional Python-based serving.

How It Works

Timber employs an Ahead-of-Time (AOT) compilation strategy, transforming trained models from frameworks like XGBoost, LightGBM, scikit-learn, CatBoost, and ONNX (specifically TreeEnsemble operators) into standalone C99 inference code. This compiled code is then served via a local HTTP API, following an Ollama-style workflow for loading and querying models. This approach eliminates the Python runtime from the critical inference path, enabling microsecond-level latency.

Quick Start & Requirements

  • Primary install: pip install timber-compiler
  • Load model: timber load <model_file> --name <model_name>
  • Serve model: timber serve <model_name>
  • Supported formats include XGBoost JSON, LightGBM text model files, scikit-learn pickle, ONNX TreeEnsemble, and CatBoost JSON.
  • Docs: https://kossisoroyce.github.io/timber/

Highlighted Details

  • Achieves up to 336x faster inference compared to Python XGBoost for single-sample predictions.
  • Native inference latency is approximately 2 µs.
  • Generates small artifacts, around 48 KB for an example model.
  • Ideal for edge/embedded deployments and environments demanding deterministic, auditable artifacts.

Maintenance & Community

  • Includes contributing guidelines, a code of conduct, and a security policy.
  • Development setup via pip install -e ".[dev]".
  • The project seeks community support and donations via https://buymeacoffee.com/electricsheepafrica.

Licensing & Compatibility

  • Licensed under Apache-2.0, which is permissive for commercial use and integration into closed-source projects.

Limitations & Caveats

  • ONNX support is currently limited to TreeEnsemble operators.
  • CatBoost support requires JSON exports, not native binary formats.
  • scikit-learn parsing may encounter issues with uncommon or custom estimator wrappers; pickle loading requires trusted artifacts.
  • XGBoost primarily accepts JSON model inputs.
  • Optional benchmark backends require separate installation.

Health Check

  • Last Commit: 1 week ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 1

Star History

  • 635 stars in the last 14 days
