ort by pytorch

PyTorch extension for ONNX Runtime model acceleration

created 4 years ago
364 stars

Top 78.4% on sourcepulse

View on GitHub
Project Summary

This library accelerates PyTorch model training and inference using ONNX Runtime and Intel® OpenVINO™. It targets PyTorch developers seeking to reduce training time, scale large models, and optimize inference performance, particularly on Intel hardware.

How It Works

The library provides ORTModule for training, which converts PyTorch models to ONNX format and leverages ONNX Runtime for accelerated execution. It also includes optimized optimizers like FusedAdam and FP16_Optimizer, and a LoadBalancingDistributedSampler for efficient data loading in distributed training. For inference, ORTInferenceModule enables ONNX Runtime with the OpenVINO™ Execution Provider, targeting Intel CPUs, GPUs, and VPUs with options for precision (FP32/FP16) and backend.
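
A minimal training sketch, assuming torch-ort is installed and configured. ORTModule is the wrapper named above; the toy model, loss function, and optimizer here are illustrative only.

  import torch
  from torch_ort import ORTModule  # wraps an nn.Module so forward/backward run via ONNX Runtime

  device = "cuda"  # training acceleration currently targets NVIDIA/AMD GPUs (see Limitations)

  # Illustrative toy model; any standard nn.Module can be wrapped.
  model = torch.nn.Sequential(
      torch.nn.Linear(64, 128),
      torch.nn.ReLU(),
      torch.nn.Linear(128, 10),
  ).to(device)
  model = ORTModule(model)

  optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
  loss_fn = torch.nn.CrossEntropyLoss()

  inputs = torch.randn(32, 64, device=device)
  targets = torch.randint(0, 10, (32,), device=device)

  optimizer.zero_grad()
  loss = loss_fn(model(inputs), targets)  # first call triggers ONNX export and graph optimization
  loss.backward()
  optimizer.step()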

Quick Start & Requirements
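A hedged setup sketch; the package name and configure step follow the project's published install instructions, but versions, CUDA/ROCm builds, and the separate inference package should be verified against the current README.

  # Assumed install steps for the training package:
  #   pip install torch-ort
  #   python -m torch_ort.configure
  import torch
  from torch_ort import ORTModule

  model = ORTModule(torch.nn.Linear(8, 2).to("cuda"))   # drop-in wrapper around any nn.Module
  print(model(torch.randn(4, 8, device="cuda")).shape)  # forward pass executes through ONNX Runtime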

Highlighted Details

  • Reduces PyTorch training time and GPU cost for large transformer models.
  • Supports FusedAdam, FP16_Optimizer, and LoadBalancingDistributedSampler for training efficiency.
  • Inference acceleration via OpenVINO™ Execution Provider on Intel hardware (CPU, GPU, VPU); a sketch follows after this list.
  • Offers Mixture of Experts (MoE) layer implementation, usable standalone or with ONNX Runtime.
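
A hedged inference sketch targeting the OpenVINO Execution Provider. ORTInferenceModule and OpenVINOProviderOptions are the names used in the project's inference documentation, but their exact signatures, the backend/precision strings, and the torchvision model chosen here are assumptions.

  import torch
  import torchvision
  from torch_ort import ORTInferenceModule, OpenVINOProviderOptions

  # Assumed options: backend one of "CPU"/"GPU"/"VPU", precision "FP32" or "FP16".
  provider_options = OpenVINOProviderOptions(backend="CPU", precision="FP32")

  model = torchvision.models.resnet50(weights=None).eval()
  model = ORTInferenceModule(model, provider_options=provider_options)

  with torch.no_grad():
      output = model(torch.randn(1, 3, 224, 224))  # executes through the OpenVINO EP
  print(output.shape)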

Maintenance & Community

  • Actively maintained by the PyTorch and ONNX Runtime teams.
  • CI checks for API stability.
  • Contribution guide available.

Licensing & Compatibility

  • MIT License.
  • Compatible with commercial and closed-source applications.

Limitations & Caveats

Training acceleration currently requires NVIDIA or AMD GPUs. The inference package has specific OS and Python version requirements.

Health Check

  • Last commit: 5 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 1
  • Star History: 6 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Jeff Hammerbacher (Cofounder of Cloudera), and 2 more.

hummingbird by microsoft

0.0%
3k stars
Compiler for trained ML models into tensor computation
created 5 years ago
updated 2 weeks ago
Starred by Aravind Srinivas (Cofounder of Perplexity), Stas Bekman (Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), and 12 more.

DeepSpeed by deepspeedai

0.2%
40k stars
Deep learning optimization library for distributed training and inference
created 5 years ago
updated 1 day ago