openvino  by openvinotoolkit

Open source toolkit for optimizing and deploying AI inference

created 6 years ago
8,657 stars

Top 6.0% on sourcepulse

GitHubView on GitHub
Project Summary

OpenVINO™ is an open-source toolkit designed to optimize and deploy deep learning models across various hardware, including CPUs, GPUs, and NPUs. It targets developers and researchers seeking to boost inference performance for computer vision, NLP, and generative AI tasks, offering broad framework compatibility and flexible deployment options from edge to cloud.

How It Works

OpenVINO employs a two-stage process: model conversion and inference optimization. It converts models trained in frameworks like PyTorch, TensorFlow, and ONNX into an intermediate representation (IR). This IR is then optimized for specific hardware targets using techniques like quantization and layer fusion, enabling efficient execution on Intel hardware and beyond.

Quick Start & Requirements

Highlighted Details

  • Supports models from PyTorch, TensorFlow, ONNX, Keras, PaddlePaddle, and JAX/Flax.
  • Integrates with Hugging Face Optimum, Torch.compile, vLLM, ONNX Runtime, LlamaIndex, LangChain, and Keras 3.
  • Offers a dedicated GenAI API and repository for generative AI applications.
  • Includes tools like NNCF for advanced optimization and OVMS for model serving.

Maintenance & Community

  • Active community with contributions welcomed via GitHub Issues.
  • Support available on the Intel DevHub Discord server.
  • Resources include a blog, cheat sheet, and performance benchmarks.

Licensing & Compatibility

  • Licensed under Apache License Version 2.0.
  • Permissive license suitable for commercial use and integration into closed-source applications.

Limitations & Caveats

While supporting a wide range of hardware, optimal performance is typically achieved on Intel architectures. The toolkit collects telemetry data by default, which can be opted out of.

Health Check
Last commit

23 hours ago

Responsiveness

1 day

Pull Requests (30d)
444
Issues (30d)
48
Star History
467 stars in the last 90 days

Explore Similar Projects

Starred by Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers), Michael Han Michael Han(Cofounder of Unsloth), and
1 more.

ktransformers by kvcache-ai

0.4%
15k
Framework for LLM inference optimization experimentation
created 1 year ago
updated 2 days ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), and
5 more.

TensorRT-LLM by NVIDIA

0.6%
11k
LLM inference optimization SDK for NVIDIA GPUs
created 1 year ago
updated 13 hours ago
Feedback? Help us improve.