Deep-Learning-in-Production  by ahkarami

Notes and references for deploying deep learning models to production

created 7 years ago
4,357 stars

Top 11.4% on sourcepulse

GitHubView on GitHub
Project Summary

This repository serves as a curated collection of notes, references, and tutorials for deploying deep learning models into production environments. It targets engineers and researchers involved in MLOps, aiming to provide a comprehensive guide to various frameworks, tools, and best practices for model serving, optimization, and deployment across different platforms.

How It Works

The repository organizes resources by deep learning framework (PyTorch, TensorFlow, Keras, MXNet), deployment target (web, mobile, embedded), and supporting technologies (serving frameworks, containerization, MLOps tools). It highlights conversion techniques between frameworks (e.g., ONNX), optimization strategies (quantization, pruning), and infrastructure considerations (Kubernetes, AWS Lambda).

Quick Start & Requirements

This repository is a collection of links and notes, not a runnable project. No installation or execution commands are provided.

Highlighted Details

  • Extensive coverage of PyTorch and TensorFlow deployment patterns, including C++ APIs and serving with Flask/TorchServe.
  • Resources on model conversion and interoperability using ONNX and MMdnn.
  • Detailed sections on mobile/embedded deployment (ncnn, TensorFlow Lite) and containerized/orchestrated deployments (Docker, Kubernetes, Kubeflow, Seldon Core).
  • Focus on performance optimization techniques like quantization, pruning, and hardware acceleration (NVIDIA Triton, TensorRT, OpenVINO).

Maintenance & Community

The repository is maintained by ahkarami. No specific community channels or active development signals are present in the README.

Licensing & Compatibility

The repository itself contains links to various open-source projects, each with its own license. The content is for informational purposes and does not impose a specific license on the user's projects.

Limitations & Caveats

This is a curated list of external resources, not a unified framework or tool. Users must individually evaluate and integrate the linked projects. The content may not reflect the latest advancements or best practices in the rapidly evolving MLOps landscape.

Health Check
Last commit

8 months ago

Responsiveness

1+ week

Pull Requests (30d)
0
Issues (30d)
0
Star History
22 stars in the last 90 days

Explore Similar Projects

Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
8 more.

higgsfield by higgsfield-ai

0.3%
3k
ML framework for large model training and GPU orchestration
created 7 years ago
updated 1 year ago
Feedback? Help us improve.