mlops-for-devops by techiescamp

Hands-on MLOps guide for DevOps engineers

Created 1 month ago
262 stars

Top 97.0% on SourcePulse

View on GitHub
Project Summary

MLOps for DevOps Engineers provides a hands-on, project-based guide to Machine Learning Operations tailored for DevOps, Platform, and SRE engineers, requiring no prior ML background. Concepts are explained via familiar DevOps analogies, enabling effective operation of ML workloads in production by bridging the gap between ML and traditional infrastructure practices.

How It Works

This project flips the typical MLOps resource by focusing on infrastructure and operations for ML, not ML theory. It uses a project-based approach with a real-world employee attrition prediction use case to illustrate concepts. All components run on Kubernetes and Docker, leveraging familiar DevOps tooling. The core approach emphasizes building ML foundations locally, then transitioning to production-grade orchestration, model serving, and monitoring.

Quick Start & Requirements

Prerequisites include intermediate proficiency in the Linux CLI, Docker, Kubernetes, and Git, plus basic-to-intermediate AWS and basic Python (reading and running scripts) skills. No ML expertise is required, as the material teaches those concepts. The project is structured into phases and steps with detailed guides; setup assumes a Kubernetes/Docker environment.

Highlighted Details

  • The project covers three main tracks: Traditional ML (training, serving, automating, monitoring models on Kubernetes), Foundational Models (serving LLMs using vLLM, TGI, Ollama), and LLM-Powered DevOps (Kubernetes monitoring, RAG pipelines, agents).
  • Phase 1 focuses on local ML development and data pipelines, building a complete ML foundation from raw data to a trained, tested model.
  • Phase 2 addresses enterprise orchestration, aiming to replace manual workflows with production-grade systems for data versioning (DVC, S3), automated pipelines (Airflow on Kubernetes), and experiment tracking (MLflow).
  • The tech stack spans Python (Pandas, scikit-learn, XGBoost), FastAPI, KServe, MLflow, Kubeflow Pipelines, Prometheus, Grafana, Evidently AI, Kubernetes, Helm, GitHub Actions, and LLM serving tools like vLLM, TGI, and Ollama.
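To make the "local ML foundation" idea in Phase 1 concrete, here is a minimal sketch of training and evaluating an attrition classifier with the repo's stack (Pandas, scikit-learn). The column names and data are illustrative stand-ins, not taken from the project's actual dataset.

```python
# Minimal sketch: train and evaluate an attrition classifier locally,
# before moving to orchestrated pipelines (Airflow, MLflow) in Phase 2.
# All feature names and values below are hypothetical.
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the employee attrition dataset
df = pd.DataFrame({
    "tenure_years": [1, 5, 2, 8, 1, 10, 3, 7, 2, 6],
    "satisfaction": [2, 4, 3, 5, 1, 5, 2, 4, 1, 5],
    "attrition":    [1, 0, 1, 0, 1, 0, 1, 0, 1, 0],
})

X_train, X_test, y_train, y_test = train_test_split(
    df[["tenure_years", "satisfaction"]], df["attrition"],
    test_size=0.3, random_state=42, stratify=df["attrition"],
)

model = LogisticRegression().fit(X_train, y_train)
acc = accuracy_score(y_test, model.predict(X_test))
print(f"holdout accuracy: {acc:.2f}")
```

In the guide's workflow, a model like this would next be wrapped in a FastAPI service and served on Kubernetes (e.g., via KServe); the sketch above covers only the local training step.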

Maintenance & Community

No specific details on active contributors, sponsorships, or community channels (e.g., Discord/Slack) are provided in the README.

Licensing & Compatibility

The project employs a dual licensing model: Apache 2.0 for code (scripts, configs, manifests) and All Rights Reserved for content (README, guides, docs). Commercial use of content requires contacting contact@devopscube.com.

Limitations & Caveats

Several key tracks and phases are marked as 'In Progress' (🔄) or 'Planned' (🔜), including Enterprise Orchestration, Monitoring & Observation, Foundational Models, LLM Serving & Scaling, and LLM-Powered DevOps, indicating ongoing development. The 'All Rights Reserved' content license may impose restrictions on commercial redistribution or use of documentation.

Health Check
Last Commit

3 days ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
0
Star History
39 stars in the last 30 days

Explore Similar Projects

Starred by Chris Lattner (author of LLVM, Clang, Swift, Mojo, MLIR; cofounder of Modular), Tobi Lutke (cofounder of Shopify), and 13 more.

modular by modular

0.1%
26k
AI toolchain unifying fragmented AI deployment workflows
Created 2 years ago
Updated 1 day ago