MLOps course for building production-ready ML batch systems
Top 40.0% on sourcepulse
This repository provides a free, 7-step MLOps course focused on building and deploying an end-to-end ML batch system for energy consumption forecasting. It targets intermediate to advanced Machine Learning Engineers and Software Engineers transitioning to MLE, offering practical experience with production-ready ML systems.
How It Works
The course guides users through designing, building, training, serving, and monitoring a batch ML system. It emphasizes MLOps best practices, integrating tools like Hopsworks (feature store), Weights & Biases (experiment tracking), Docker, Airflow (orchestration), and GitHub Actions (CI/CD). The approach is modular, covering feature engineering, training pipelines with hyperparameter tuning, batch prediction, data validation with Great Expectations, and deployment to GCP.
Quick Start & Requirements
.env
files with API keys and credentials for various services. Estimated GCP deployment cost is ~$20.Highlighted Details
Maintenance & Community
The project is maintained by the author, with opportunities for community contributions via GitHub Issues or Pull Requests. Direct contact is available via LinkedIn.
Licensing & Compatibility
Limitations & Caveats
The underlying energy data API is becoming obsolete, but a static dataset from 2020-2023 is provided. The course material is hosted on Medium, which may have a paywall. macOS M1/M2 users may encounter Poetry environment issues, with a provided script to mitigate them.
1 year ago
1 day