NBA game win probability predictor app
Top 98.0% on sourcepulse
This project provides an end-to-end machine learning deployment for predicting NBA game win probabilities, targeting individuals seeking to demonstrate ML deployment skills or explore sports analytics. It offers a daily prediction service via a Streamlit app, aiming to improve betting strategy profitability.
How It Works
The system leverages historical NBA game data, primarily from games.csv
, to train gradient boosted tree models (XGBoost and LightGBM). Key features are engineered as rolling statistics (e.g., average points, win streaks) for teams, calculated from previous games to avoid data leakage. Model probabilities are calibrated using sklearn.CalibratedClassifierCV
to align with true probabilities. A daily pipeline, orchestrated by GitHub Actions, scrapes new game data, updates features, and retrains the model, with predictions served through a Streamlit application.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is a personal portfolio piece. The author notes a temporary removal of Hopsworks feature store and model registry due to stability concerns. Feedback is actively sought via LinkedIn and Twitter.
Licensing & Compatibility
The repository does not explicitly state a license. Standard GitHub repository practices apply. Commercial use would require clarification on licensing terms.
Limitations & Caveats
The project is described as a "work in progress" with ongoing iterations. While the model achieved 61.5% accuracy on the 2022-2023 regular season, it lags behind top public models (65.6%). The author acknowledges that a real-world betting strategy is significantly more complex than this model alone.
1 month ago
1 day