Data-Science-for-COVID-19  by ThisIsIsaac

COVID-19 dataset and visualizer for Korean epidemics

created 5 years ago
277 stars

Top 94.5% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides a comprehensive dataset for COVID-19 analysis, focusing on patient routes and broader medical statistics in Korea. It aims to facilitate data-driven insights for researchers and public health professionals by consolidating and visualizing complex health information.

How It Works

The project collects and processes data from the Korean CDC and local governments, structuring it for easy analysis. Key components include patient movement data, demographic information, and extensive medical statistics covering epidemics, vaccines, chronic diseases, and healthcare facilities. A planned multi-variate, time-scrollable visualizer will enable interactive exploration of this data.

Quick Start & Requirements

  • Install: No specific installation instructions are provided in the README. The project appears to be data-centric, likely requiring data analysis tools like Python with libraries such as Pandas, Matplotlib, or Seaborn.
  • Prerequisites: Access to the datasets is implied, potentially via Kaggle or direct download. No specific software versions or hardware requirements are listed.
  • Resources: The README mentions a Kaggle dataset, which may require a Kaggle account.

Highlighted Details

  • Official partnership with the Korean CDC is in progress for more accurate and up-to-date data.
  • Includes data on 22 major epidemics, 16 vaccines, 7 chronic diseases, and 5 major cancers.
  • Features patient route data and regional patient counts, with plans for time-based visualization.
  • Has been featured in media outlets like ZDNet Korea and The Washington Post, and in blog posts by Databricks and DataRobot.

Maintenance & Community

The project lists a large team of research directors, engineers, and former maintainers. It has participated in several competitions and has academic and media partnerships. A Slack community is mentioned via a sponsor.

Licensing & Compatibility

  • License: CC BY-NC-SA 4.0 (Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International).
  • Restrictions: This license prohibits commercial use and requires any derivative works to be shared under the same license.

Limitations & Caveats

The visualizer functionality is listed as "Coming soon," indicating it is not yet available. Daily updates are also planned but not yet implemented. The project's focus is primarily on Korean data.

Health Check
Last commit

4 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.