Curriculum for learning data science
Top 1.3% on sourcepulse
This repository provides a comprehensive 10-week, 20-lesson curriculum designed to introduce beginners to the field of data science. It covers fundamental concepts, ethical considerations, data handling, visualization, and real-world applications, targeting individuals new to data science, including students and educators. The project's project-based pedagogy and integrated quizzes aim to enhance learning retention and practical skill development.
How It Works
The curriculum employs a project-based learning approach, where each lesson includes pre- and post-lesson quizzes, written instructions, solutions, and assignments. This methodology reinforces learning through practical application. Key topics include data ethics, statistics, working with relational and NoSQL data, Python for data exploration (using libraries like Pandas), data preparation, various data visualization techniques with Matplotlib, the data science lifecycle, and cloud-based data science with Azure ML Studio.
Quick Start & Requirements
docsify serve
in the root folder. Notebooks require a separate Python kernel (e.g., in VS Code).Highlighted Details
Maintenance & Community
The project is maintained by Azure Cloud Advocates at Microsoft, with contributions from numerous authors, reviewers, and student ambassadors. Feedback is welcomed in the discussion forum. Related curricula on Generative AI, ML, Cybersecurity, and more are also available.
Licensing & Compatibility
The repository's license is not explicitly stated in the provided README text. Compatibility for commercial use or closed-source linking would require clarification of the licensing terms.
Limitations & Caveats
The README mentions that quizzes are gradually being localized, implying that some may not yet be available in all languages. Notebooks are not rendered by Docsify and require separate execution environments.
2 weeks ago
Inactive