cs249r_book  by harvard-edge

AI Engineering textbook and learning stack

Created 2 years ago
23,549 stars

Top 1.9% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides a comprehensive, open-source textbook and learning stack for Machine Learning Systems Engineering, addressing the critical gap between AI model development and robust, real-world deployment. Aimed at engineers, researchers, and students, it establishes AI engineering as a foundational discipline by teaching the principles and practices of building efficient, reliable, and safe intelligent systems. The project offers a structured curriculum with hands-on components, enabling learners to master both theoretical concepts and practical implementation.

How It Works

The project is centered around a textbook being restructured into two focused volumes: Volume I covers foundational single-machine ML systems, while Volume II delves into distributed systems and production infrastructure at scale. Development occurs openly on the dev branch, allowing community observation and contribution. The learning stack complements the textbook with TinyTorch, a framework for understanding ML internals by implementing them from scratch, and hardware kits for hands-on deployment on edge devices like Arduino and Raspberry Pi.

Quick Start & Requirements

Highlighted Details

  • Restructuring into two volumes: Volume I (Foundations) and Volume II (Scale), with Volume I content complete and undergoing polish.
  • "Working in the Open" development model on the dev branch, providing transparency into the textbook's creation process.
  • TinyTorch framework allows users to build ML frameworks from scratch, fostering a deep understanding of underlying mechanisms.
  • Hardware kits and labs enable practical engineering experience under real-world constraints like memory, power, and timing.

Maintenance & Community

The project is actively developed on the dev branch, with a commitment to transparency. Community engagement is encouraged through GitHub Discussions. The broader ML Systems Ecosystem is accessible via https://mlsysbook.org.

Licensing & Compatibility

The book content is licensed under CC BY-NC-ND 4.0, permitting free sharing with attribution but prohibiting commercial use and derivative works. The TinyTorch code is licensed under Apache 2.0, allowing for free use, modification, and distribution, including in commercial projects.

Limitations & Caveats

Volume II is currently under active development, with content still being written and revised, meaning it may contain rough edges or placeholder figures. Components like Software Co-Labs and the AI Olympics are slated for release in 2026. The dev branch represents ongoing work and may not be as stable as the main branch, which hosts the last stable release.

Health Check
Last Commit

23 hours ago

Responsiveness

Inactive

Pull Requests (30d)
75
Issues (30d)
28
Star History
1,058 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.