mirror of https://github.com/harvard-edge/cs249r_book.git synced 2026-05-07 02:03:55 -05:00

Go to file

Vijay Janapa Reddi 68f2ca4664 Merge feat/staffml-visual-questions into dev

Visual-question infrastructure + practice-page layout restructure,
shipped together because the redesign exists specifically to make
room for visuals in the reading flow (context -> diagram -> ask).

Three commits:
  fb2a57e12 - Schema (Pydantic + LinkML) gains optional Visual model
              with path-traversal guard and a11y-required alt.
              Legacy exporter passes metadata to the summary bundle;
              vault build --legacy-json mirrors SVG assets from
              interviews/vault/visuals/<track>/ to
              interviews/staffml/public/question-visuals/<track>/.
              QuestionVisual React component renders the diagram
              between scenario and answer with graceful fallback.

  cb0b7ea30 - Practice-page layout restructure per 4-reviewer UX
              pass (Emma beginner, David power user, Chip Huyen
              practitioner; Soumith's agent bounced). Left column
              is now the read-answer-reveal flow; right column is
              the tools panel. Four safeguards folded in by default:
              (1) sticky Your-task callout that stays visible while
              the user scrolls to type (David's long-scenario fix),
              (2) HardwareRef defaultOpen + NapkinCalc defaultClosed
              per Chip's consulted-vs-invoked distinction,
              (3) submit-gradient guard -- a Think-longer? confirm
              fires on (elapsed<15s && chars<50) reveals, and
              self-calibrates off once the user has demonstrated
              normal deliberation (Chip's prediction was the subtle
              headline risk), (4) a Stuck?-ask-the-Interviewer nudge
              below the textarea (Emma's scaffolding request).
              modelAnswerRef + scrollIntoView keeps the post-reveal
              comparison on-screen.

  1898fe8c9 - First visual exemplar: cloud Ring AllReduce on 4 ranks.
              SVG follows .claude/rules/svg-style.md, YAML wires the
              visual block and pairs it with a question asking for
              the full AllReduce time. AUTHORING.md documents when a
              visual earns its place, the workflow, accessibility
              requirements, and anti-patterns.

2026-04-24 16:11:24 -04:00

.github

…

.vale/styles/textbook

…

binder

…

book

…

instructors

…

interviews

feat(vault): add first visual-question exemplar + authoring guide

2026-04-24 16:10:54 -04:00

kits

…

labs

…

mlperf-edu

…

mlsysim

…

periodic-table

…

README

…

shared

…

site

…

slides

…

tinytorch

…

tools

…

wheels

…

.all-contributorsrc

…

.codespell-ignore-words.txt

…

.gitignore

…

.nojekyll

…

.pre-commit-config.yaml

…

.yamllint

…

CITATION.bib

…

CITATION.cff

…

CNAME

…

CODE_OF_CONDUCT.md

…

CONTRIBUTING.md

…

LICENSE.md

…

package-lock.json

…

package.json

…

pyproject.toml

…

README.md

…

requirements.txt

…

SECURITY.md

…

README.md

Machine Learning Systems

Principles and Practices of Engineering Artificially Intelligent Systems

English • 中文 • 日本語 • 한국어

📘 Textbook (current edition) • 📙 Vol I + Vol II (Summer 2026) • 🔥 TinyTorch • 🔮 MLSys·im (dev) • 💼 StaffML (dev) • 🌐 Ecosystem

📚 Hardcopy edition coming 2026 with MIT Press.

Mission

The world is rushing to build AI systems. It is not engineering them.

That gap is what we mean by AI engineering.

AI engineering is the discipline of building efficient, reliable, safe, and robust intelligent systems that operate in the real world, not just models in isolation. Our mission is to establish AI engineering as a foundational discipline alongside software engineering and computer engineering, by teaching how to design, build, and evaluate end-to-end intelligent systems.

Our goal: Help 100,000 learners master ML Systems this year, and reach 1 million by 2030.

Why One Repository

I designed this as a single integrated curriculum, not a collection of independent projects. The textbook teaches the theory. TinyTorch makes you build the internals. The hardware kits force you to confront real constraints. The simulator lets you reason about infrastructure you can't afford to rent. Each piece exists because I found that students who only read don't internalize, and students who only code don't generalize.

The repository is the curriculum.

A growing community of contributors helps improve every part of it: fixing errors, sharpening explanations, testing on new hardware. Their work makes this better for everyone, and I'm grateful for every pull request.

The Curriculum

Every component connects. The textbook gives you the mental models. The labs let you reason through trade-offs interactively, powered by MLSys·im — a modeling engine for infrastructure you can't physically access, and a standalone tool in its own right. TinyTorch makes you build the machinery yourself. The hardware kits put you face-to-face with real deployment constraints. StaffML tests whether you actually understand it. And the instructor hub, slides, and newsletter give educators everything they need to bring this into a classroom.

For Students

	Component	Role in the Curriculum	Link
📖	Textbook	Two-volume MIT Press textbook. The theory, the mental models, and the quantitative reasoning that everything else builds on.	Current edition · Vol I + II (Summer 2026)
🔬	Labs	Interactive Marimo notebooks where you explore trade-offs from the textbook: change a parameter, see what breaks, build intuition. Powered by MLSys·im under the hood.	Read more (dev)
🔥	Tiny🔥Torch	Build your own ML framework from scratch across 20 progressive modules. You don't understand a system until you've built one.	Get started
🛠️	Hardware Kits	Deploy ML to Arduino, Raspberry Pi, and Jetson. Real memory limits, real power budgets, real latency.	Browse labs
🔮	MLSys·im	Calculate memory bottlenecks, network saturation, and scheduling limits at infrastructure scales you can't physically access.	Read more (dev)
💼	StaffML	Physics-grounded interview questions for ML systems roles. Vault, practice drills, mock interviews, and progress tracking.	Coming soon (dev)

For Educators

	Component	What It Provides	Link
🎓	Instructor Hub	The AI Engineering Blueprint: two 12-week syllabi, pedagogy guide, assessment rubrics, and a TA handbook.	View hub
🎬	Lecture Slides	Beamer slide decks for every chapter, with four theme variants. Drop into your course and teach.	Browse decks (dev)
📬	Newsletter	Updates on the curriculum, new chapters, and what the community is building.	Subscribe

What You Will Learn

This textbook teaches you to think at the intersection of machine learning and systems engineering. Each chapter bridges algorithmic concepts with the infrastructure that makes them work in practice.

You know...		You will learn...
How to train a model	→	How training scales across GPU clusters
That quantization shrinks models	→	How INT8 math maps to silicon
What a transformer is	→	Why KV-cache dominates memory at inference
Models run on GPUs	→	How schedulers balance latency vs throughput
Edge devices have limits	→	How to co-design models and hardware

Book Structure

The textbook follows the Hennessy & Patterson pedagogical model across two volumes:

	Volume	Theme	Scope
📗	Volume I	Build, Optimize, Deploy	Single-machine ML systems (1–8 GPUs). Foundations, optimization, and deployment on one node.
📘	Volume II	Scale, Distribute, Govern	Distributed systems at production scale. Multi-machine infrastructure, fault tolerance, and governance.

Quick Start

①	Read the textbook. Start with the current edition. It's the foundation for everything else.
②	Pick a hands-on path. Build a framework (TinyTorch), explore trade-offs (Labs), or deploy to real hardware (Kits).
③	Test yourself. Drill StaffML: physics-grounded systems design questions across cloud, edge, mobile, and TinyML.
④	Teach it. Adopt the curriculum with the AI Engineering Blueprint and lecture slides.

Branch Guide

Note

You are on the dev branch. Active development happens here. For the last stable release, see the main branch.

	Branch	What's on it	Status
🟢	`main` mlsysbook.ai	Single-volume textbook (current edition)	Live — this is what readers see today.
🟡	`dev` ← you are here	Volume I — two-volume split (content complete, editorial polish) Volume II — At Scale (active development) Curriculum — TinyTorch, Kits, MLSys·im, Labs, StaffML	TinyTorch and Hardware Kits are live. MLSys·im, Labs, and StaffML are in development.

The two-volume split replaces the single-volume edition at launch.

Support This Work

Star the repo
Stars signal to universities and foundations that this work matters. They directly fund workshops and hardware kits for underserved classrooms.

100 → 1,000 → 10,000 → 100,000 → 1M learners by 2030

Fund the mission
All contributions go to Open Collective, a transparent fund for educational outreach. Every dollar goes to reaching more students.

Contributing

	I want to...	Go here
📖	Fix a typo or improve a chapter	Textbook contributing guide
🔥	Add a TinyTorch module or fix a bug	TinyTorch contributing guide
🛠️	Improve hardware labs	Hardware kits guide
🐛	Report an issue	GitHub Issues
💬	Ask a question	GitHub Discussions

Contributors

Thanks goes to these wonderful people who have contributed to making this resource better for everyone!

Legend: 🪲 Bug Hunter · 🧑‍💻 Code Contributor · ✍️ Doc Wizard · 🎨 Design Artist · 🧠 Idea Spark · 🔎 Code Reviewer · 🧪 Test Tinkerer · 🛠️ Tool Builder

📖 Textbook Contributors

_{Vijay Janapa Reddi} 🪲 🧑‍💻 🎨 ✍️ 🧠 🔎 🧪 🛠️	_{Marcelo Rovai} 🧑‍💻 🎨 🧪	_{Gabriel Amazonas} 🪲 ✍️ 🧠	_{Zeljko Hrcek} 🧑‍💻 ✍️	_{Tess Watt} 🪲 ✍️	_{Kai Kleinbard} 🧑‍💻 🛠️	_{Didier Durand} ✍️ 🪲
_{Jason Jabbour} ✍️	_{Ikechukwu Uchendu} ✍️	_{Naeem Khoshnevis} ✍️	_{Sara Khosravi} ✍️	_{Douwe den Blanken} ✍️	_{Jeffrey Ma} ✍️	_{shanzehbatool} ✍️
_Elias ✍️	_{Jared Ping} ✍️	_{Itai Shapira} ✍️	_{Maximilian Lam} ✍️	_{Jayson Lin} ✍️	_{Sophia Cho} ✍️	_Andrea ✍️
_{Alex Rodriguez} ✍️	_{Korneel Van den Berghe} ✍️	_Nimo ✍️	_{Colby Banbury} ✍️	_{Zishen Wan} ✍️	_{Mark Mazumder} ✍️	_{Abdulrahman Mahmoud} ✍️
_{Divya Amirtharaj} ✍️	_{Srivatsan Krishnan} ✍️	_marin-llobet ✍️	_{Aghyad Deeb} ✍️	_{Haoran Qiu} ✍️	_{Emil Njor} ✍️	_{ELSuitorHarvard} ✍️
_kaiM0ves ✍️	_oishib ✍️	_{Jared Ni} ✍️	_{Aditi Raju} ✍️	_{Michael Schnebly} ✍️	_{Thuong Duong} ✍️	_{Yu-Shun Hsiao} ✍️
_{Henry Bae} ✍️	_{Eimhin Laverty} ✍️	_{Jae-Won Chung} ✍️	_{Shvetank Prakash} ✍️	_{Marco Zennaro} ✍️	_{Arya Tschand} ✍️	_{Andrew Bass} ✍️
_{Pong Trairatvorakul} ✍️	_{Eura Nofshin} ✍️	_{Matthew Stewart} ✍️	_{Emeka Ezike} ✍️	_jianqingdu ✍️	_{Jennifer Zhou} ✍️	_{The Random DIY} ✍️
_{Fatima Shah} ✍️	_{Bruno Scaglione} ✍️	_Allen-Kuang ✍️	_{Tauno Erik} ✍️	_gnodipac886 ✍️	_{Sercan Aygün} ✍️	_{TheHiddenLayer} ✍️
_{Gauri Jain} ✍️	_{Fin Amin} ✍️	_{Alex Oesterling} ✍️	_{Abenezer Angamo} ✍️	_{Baldassarre Cesarano} ✍️	_{Jahnic Beck} ✍️	_{अरनव शुक्ला \| Arnav Shukla} ✍️
_Rin ✍️	_{Bilge Acun} ✍️	_{Andy Cheng} ✍️	_{Aritra Ghosh} ✍️	_{abigailswallow} ✍️	_{Yang Zhou} ✍️	_{JEON HYUNJUN(Luciano)} ✍️
_{Emmanuel Rassou} ✍️	_{Jason Yik} ✍️	_{Jessica Quaye} ✍️	_{Cursor Agent} ✍️	_{happyappledog} ✍️	_Snuggs ✍️	_{Sam Wilcock} ✍️
_{Shreya Johri} ✍️	_{Sonia Murthy} ✍️	_{Costin-Andrei Oncescu} ✍️	_{formlsysbookissue} ✍️	_{Annie Laurie Cook} ✍️	_{Parampreet Singh} ✍️	_{Vijay Edupuganti} ✍️
_{Jothi Ramaswamy} ✍️	_{Batur Arslan} ✍️	_{Curren Iyer} ✍️	_{Edward Jin} ✍️	_bluebaer7 ✍️	_yanjingl ✍️	_a-saraf ✍️
_songhan ✍️	_jvijay ✍️	_Zishen ✍️	_{Kristian Radoš} ✍️	_{Dang Truong} 🧑‍💻	_pipme ✍️	_{Salman Chishti} ✍️
_{Paolo Estavillo} ✍️	_GronuJ ✍️	_{Pratham Chaudhary} 🧑‍💻	_Octopus ✍️

🔥 TinyTorch Contributors

_{Vijay Janapa Reddi} 🪲 🧑‍💻 🎨 ✍️ 🧠 🔎 🧪 🛠️	_kai 🪲 🧑‍💻 🎨 ✍️ 🧪	_{Dang Truong} 🪲 🧑‍💻 ✍️ 🧪	_{Didier Durand} 🪲 🧑‍💻 ✍️	_rnjema 🧑‍💻 ✍️ 🛠️	_{Pratham Chaudhary} 🪲 🧑‍💻 ✍️	_{Farhan Asghar} 🪲 🧑‍💻 ✍️
_Rocky 🧑‍💻 ✍️ 🧪	_{Karthik Dani} 🪲 🧑‍💻	_{Avik De} 🪲 🧪	_Takosaga 🪲 ✍️	_joeswagson 🧑‍💻 🛠️	_{AndreaMattiaGaravagno} 🧑‍💻 ✍️	_Rolds 🪲 🧑‍💻
_asgalon 🧑‍💻 ✍️	_{Amir Alasady} 🪲	_jettythek 🧑‍💻	_wzz 🪲	_{Ng Bo Lin} ✍️	_keo-dara 🪲	_{Wayne Norman} 🪲
_{Ilham Rafiqin} 🪲	_{Oscar Flores} ✍️	_harishb00a ✍️	_{Pastor Soto} ✍️	_{Salman Chishti} 🧑‍💻	_{Aditya Mulik} ✍️	_{Ademola Arigbabuwo} ✍️
_{Yaroslav Halchenko} 🧑‍💻	_Harish ✍️