24 Commits

Author SHA1 Message Date
Vijay Janapa Reddi
0b782b4c0f fix(mlsysim): publish tutorial slides alongside the docs site
The mlsysim docs hero linked "Slide Decks" to mlsysbook.ai/slides/, the
textbook teaching site, which has no relationship to the MLSys·im ISCA
tutorial. Wire the tutorial decks into the same publish path the paper
uses so the link resolves to mlsysim's own slides.

- mlsysim-build-pdfs.yml: split into build-paper / build-slides; new
  job installs xelatex + JetBrains Mono and uploads MLSYSIM-Slides
- mlsysim-publish-live.yml, mlsysim-preview-dev.yml: download
  MLSYSIM-Slides and inject tutorial_part{1,2}.pdf into MLSYSIM_DOCS
- mlsysim-update-pdfs.yml: redeploy slide PDFs in PDF-only hot-fix path
- mlsysim/tutorial/Makefile: build both decks (was part1 only) with
  proper target tracking
- mlsysim/docs/slides.qmd: new landing page with download buttons and
  a pointer disambiguating the textbook lecture decks
- mlsysim/docs/index.qmd: hero CTA now targets slides.qmd
- mlsysim/docs/config/_quarto-html.yml: register slide PDFs as
  resources and add navbar entry
2026-04-27 17:27:48 -04:00
Vijay Janapa Reddi
1eb30f5f86 fix(mlsysim): harden release QA and paper artifacts
Align the MLSys·im code, docs, paper, website, workflows, and lab wheel for the 0.1.1 release. This also fixes runtime/API issues found during release review and prepares the paper PDF plus archive package.
2026-04-25 10:06:01 -04:00
Vijay Janapa Reddi
0745a5fb73 mlsysim: align package identity copy with paper title
Four user-facing identity statements referenced an earlier working title
("A Composable Analytical Framework for Machine Learning Systems") that
no longer matches the actual paper title fixed in 0.1.1 ("MLSys·im:
First-Principles Infrastructure Modeling for Machine Learning Systems").
Align each identity-claiming statement to the paper title.

This covers user-facing *name-claims* only — the places where mlsysim
describes itself. Descriptive uses of "analytical framework" as a
technical category inside the paper and related technical prose are
retained (they situate mlsysim among other analytical tools like
Paleo, Calculon, Vidur; those uses are legitimate).

- mlsysim/pyproject.toml          : project description
- mlsysim/mlsysim/cli/main.py     : `mlsysim --help` text
- mlsysim/docs/tutorials/index.qmd: tutorial landing blurb
- mlsysim/tutorial/prerequisites.md: prerequisites preamble
2026-04-24 15:59:03 -04:00
Vijay Janapa Reddi
285d1e57e6 fix(tutorial): remove ISCA 2026 references — use generic 'Tutorial' label 2026-04-02 15:15:11 -04:00
Vijay Janapa Reddi
0803b97f88 feat(tutorial): upgrade to real beamerthememlsys — branded header, footer, fonts
Replaced bare Metropolis with the mlsys Beamer theme used by the
textbook lecture slides. Both decks now have:
- Crimson section navigation bar in header
- Branded footer (book logo + Harvard logo + page numbers)
- Helvetica Neue body font + JetBrains Mono/Courier code font
- Section divider pages with progress indicator
- \mlsysfocus for key equation slides

Part 1: 90 pages, 0 errors. Part 2: 47 pages, 0 errors.
2026-04-02 15:15:11 -04:00
Vijay Janapa Reddi
c02b98d333 fix(tutorial): slides compile cleanly — Part 1 (90pp) + Part 2 (47pp)
Part 1: fixed \predict→\PredictStart rename, added [fragile] to 13
frames, added xcolor[table] package. Zero errors, 90 pages.

Part 2: added standalone preamble+document wrapper, fixed pareto
figure path, added [fragile] to non-fragile lstlisting frames.
Zero errors, 47 pages.

Total: 137 pages of compiled Beamer slides for ISCA tutorial.
2026-04-02 15:15:11 -04:00
Vijay Janapa Reddi
c0f46fd04d fix(tutorial): minor Part 2 slide adjustments from simulation feedback 2026-04-02 07:36:16 -04:00
Vijay Janapa Reddi
1b66680aca docs(tutorial): simulation round 2 — NPS improved from -10/+5 to +10/+20
Eta explanation, AllReduce numbers-first, and compression fleet economics
all validated by re-simulated personas. 2 of 5 issues fully resolved,
2 partially resolved (timing still tight), 1 unaddressed (TinyML depth).
Multi-vendor Roofline is the single biggest NPS driver.
2026-04-02 07:34:25 -04:00
Vijay Janapa Reddi
8928a07ca8 fix(tutorial): simulation round 1 fixes — eta explanation, AllReduce numbers-first, compression fleet
1. Added 'Understanding eta' slide with table and CPI analogy (before first demo)
2. Added concrete AllReduce example with numbers BEFORE formula
3. Added 'Compression Changes Fleet Architecture' slide with cost table
Addresses top 3 issues from simulation feedback round 1.
2026-04-02 07:30:43 -04:00
Vijay Janapa Reddi
b7bf7a4ce5 docs(tutorial): simulated Q&A — 8 tough questions + 3 hallway conversations
Answer quality ranges 5-9/10. Weakest: MoE support (6/10), diffusion
models (5/10), TinyML depth (6/10) — all honest v0.2.0 gaps.
Strongest: spreadsheet comparison (7/10), inverse Roofline value (9/10),
CPI analogy for efficiency (9/10).
3 hallway conversations simulate real adoption decision dynamics.
2026-04-02 07:30:04 -04:00
Vijay Janapa Reddi
9955a76b92 feat(staffml): deep verification + mock NeurIPS reviews + paper improvements
Deep verification: 237-question stratified sample, 4.2% error rate found.
All 10 errors fixed (unit confusion, arithmetic, conceptual misapplication).
96 physics violations removed (impossible topic×track pairs).
Extended invariant checks added (applicability matrix enforcement).

Paper improvements from mock NeurIPS review feedback:
- Bloom critique softened ("complements" not "departs from")
- LLM generation transparency (95% ratio + 4.2% error rate disclosed)
- Scope explicitly limited to technical systems reasoning
- H100 specs corrected (989 TFLOPS, not 495)
- Track percentages reference table instead of hardcoding
- Figure captions use macros for consistency

New topics with questions: software-portability (50), comm-compute-overlap (50).
Phase metadata reclassified (42.5% inference, 37.7% both, 19.9% training).
2026-04-02 07:28:41 -04:00
Vijay Janapa Reddi
5c7e8e8fc5 docs(tutorial): simulation round 1 — 5 attendee personas, NPS -10 to +5
Key findings: efficiency never explained (fix needed), compression too
rushed (15 min), AllReduce needs numbers-before-formulas, TinyML feels
tangential. Inverse Roofline is the surprise hit. AMD engineer caught
a wrong MI300X bandwidth number.
2026-04-02 07:28:17 -04:00
Vijay Janapa Reddi
c7bd160e15 feat(tutorial): multi-vendor examples + slide polish for A+ quality
- Changed A100-vs-H100 comparisons to H100-vs-MI300X-vs-Gaudi3 three-way
- Added Hardware Zoo slide showing all 22 platforms across 6 vendors
- Added timing markers [N min] to speaker notes throughout
- Added "turn to your neighbor" discussion prompts after predict slides
- Added roadmap "You Are Here" slides after each break
- Stripped code boilerplate (import only shown once, aliases for later demos)
- Added figure references for new SVGs (multi-vendor-roofline, parallelism-3d,
  memory-hierarchy, pareto-front)
- Updated related_work.tex comparison table to highlight vendor breadth
2026-04-02 07:23:12 -04:00
Vijay Janapa Reddi
1faec33ae5 feat(tutorial): 4 new ISCA-quality SVG figures for multi-vendor + advanced topics
- multi-vendor-roofline.svg: H100, MI300X, Gaudi 3, TPU v5p, A100 on same plot
- parallelism-3d.svg: 16-GPU TP×PP×DP grid with communication labels
- memory-hierarchy.svg: HBM→L2→SRAM→Registers pyramid with annotations
- pareto-front.svg: Latency vs cost with Pareto frontier and SLA boundary

Total: 10 publication-quality SVG figures for the tutorial deck.
All follow style guide: 900×500 viewBox, semantic colors, 10px+ fonts.
2026-04-02 07:17:12 -04:00
Vijay Janapa Reddi
c7250e32df fix(tutorial): P0 slide blockers — hook, related work, figures, predict, transfer
1. Added $200M opening hook (2 slides before learning objectives)
2. Wired \input{related_work} between Part 0 and Part 1
3. Replaced figure placeholders with real SVG PDFs (carbon, hardware)
4. Added predict-then-reveal prompts before Aha #3 (256 GPU scaling)
   and Aha #4 (INT4 fleet halving)
5. Added "Name That Wall" personal transfer moment before closing

Addresses all 5 P0 blockers from Patterson's presentation review.
2026-04-01 23:27:06 -04:00
Vijay Janapa Reddi
8c6738772e fix(tutorial): bump SVG font sizes for ISCA projection readability
All font-size values < 10px bumped to 9.5-11px minimum across all 6
figures. At 900x500 viewBox on a 16:9 projection, this ensures text
is readable from the back of a 300-person room.
2026-04-01 23:24:46 -04:00
Vijay Janapa Reddi
f62dc8cca2 docs(tutorial): instructor quick-start guide — 15 min adoption path
Covers: install, first demo, first homework, semester plan, auto-grading
hints, and material inventory. Designed to minimize instructor adoption
friction for ML systems courses.
2026-04-01 23:19:21 -04:00
Vijay Janapa Reddi
0488d01c79 feat(tutorial): add 38 code listings + figure refs to Part 2 slides
Parts 5-9 were lecture-only (0 code blocks). Now every section has
live demo code using real mlsysim API: EconomicsModel, SustainabilityModel,
DSE search, Engine.sweep, ESP32/nRF52840/H100 comparison, Pipeline
composition, SensitivitySolver, capstone starter code.

Figure placeholders added for carbon-geography and hardware-spectrum SVGs.
2026-04-01 19:23:31 -04:00
Vijay Janapa Reddi
42dc31a202 feat(tutorial): 6 publication-quality SVG figures for ISCA slides
- roofline-model.svg: Classic Roofline with LLM decode + CNN training points
- iron-law-decomposition.svg: Iron Law equation with wall-to-term mapping
- serving-two-phases.svg: Prefill (compute) vs Decode (memory) phases
- allreduce-ring.svg: 8-GPU ring with reduce-scatter + all-gather
- hardware-spectrum.svg: nRF52840 → ESP32 → Jetson → H100 → NVL72 scale
- carbon-geography.svg: Norway/Quebec/US/Poland bar chart (41x gap)

All follow svg-style.md: 900x500 viewBox, semantic colors, Helvetica font.
2026-04-01 19:22:20 -04:00
Vijay Janapa Reddi
d42e075b66 build(tutorial): add Makefile for SVG→PDF + xelatex slide build pipeline
Supports inkscape and rsvg-convert for SVG conversion.
Targets: make (all), make svgs, make slides, make check, make clean.
Build output goes to _build/ (gitignored).
2026-04-01 19:19:35 -04:00
Vijay Janapa Reddi
e24a5a2d9e feat(tutorial): pilot study protocol for pre/post quiz research data
Within-subjects design (N≥30), paired t-test analysis plan, IRB
considerations, expected effect size (d=0.8), and timeline for
running at ISCA 2026. Produces publishable data for SIGCSE/L@S.
2026-04-01 19:14:49 -04:00
Vijay Janapa Reddi
016089f966 feat(tutorial): add 8 related-work slides — positioning, landscape, CPI analogy
Comparison table of 7 tools (ASTRA-sim, Calculon, DeepSpeed, LLMPerf, etc.).
Fidelity-speed spectrum TikZ diagram. Honest 'What We Don't Do' slide.
Iron Law wall-to-term mapping. CPI analogy for the efficiency parameter.
22 walls at a glance compact table. Designed for \input{} into main deck.
2026-04-01 19:14:33 -04:00
Vijay Janapa Reddi
b61f632b71 feat(tutorial): pre/post assessment quiz — 10 questions mapping to 6 understanding goals
Designed for dual use: tutorial engagement + publishable research data.
Tests transfer (apply framework to unfamiliar systems), not recall.
Includes distractor analysis and scoring rubric.
2026-04-01 19:13:27 -04:00
Vijay Janapa Reddi
24ca3b0ef9 feat(mlsysim): complete ISCA tutorial package — backward design + 94 slides
Full-day tutorial package for ISCA 2026:

Tutorial Design (DESIGN.md, 639 lines):
- Backward design with 4 transfer goals, 6 enduring understandings
- 5 designed "aha moments" with predict-then-reveal structure
- Hour-by-hour schedule with 40% hands-on minimum
- Capstone: multi-region fleet design under budget/latency/carbon constraints
- Facilitation notes, energy management, common pitfalls

Slides (3,081 lines LaTeX):
- Parts 0-4: Welcome, Iron Law, Serving, Compression, Distributed
- Parts 5-9: Economics, DSE, TinyML, Advanced, Wrap-up
- ~94 slides with live demos using real mlsysim API calls
- Speaker notes on every exercise slide

Supporting Materials:
- exercises.md: 8 hands-on exercises with expected answers
- cheatsheet.md: single-page reference card (Iron Law + top equations)
- prerequisites.md: setup instructions + troubleshooting FAQ
2026-04-01 19:02:23 -04:00