cs249r_book

mirror of https://github.com/harvard-edge/cs249r_book.git synced 2026-03-11 17:49:25 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	533cfa6e99	fix: pre-commit hooks — all 48 checks now pass - book/quarto/mlsys/__init__.py: add repo-root sys.path injection so mlsysim is importable when scripts run from book/quarto/ context - book/quarto/mlsys/{constants,formulas,formatting,hardware}.py: new compatibility shims that re-export from mlsysim.core.* and mlsysim.fmt - mlsysim/viz/__init__.py: remove try/except for dashboard import; use explicit "import from mlsysim.viz.dashboard" pattern instead - .codespell-ignore-words.txt: add "covert" (legitimate security term) - book/tools/scripts/reference_check_log.txt: delete generated artifact - Various QMD, bib, md files: auto-formatted by pre-commit hooks (trailing whitespace, bibtex-tidy, pipe table alignment)	2026-03-01 17:30:24 -05:00
Vijay Janapa Reddi	2ce322def1	LEGO updates , call out updates	2026-02-28 11:47:42 -05:00
Vijay Janapa Reddi	c69b6ab2d1	Add book tools (agent personas, check_figure_div_syntax)	2026-02-26 15:23:19 -05:00
Vijay Janapa Reddi	ad6229a899	Adds options for targeted reference validation Introduces `--only-from-report` and `--only-keys` arguments to the `references` validation command. These allow re-validating only specific citation keys, either from a previous validation report or a custom list. This significantly improves the workflow for correcting references by enabling focused re-runs and reducing validation time. Removes the standalone `README_REFERENCE_CHECK.md` documentation, as its content is now implicitly handled by the integrated CLI help and broader documentation.	2026-02-25 07:48:18 -05:00
Vijay Janapa Reddi	fdfd91bf03	refactor: sharpen System layer naming to reflect integrated Platforms (Hubs, Nodes, Phones) rather than hardware modules	2026-02-24 21:05:44 -05:00
Vijay Janapa Reddi	c57db2c2d6	feat: establish the 'Engineering Crux' hierarchy (Hardware -> Models -> Systems -> Scenarios) as the foundational framework for the curriculum	2026-02-24 20:26:22 -05:00
Vijay Janapa Reddi	56e091f7e0	feat: standardize System Archetypes in Vol 1 and Vol 2; add canonical roster table to Introductions; ensure tight math-prose integration	2026-02-24 20:23:50 -05:00
Vijay Janapa Reddi	e881d92625	refactor: introduce System Archetypes in mlsys/systems.py and integrate into Introduction and Serving chapters; verify math integrity and rationale for LEGO blocks	2026-02-24 19:12:51 -05:00
Vijay Janapa Reddi	9e809d21c4	feat: full-stack Pint robustness and class-based namespace isolation Python library (mlsys/): - constants.py: add ureg.default_format, set_application_registry, MS alias comment - formatting.py: isinstance checks, add fmt_full(), fmt_split(), .m_as() modernization - formulas.py: fleet formulas return Quantity, @ureg.check() decorators, .m_as() everywhere - hardware.py: dimension-first validation in __post_init__, Quantity[float] annotations - models.py: __post_init__ dimension checks, size_in_bytes() enforcement, ureg.count→ureg.param - test_units.py: +50 robustness tests (wrong-unit HardwareSpec, fleet formulas, fmt_full) - validate_pint_usage.py: new static analysis script for Pint anti-patterns in QMD files - transform_pico_cells.py: transformation script for PICO cell restructuring QMD chapters (Vol1 + Vol2 — all 43 chapters with Python cells): - Wrapped all Python compute cells in class-based namespace isolation (PICO pattern) - Added EXPORTS bridges so class-internal values are accessible to prose inline Python - Modernized .to(unit).magnitude → .m_as(unit) throughout - Removed bare .magnitude calls; all unit extractions now explicit - Fleet appendices (appendix_fleet, appendix_communication, appendix_reliability): full Quantity-return cascade for MTBF, AllReduce, Young-Daly, checkpoint formulas All 43 chapters verified building cleanly (HTML) after changes.	2026-02-21 14:33:36 -05:00
Vijay Janapa Reddi	62b98edee1	Updates book content and configuration Refines book abstracts, table of contents, and diagram configurations for improved clarity and structure. This commit enhances the descriptions of both Volume I and Volume II, emphasizing their respective focuses. It also introduces a framework decision tree to guide the selection of parallel training strategies and inference frameworks, and diagrams for visualizing hardware constraints.	2026-02-21 08:19:01 -05:00
Vijay Janapa Reddi	09602445de	chore: update book content, config, appendices, and tooling - Vol1: chapter updates across backmatter, benchmarking, data, frameworks, etc. - Vol2: content updates, new appendices (assumptions, communication, fleet, reliability) - Quarto: config, styles, formulas, constants - Add SEMINAL_PAPERS_V2.md, learning_objectives_bolding_parallel.sh - VSCode extension: package.json, chapterNavigatorProvider - Landing page and docs updates	2026-02-20 18:55:24 -05:00
Vijay Janapa Reddi	739b48622f	Add war story callout with proper icon formats and supporting files - Add war story callout definition in custom-numbered-blocks.yml - Create war story icon in all three formats (SVG, PNG, PDF) matching the 64x64 stroke-only style used by all other callout icons - Add war story bibliography and PDF config entry - Add first war story ("The Quadratic Wall") in nn_architectures - Include icon conversion utility script	2026-02-19 07:38:16 -05:00
Vijay Janapa Reddi	73a956a09b	chore(volumes,vscode-ext): batch volume updates and tooling improvements Checkpoint the branch-wide content/config revisions together with workbench enhancements so chapter rendering and developer workflows stay aligned. This captures the current validation-driven formatting and parallel build/debug improvements in one commit.	2026-02-15 14:03:27 -05:00
Vijay Janapa Reddi	1db2dacfe7	style(vol2): fix lowercase x multiplication notation across all chapters Convert all remaining lowercase 'x' used as multiplication (e.g., "1000x faster") to $\times$ across 17 vol2 chapters. These were flagged by the new lowercase_x_multiplication validator check. Simplifies the validator regex from a fragile word-list approach to a broader pattern matching digit-x-lowercase (e.g., \dx\s+[a-z]) which naturally excludes hardware counts (8x A100) and hex literals (0x61). Includes the conversion script in _archive.	2026-02-15 11:53:51 -05:00
Vijay Janapa Reddi	5d68f0a2e0	style: standardize multiplication notation to $\times$ across all chapters Convert all Unicode × (U+00D7) to LaTeX $\times$ in prose, tables, and math contexts across both volumes. Unicode × is preserved only inside fig-alt text for accessibility screen readers. One instance inside a plain markdown backtick code span (frameworks.qmd) was reverted to Unicode × since LaTeX doesn't render in code spans. Updates validate.py with a new lowercase-x-as-multiplication check and refines the latex_adjacent warning to distinguish _str variables (safe) from raw inline Python. Updates validate_inline_refs.py comments to reflect the new convention. Includes the conversion script in _archive.	2026-02-15 11:43:45 -05:00
Vijay Janapa Reddi	c9d21b768b	feat(binder): add `render plots` command for matplotlib figure gallery Integrate figure rendering into the binder CLI so plots can be previewed without a full Quarto build. Extracts Python code blocks with fig-* labels from QMD files, renders them to PNG, and outputs a browsable gallery at _output/plots/<chapter>/. Also fixes the package import chain so `binder` works correctly as an installed entry point. - Add book/cli/commands/render.py with RenderCommand class - Wire into main.py with help table entry and command dispatch - Add matplotlib>=3.7.0 to pyproject.toml dependencies - Add book/quarto/_output/ to .gitignore - Archive standalone render_figures.py to _archive/	2026-02-14 12:43:23 -05:00
Vijay Janapa Reddi	e3cc9f7af3	refactor: rename ml_ml_workflow files, consolidate CLI, and clean up scripts Remove redundant ml_ prefix from ml_workflow chapter files and update all Quarto config references. Consolidate custom scripts into native binder subcommands and archive obsolete tooling.	2026-02-13 11:06:28 -05:00
Vijay Janapa Reddi	2390c3ab31	Refactor: consolidate Quarto config layers and content reorganization. Unifies Quarto metadata into shared base/format/volume fragments while carrying through chapter path, asset, and tooling updates to keep the repository consistent and easier to maintain.	2026-02-12 15:38:55 -05:00
Vijay Janapa Reddi	d9cb03cf38	Refactor: Systematic Goal/Show/How header audit for Volume 1 - Completed full standardization of 150+ calculation headers across all 16 Volume 1 chapters. - Replaced legacy 'Why:' blocks with the 'Goal/Show/How' documentation pattern. - Finalized P.I.C.O. class refactors for complex cells in frameworks and serving. - Verified header consistency across introduction, ml_systems, training, and optimizations. - Performed minor stabilization in book/vscode-ext logic.	2026-02-11 21:33:27 -05:00
Vijay Janapa Reddi	ce68808185	Fix: make pipe table prettifier apply visible alignment changes Treat internal spacing changes as real formatting differences and normalize separator padding so table prettification is applied consistently. Save files before running pre-commit fixers from the extension so results match editor state.	2026-02-11 18:46:18 -05:00
Vijay Janapa Reddi	ff3797a1d8	Refactor: Finalize Volume 1 and update CLI/VSCode tooling - Completed full Volume 1 refactor to Safe Class Namespace pattern. - Fixed render errors and verified all 16 chapters. - Updated 'binder' CLI with native validation and maintenance namespaces. - Enhanced VS Code extension with Chapter Navigator and Run History. - Integrated 'binder validate' into pre-commit workflows.	2026-02-11 09:25:50 -05:00
Vijay Janapa Reddi	89a7b5305c	Refactor: Port Chapter 2 (ML Systems) to mlsys Objects - Refactored to use , , and registries. - Expanded with devices (Coral, Jetson, NUC) and . - Expanded with (DLRM) and family. - Fixed registry bugs (Coral bandwidth, GPT class definition). - All unit tests passing.	2026-02-10 17:19:34 -05:00
Vijay Janapa Reddi	f136f96fd8	Refactor: Rename library to and introduce Digital Twins Renamed the core physics engine from to to better align with the book's title and pedagogical goals. New features: - Introduced and classes in and . - Digital twins now carry their own attributes (bandwidth, FLOPS) and methods (ridge_point, memory_size). - Global search and replace updated all 16 chapters to use . - Maintained backward compatibility for legacy constants during migration.	2026-02-10 16:46:37 -05:00
Vijay Janapa Reddi	83ce92624e	Editorial Corrections & Code Hardening (Volume 1) This commit refactors the underlying Python calculation cells for Chapters 1-16 to strictly enforce mathematical consistency with the narrative. Key Text/Numeric Updates (For Editorial Review): 1. Chapter 3 (Workflow) - Edge Necessity Scenario: - Change: Increased clinic patient count from 100 to 150. - Reason: With 100 patients, the calculated upload time was ~5.5 hours, which fits within the 8-hour clinic day, contradicting the chapter's conclusion that 'Edge is Mandatory.' Increasing to 150 pushes upload time to >8 hours, mathematically validating the narrative. 2. Chapter 1 (Introduction) - Model Drift Scenario: - Change: Reduced monthly accuracy drift rate from 8.0% to 0.8%. - Reason: An 8% monthly drop is a catastrophic failure that would be immediately noticed. A 0.8% drop correctly models the 'silent failure' (boiling frog) scenario described in the text. 3. Chapter 3 (Workflow) - Velocity vs Quality: - Change: Reduced 'Large Model' accuracy gain per iteration from 0.5% to 0.15%. - Reason: The original rate caused the large model to hit 99% accuracy almost instantly, invalidating the 'Velocity is a Feature' argument. The new rate correctly models diminishing returns, allowing the faster (small) model to win. 4. Chapter 15 (Responsible Engineering) - TCO Analysis: - Verification: Verified and stabilized the 3-year Total Cost of Ownership (TCO) calculations. Confirmed that Inference TCO (.5M) dominates Training TCO (8K) by ~40x, supporting the 'Efficiency as Responsibility' thesis. Technical Changes (Code Only): - Refactored all calculation cells to use the P.I.C.O. (Parameters, Invariants, Calculation, Outputs) design pattern. - Added assertion guards (Invariants) to prevent future regressions where math contradicts prose. - Fixed variable scope issues in Chapter 10 (Model Compression) and Chapter 15. - Disabled false-positive linter warnings for standard LaTeX spacing.	2026-02-10 14:59:26 -05:00
Vijay Janapa Reddi	9d840fc1a0	Merge branch 'dev' into feature/book-volumes	2026-02-10 13:12:34 -05:00
Vijay Janapa Reddi	4dd1bf70aa	Fix pre-commit issues: cross-refs, footnotes, unreferenced tables, SVG hook - Fix broken cross-refs in training.qmd (em-dash parsed as part of ID) - Remove footnote from table cell in ml_systems.qmd - Add @tbl- references for 22 unreferenced tables across 5 files - Comment out stale SVG prevention hook in pre-commit config - Auto-fixes from bibtex-tidy, blank-line collapse, pipe-table prettify	2026-02-09 07:57:16 -05:00
Vijay Janapa Reddi	b43f8e887d	Editorial pass across Vol 1 chapters and config - Revise prose, index entries, and references across all 16 chapters - Update quarto config (epub/html) for vol1 and vol2 - Add render_figures.py testing script - Update inject_parts.lua filter and header-includes.tex - Update notation, appendix_dam, and references.bib - Remove vol1/vol2 foreword files	2026-02-08 17:18:42 -05:00
Vijay Janapa Reddi	4f9409380d	updates	2026-02-08 14:01:59 -05:00
Vijay Janapa Reddi	3173b403d6	first pass	2026-02-08 11:11:44 -05:00
Vijay Janapa Reddi	51e720714d	Add D·A·M Venn diagram and widen ML stack data bar connectors - Widen gap between ML systems stack layers and data bar in both margin and full-size variants so connector lines are clearly visible - Add D·A·M Intersection Landscape Venn diagram (SVG + PNG) to the D·A·M Taxonomy appendix with reference table and explanatory prose - Add forward references to the Venn from Introduction and Part III - Add learning objective for intersection mapping - Chapter prose refinements across multiple chapters	2026-02-08 09:30:47 -05:00
Vijay Janapa Reddi	3dbaa04ebf	fix: resolve all pre-commit hook failures across Vol 1 and Vol 2 Content fixes: - Add references for all 8 appendix_machine tables in surrounding prose - Remove cross-volume refs (@sec-distributed-training, @sec-security-privacy) and replace with self-contained prose - Fix broken cross-refs (em-dashes, @sec-data-engineering → @sec-data-engineering-ml) - Fix unreferenced equations (@eq-memory-wall, @eq-training-iron-law) - Fix nested/forbidden footnotes (hw_acceleration, introduction, dl_primer) - Fix drop cap incompatibility in conclusion.qmd - Fix codespell false positive ("trough" added to ignore list) - Add closer @tbl/@fig references near definitions across all chapters - Replace inline fmt() calls with pre-computed _str variables (dl_primer) Checker improvements: - figure_table_flow_audit.py: exclude code block lines from gap calculation, add forward-reference tolerance, broaden code block detection to all fenced blocks (tikz, etc.) - check_render_patterns.py: improve $...$ parsing with shortest-match spans, add exponent exception for {python} in ^{...}, exit 0 on warnings-only	2026-02-08 02:01:49 -05:00
Vijay Janapa Reddi	4ae406160d	feat: add Quarto equation labels and cross-references across Vol 1 Add proper equation labels ({#eq-...}) and prose references (@eq-...) to 138 equations across 15 Volume 1 chapters following the gold-standard pattern from serving.qmd. Key changes: - Label all display math equations with {#eq-kebab-case-name} - Add @eq-name references in prose before each equation - Equations include: Iron Law, Amdahl's Law, Roofline Model, activation functions, backpropagation, attention mechanisms, queuing theory, quantization, and system throughput formulas Also includes: - PDF formatting improvements (newpage directives for Vol 2) - LaTeX header updates for chapter styling - Pre-commit config and validation script updates	2026-02-07 09:40:01 -05:00
Vijay Janapa Reddi	3d54da6305	fix: resolve inline Python build errors across Vol 1 chapters Fix NameError build failures in ml_systems, data_engineering, and benchmarking chapters caused by missing imports and variables referenced before their defining code cells. - ml_systems: add missing Kparam and Bparam imports from physx.constants - data_engineering: compute transfer_time_10g_md preview in setup cell, add md_math import, add deduplication-dividend-calc cell, convert hardcoded values to physics engine units - benchmarking: compute BERT roofline preview values in roofline-example-calc cell before they are referenced in narrative text, convert hardcoded values to inline Python, condense redundant footnotes Also includes physics engine integration improvements across all Vol 1 chapters: unit-safe conversions, inline Python for previously hardcoded values, streamlined footnotes with cross-references, and new content validation scripts. All 21 Vol 1 chapters pass PDF build tests.	2026-02-06 09:57:25 -05:00
Vijay Janapa Reddi	e942b552ba	fix: resolve cross-reference issues and add missing table/figure refs - Update check_unreferenced_labels.py to detect YAML id: frontmatter - Add references to all unreferenced tables and listings in Vol1 - Scope unreferenced labels hook to Vol1 only (Vol2 has WIP chapters) - Fix inline Python in LaTeX math blocks across multiple chapters - Update test_units.py to use Dense (not Sparse) H100 FLOPS values - Update validate_inline_refs.py regex to ignore escaped dollar signs Key files fixed: - appendix_algorithm.qmd: @tbl-tensor-op-ref, @fig-broadcasting-rules - appendix_data.qmd: @tbl-data-gravity, @tbl-serialization-cost - appendix_dam.qmd: @tbl-dam-overlap, @tbl-bottleneck-actions, etc. - appendix_machine.qmd: @tbl-latency-hierarchy, @tbl-hardware-cheatsheet - frameworks.qmd: @lst-gradient-accumulation, @lst-custom-autograd-function - dnn_architectures.qmd: @lst-conv_layer_spatial	2026-02-06 06:03:19 -05:00
Vijay Janapa Reddi	668cc25030	refactor: inline QMD plots and slim viz helpers Move remaining plot logic into QMD blocks and keep physx/viz styling-only. Update preview scripts to use local plot code.	2026-02-04 16:34:31 -05:00
Vijay Janapa Reddi	05a184459d	Refactors code to use constants and formulas Replaces hardcoded numerical values with symbolic Python variables derived from defined constants and formulas. This improves code maintainability and consistency, ensuring calculations are based on accurate and up-to-date physical values.	2026-02-03 19:48:11 -05:00
Vijay Janapa Reddi	8bdaf339ae	Standardizes copyedit PDF font size Ensures copyedit PDFs use the same font size as regular PDFs. Increases float capacity for copy-edit mode to accommodate double-spacing and prevent potential layout issues. Copies the base configuration file by first removing any existing symbolic link to prevent errors.	2026-02-03 07:14:37 -05:00
Vijay Janapa Reddi	a6e0c81380	Update vol1 chapters and add compilation continuum visualizations	2026-02-02 13:28:35 -05:00
Vijay Janapa Reddi	7c0d3e401e	Fix index placement issues and add auto-fix script - Fix \index{} commands breaking rendering when placed before footnote definitions, div openers (:::), or on same line as headings - Add check_index_placement.py script with --fix flag to automatically detect and fix these patterns - Update training.qmd and data_engineering.qmd with corrected index placement - Include other pending content and visualization updates	2026-02-02 10:39:04 -05:00
Vijay Janapa Reddi	17ebb4f734	chore: remove testing logs from tracking	2026-02-02 06:51:35 -05:00
Vijay Janapa Reddi	baf420b0ce	fix: resolve pre-commit hook issues and add figure accessibility - Fix footnotes incorrectly breaking into definition text (appendix_machine, appendix_algorithm) - Move forbidden footnotes out of callout divs (hw_acceleration, serving, training) - Convert inline footnote info to inline text where appropriate - Add fig-alt text to code-cell figures (dl_primer, hw_acceleration, serving, workflow) - Add fig-cap and fig-alt to TikZ figures (distributed_training, introduction, edge_intelligence, networking, sustainable_ai) - Fix hook ordering: Black formatter now runs before blank line collapse - Move footnote definition outside TikZ div in frameworks.qmd	2026-02-02 06:49:30 -05:00
Vijay Janapa Reddi	42bc152f7d	Fixes pgfplots dimension overflow in data_selection chapter - Fixes fig-amortization-comparison: scales Y-axis values from 12000 to 12 to avoid LaTeX dimension limit (~16383pt) - Fixes fig-compute-optimal-frontier: replaces problematic \fill...plot with proper \addplot[fill=...] \closedcycle for log-scale coordinates - Updates figure reference text to use @fig-selection-inequality - Adds ch_data_selection.py calculation module - Updates viz.py with new plot functions - Various chapter updates across vol1 and vol2	2026-02-02 06:20:23 -05:00
Vijay Janapa Reddi	25d965e719	Fix inline Python rendering and add sci() base unit conversion Key changes: - sci() and sci_latex() now convert Pint quantities to base units (fixes 10^2 showing instead of 10^14 for TFLOPs values) - Add md_frac(), md_sci(), md_math() helpers for LaTeX in Markdown() - Update ml_systems.qmd with proper LaTeX fraction rendering - Add freeze: false to _quarto.yml to prevent caching issues - Update CLAUDE.md with QMD inline Python conventions - Fix LATEX_ADJACENT issues across multiple QMD files (Unicode symbols)	2026-02-02 01:18:32 -05:00
Vijay Janapa Reddi	a75e8b80e5	Update book chapters and clean up testing artifacts - Update all vol1 and vol2 chapter content with formatting improvements - Add pre-commit hooks for additional validation checks - Remove obsolete testing artifacts (appendix_dam, appendix_data, dl_primer, glossary) - Add new testing logs for vol2 chapters and appendix_assumptions/notation - Add utility scripts for table rendering checks and prettification - Remove deprecated hw_acceleration.rmarkdown file	2026-02-01 23:28:30 -05:00
Vijay Janapa Reddi	8578982175	Update grid-to-pipe table converter with alignment support - Properly preserves left/center/right alignment from grid tables - Added --check mode for pre-commit warning - Added book-check-grid-tables hook to warn about grid tables - Grid tables should be converted to pipe for better inline Python support	2026-02-01 22:50:09 -05:00
Vijay Janapa Reddi	a85d513cd1	Fix list spacing: add before, remove between items Updated fix_bullet_spacing.py to handle both cases: 1. Add blank line BEFORE lists (intro text followed by bullet) 2. Remove blank lines BETWEEN consecutive list items Fixed 70 issues across 17 files in vol1, vol2, and frontmatter.	2026-02-01 22:21:32 -05:00
Vijay Janapa Reddi	f94e5514cf	Add bullet spacing check to pre-commit hooks - Updated fix_bullet_spacing.py with --check mode for CI validation - Added book-fix-bullet-spacing hook to auto-fix missing blank lines before bullet lists during commits - Script now provides clear error messages with line numbers	2026-02-01 22:18:19 -05:00
Vijay Janapa Reddi	6a343e8767	Add blank line before bullet lists for proper PDF rendering Fixed 19 bullet lists across vol1 and vol2 that were missing the blank line before the list starts. This ensures proper rendering in PDF/LaTeX. Added fix_bullet_spacing.py utility script for automated detection and fixing of this pattern.	2026-02-01 21:19:03 -05:00
Vijay Janapa Reddi	89df0affb4	Replace LaTeX symbols with Unicode near inline Python Use Unicode symbols to prevent decimal point stripping: - × instead of $\times$ - ≈ instead of $\approx$ - ≪ instead of $\ll$ - μ instead of $\mu$ - 10⁹, 10⁶ for powers LaTeX parsing was interfering with inline Python output, causing "5.9" to render as "59". Unicode symbols avoid this. Verified via HTML render - all computed values correct.	2026-02-01 20:50:28 -05:00
Vijay Janapa Reddi	86d2e15372	Convert all appendix grid tables to pipe tables - appendix_data.qmd: Data Gravity and Serialization tables - appendix_dam.qmd: DAM Components, Troubleshooting, Tooling, Scorecard tables - appendix_algorithm.qmd: Tensor Primitives table - appendix_machine.qmd: Numerical Formats table Pipe tables handle inline Python code better than grid tables. Also adds utility script for future grid-to-pipe conversions.	2026-02-01 20:36:06 -05:00

1 2

100 Commits