cs249r_book

github-starred/cs249r_book

Fork 0

mirror of https://github.com/harvard-edge/cs249r_book.git synced 2026-05-10 15:49:25 -05:00

RSS Feed

mlsysim-v0.1.1 2732a26740

Compare
Stable

GiteaMirror released this 2026-04-24 13:59:31 -05:00 | 1545 commits to dev since this release
📅 Originally published on GitHub: Fri, 24 Apr 2026 19:02:57 GMT
🏷️ Git tag created: Fri, 24 Apr 2026 18:59:31 GMT

MLSys·im 0.1.1 — Paper Title Correction

First-principles infrastructure modeling for the Machine Learning Systems textbook.

Metadata-only patch release. No code or API changes; safe drop-in
replacement for 0.1.0. Corrects the paper title cited in three places
to match the actual title of the companion paper.

📚 Documentation
- Paper title corrected across CITATION.cff, the BibTeX snippet in
  README.md, and the reference docstring in mlsysim/core/walls.py.
  Was: "A Composable Analytical Framework for Machine Learning Systems."
  Now: "MLSys·im: First-Principles Infrastructure Modeling for Machine
  Learning Systems."
🏗️ Packaging & Dependencies
- Version bumped to 0.1.1 across pyproject.toml, mlsysim/__init__.py,
  and CITATION.cff; date-released updated to 2026-04-24.
Contributors
- @profvjreddi
Install
```
pip install --upgrade mlsysim==0.1.1
```
Links
About MLSys·im

MLSys·im is the first-principles infrastructure modeling engine that
produces every quantitative result in the Machine Learning Systems
textbook — memory bandwidth calculations, roofline analyses, TCO projections,
sustainability estimates, and every other number a reader encounters. Each
figure and equation in the textbook is computed, not hand-typed; mlsysim
is the computation.

The framework codifies 22 systems walls — the physical and logical
constraints that bound ML system performance — into composable solvers,
with SI units enforced at runtime. Designed for three audiences: students
building quantitative intuition, instructors running live classroom
demonstrations, and researchers doing rapid what-if analysis.
Downloads
- mlsysim-0.1.1-py3-none-any.whl
  160 KiB
  2026-05-08 09:58:32 -05:00
- mlsysim-0.1.1.tar.gz
  139 KiB
  2026-05-08 09:58:34 -05:00
- Source Code (ZIP)
- Source Code (TAR.GZ)
mlsysim-v0.1.0 caa57a46d7

Compare
Stable

GiteaMirror released this 2026-04-24 12:56:51 -05:00 | 1600 commits to dev since this release
📅 Originally published on GitHub: Fri, 24 Apr 2026 17:57:22 GMT
🏷️ Git tag created: Fri, 24 Apr 2026 17:56:51 GMT

MLSys·im 0.1.0 — Initial Release

Release date: 2026-04-01

MLSys·im is a first-principles analytical engine for predicting performance, cost, and carbon footprint of ML systems. It is the computational companion to the Machine Learning Systems textbook.

This is the first public release. The API surface, the 22-wall taxonomy, and the solver portfolio are all considered stable for the 0.1.x line.

Install
```
pip install mlsysim
```
Verify the install:
```
python -c "import mlsysim; print(mlsysim.__version__)"
mlsysim eval Llama3_8B H100 --batch-size 32
```
Requires Python 3.10+. No GPU required — the engine computes from closed-form equations.

Five-line quickstart
```
import mlsysim
from mlsysim import Engine

profile = Engine.solve(
    model      = mlsysim.Models.ResNet50,
    hardware   = mlsysim.Hardware.Cloud.A100,
    batch_size = 1,
    precision  = "fp16",
)

print(f"Bottleneck: {profile.bottleneck}")              # → Memory
print(f"Latency:    {profile.latency.to('ms'):~.2f}")   # → 0.54 ms
print(f"Throughput: {profile.throughput:.0f}")          # → 1843 / second
```
Highlights

Core framework
- 22-wall taxonomy organizing every constraint that bounds ML system performance, across six domains (Node, Data, Algorithm, Fleet, Ops, Analysis).
- 26 analytical solvers (Model, Solver, Optimizer classes) covering all 22 walls.
- Pint unit system with dimensional analysis enforced at runtime.
- TraceableConstant pattern — every default carries a citation.
- Pipeline composer for chaining solvers with explain() and run().
- 3-tier evaluation scorecard: Feasibility → Performance → Macro/Economics.
- Design Space Exploration (DSE) engine with declarative search and constraint evaluation.
Hardware Registry (15+ accelerators)

V100, A100, H100, H200, B200, GB200 NVL72, MI300X, TPUv5p, T4, Cerebras CS-3, Jetson Orin NX, ESP32-S3, nRF52840, Himax WE-I Plus, DGX Spark, MacBook M3 Max, iPhone 15 Pro, Pixel 8.

Full precision support: FP32, TF32, BF16, FP16, FP8, INT8, INT4. Multi-level memory hierarchy with HBM, SRAM, and Flash (TinyML). All specifications verified against manufacturer datasheets.

Model Registry

GPT-2/3/4, Llama-2/3 (7B/8B/70B), BERT Base/Large, ResNet-50, MobileNetV2, AlexNet, Mamba, Stable Diffusion v1.5, DS-CNN, WakeVision. HuggingFace importer included for arbitrary Transformer workloads.

Analytical models

SingleNodeModel, NetworkRooflineModel, EfficiencyModel, ForwardModel, ServingModel, ContinuousBatchingModel, WeightStreamingModel, TailLatencyModel, DataModel, TransformationModel, TopologyModel, ScalingModel, InferenceScalingModel, CompressionModel, DistributedModel, ReliabilityModel, OrchestrationModel, EconomicsModel, SustainabilityModel, CheckpointModel, ResponsibleEngineeringModel, SensitivitySolver, SynthesisSolver, ParallelismOptimizer, BatchingOptimizer, PlacementOptimizer.

CLI
```
mlsysim eval        Evaluate the analytical physics of an ML system (YAML or CLI flags)
mlsysim serve       Evaluate LLM serving (prefill + decode)
mlsysim optimize    Search the design space for optimal configurations
mlsysim zoo         Explore the built-in registries
mlsysim audit       Profile your local hardware against the Iron Law
mlsysim schema      Export the JSON Schema for the mlsys.yaml configuration file
```
Testing

367 tests, 100% pass rate. Coverage includes formula unit tests with known answers, full solver suite, physics-bound validation across all registry hardware, wall-taxonomy completeness, pipeline composition, and three optimization backends (exhaustive, OR-tools, scipy).

Documentation
- Site: mlsysbook.ai/mlsysim/
- Tutorials: roofline, memory wall, KV cache, scaling to 1000 GPUs, geography, sensitivity, full-stack audit
- API reference: mlsysbook.ai/mlsysim/api/
Known limitations & gotchas
- First-order analytical model. Predictions are typically within 15–30% of measured throughput on well-optimized workloads. Use MLSys·im to compare options and identify bottlenecks; validate with empirical benchmarks before committing to a production SLA. See accuracy.qmd for full validation against MLPerf v4.0.
- Slide PDFs. Many tutorials cross-link to lecture slides at github.com/harvard-edge/cs249r_book/releases/download/slides-latest/*.pdf. The slides-latest release tag is not yet published; these links will resolve once the slides ship.
- Hosted notebook launchers. Google Colab and Binder buttons are planned but not wired up for 0.1.0. Tutorials run locally on any Python 3.10+ environment.
Project links
- Source: github.com/harvard-edge/cs249r_book/tree/dev/mlsysim
- Issues: github.com/harvard-edge/cs249r_book/issues
- License: Apache-2.0 (code) · CC-BY-NC-SA-4.0 (documentation)
- Citation: see CITATION.cff
Downloads
- mlsysim-0.1.0-py3-none-any.whl
  159 KiB
  2026-05-08 09:58:22 -05:00
- mlsysim-0.1.0.tar.gz
  139 KiB
  2026-05-08 09:58:25 -05:00
- Source Code (ZIP)
- Source Code (TAR.GZ)
tinytorch-v0.1.10 d9afa1b970

Compare
Stable

GiteaMirror released this 2026-04-23 20:13:04 -05:00 | 1574 commits to dev since this release
📅 Originally published on GitHub: Fri, 24 Apr 2026 14:11:23 GMT
🏷️ Git tag created: Fri, 24 Apr 2026 01:13:04 GMT

TinyTorch v0.1.10

Largest Lab Guide upgrade since the series began, plus substantive framework work: new Tensor API surface (view, masked_fill, ndim, numel, contiguous), no_grad() context manager, Python 3.10+ baseline, 28 security alerts resolved, seven batches of module-audit fixes, reproducibility via seeded default_rng(7), and 847/847 tests green.

✨ New Features

Lab Guide PDF (the flagship of this release)
- Glossary back matter: 90 alphabetical entries covering tensor/memory, autograd, training systems, architecture, optimization, and ML basics, with module cross-references (see glossary.qmd).
- Module opener hooks: every module starts with a 2–3 sentence systems-first framing paragraph leading with memory, bandwidth, arithmetic intensity, HBM, or roofline implications before the ML story.
- Code listing captions + List of Listings: ~60 substantive code blocks carry Listing N.M — Description captions; populates a new List of Listings in the front matter.
- List of Figures + List of Tables in front matter (151 newly-captioned tables).
- Running headers with chapter + section: verso Chapter N · Title, recto N.M · Section, wordmark centered. H&P / CLRS convention.
- Personal instructor note signed callout at the end of the conclusion.
- Single-source build: make install-deps && make pdf works identically locally and in CI; deps live in pdf/apt-requirements.txt and pdf/tex-requirements.txt.
TinyTorch Framework
- Tensor API expansion: added view(), masked_fill(), Tensor stacking, ndim, numel(), and contiguous() (closes #1298; PR #1392 by @Shashank-Tripathi-07). PyTorch-compat test coverage added for all new methods.
- no_grad() context manager: autograd now supports with no_grad(): inference blocks plus graph cleanup between passes.
- Tito CLI: module path --about renamed to module path --guide and repointed at the Quarto chapter (consistent with the Lab Guide becoming the reference).
🐛 Framework Bug Fixes

Autograd and training
- Tanh wired into enable_autograd() — was silently producing zero gradients.
- Trainer.evaluate accuracy for regression models corrected (was misreporting).
- GELU gradient mismatch + float32 test precision fixed.
- Trainer init: guard requires_grad loop against non-Tensor params; ensure model params have requires_grad=True (2 commits).
- M06 _reduce_broadcast_grad aligned with module conventions.
Module-level
- Quantization (M15): constant tensor quantized to all-zeros, losing the original value (#1444).
- MaxPool2d: API mismatch in milestone 04 CIFAR script fixed (#1278).
- Export paths: corrected for modules 09 and 13.
- Token constants: refactor cleanup from PR #1279 (#1256).
- M19 benchmarking: MLPerf trademark attribution added, educational-purposes disclaimer, table alignment fix, addresses feedback from #1196.
- M02 activations: improved activation graph visualization.
Module audit fixes
- Seven batches of audit fixes landed: batches 1–7 covering critical fixes, medium/low documentation and accuracy, and test-infrastructure cleanup. Final state: 847/847 tests passing.
🔬 Tests
- Finite-difference gradient correctness tests added for Module 06.
- Module 08 training infrastructure coverage tests added.
- Gradient correctness suite restored with per-op tolerances (#1342).
- Module 10 tokenization tests now use real Tensor params instead of raw numpy arrays.
- Module 08 scheduler lr assertion corrected (epoch 0, not 1).
🔧 Engineering
- Reproducibility: migrated from legacy np.random to default_rng(7) — seeded, per-call RNG across all modules.
- Python baseline: minimum version bumped to 3.10. Milestone 05 docs updated.
- src/*/ABOUT.md cleanup: 20 stale duplicates deleted (−20,876 lines); the single-source ABOUT.md now lives in the correct companion-doc location.
- Security: all 28 GitHub code-scanning alerts resolved.
- Tito: register --tinytorch pytest flag in conftest; fix UnicodeDecodeError on Windows in tito module complete (#1184); null-synced_modules guard in submission progress response.
📖 Content Improvements
- Systems-first narrative: every module hook leads with the systems angle (memory, bandwidth, compute, hardware utilization) before pivoting to ML theory.
- Check Your Understanding callouts: converted from prose sections to callout-tip format with 3–5 technical-specific checkboxes per module.
- Key Takeaways: 3–4 bullet recap plus next-module hook at the end of every module chapter.
- Systems Implication callouts unified to callout-note across all 21 instances; answers converted to callout-tip collapse="true" across 101 Q&A pairs.
- Cross-reference audit: 216 orphan table/figure/listing labels got natural prose references (87% coverage).
- Further Reading hyperlinks: 20 external URLs verified and linked (Jay Alammar, arXiv papers, Karpathy's blog, Jurafsky & Martin SLP3).
- Broadcasting pitfall now taught alongside the broadcasting feature (M01 tensor).
- log_softmax implementation cleaned up with clearer variable names and reuse.
- Type hints: added to M03 layers, M04 losses, M05 dataloader (#1167).
- Big-picture diagram: redesigned as a 4-layer stack (Capstone → Optimization → Architecture → Foundation) in neutral palette with MIT-red Capstone accent.
🎨 Design and Typography
- Book-style typography: linestretch: 1.1, first-line indent (parindent: 1.2em), tight parskip. Stripe-Press / Swift-Book density.
- Thin single orange header rule (previously double rule).
- 23 module-diagram SVGs aligned to the book palette via a Gemini multimodal audit pass.
- Arraystretch 1.2 + enumitem for table and list breathing room.
- Text-only callout titles (no stripped emojis; class semantics drive visual distinction).
🐛 Lab Guide Bug Fixes
- Tokenization module: restored missing ```{python} fence that caused Pandoc to render Python variable-definition comments as chapter headings.
- Single-PDF guarantee: Makefile self-heals when Quarto's post-render cleanup strands the artifact at pdf/ instead of pdf/_build/. Build-end banner prints the canonical path.
- 3 broken URLs fixed: GPT-2 cloudfront → OpenAI CDN, PyTorch .md → .html, mlu-explain /relu/ → /neural-networks/.
- Duplicate trailing ## Get Started removed from 4 modules (copy-paste artifact).
- Orphan big-picture-module-flow.svg removed from images/diagrams/ (canonical lives at images/svg/).
🔧 CI / Infrastructure
- Single-source deps: make install-deps reads pdf/apt-requirements.txt and pdf/tex-requirements.txt — same command works locally and in CI.
- tinytorch-build-pdfs.yml and tinytorch-update-pdfs.yml simplified to make install-deps && make pdf (no inline tlmgr package list).
- make clean extended to remove stale *_files/ directories at the Quarto project root.
📚 Documentation
- Early Explorer callout removed from getting-started.qmd — no longer appropriate now that the Lab Guide is shipping.
- Callout convention documented in the preamble: six semantic callout types keyed off Quarto's five shipped classes plus title conventions.
- README tables converted from markdown to HTML format for consistent rendering across GitHub and the Lab Guide.
👥 Contributors

Thanks to everyone who contributed to this release:
- @profvjreddi — editorial direction and polish across all 20 modules
- @hzeljko — sustained code, diagram, and infrastructure contributions
- @Shashank-Tripathi-07 — Tensor PyTorch-compat API (ndim, numel, view, contiguous, masked_fill; PR #1392; first-time contributor!)
- @farhan523 — ongoing documentation and module improvements
- @adityamulik — null synced_modules fix in tito submission progress
- @harishb00 — type hints across M03/M04/M05
🆕 New Contributors
- @Shashank-Tripathi-07 made their first code contribution (PR #1392)
Full Changelog: https://github.com/harvard-edge/cs249r_book/compare/tinytorch-v0.1.9...tinytorch-v0.1.10

Website: https://mlsysbook.ai/tinytorch/

PDF: https://mlsysbook.ai/tinytorch/assets/downloads/TinyTorch-Guide.pdf

Glossary (new): https://mlsysbook.ai/tinytorch/glossary.html
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
slides-latest f9ad401c08

Compare
Stable

GiteaMirror released this 2026-04-21 16:28:33 -05:00 | 1912 commits to dev since this release
📅 Originally published on GitHub: Tue, 21 Apr 2026 21:37:27 GMT
🏷️ Git tag created: Tue, 21 Apr 2026 21:28:33 GMT

ML Systems Lecture Slides

Complete slide decks for the ML Systems curriculum.

Volume I (17 decks): Introduction through Conclusion — single-machine ML systems
Volume II (18 decks): Introduction through Conclusion — distributed ML at scale
TinyML (5 chapters): HarvardX Professional Certificate — edge & embedded ML

Vol I/II decks include speaker notes, active learning exercises, and original SVG diagrams.

Downloads — PDF
- MLSysBook-Slides-All-PDF.zip — All 35 Vol I/II decks
- MLSysBook-Slides-Vol1-PDF.zip — Volume I only
- MLSysBook-Slides-Vol2-PDF.zip — Volume II only
Downloads — PowerPoint

Image-based PPTX (not editable text) — use for presenter mode and annotations.
- MLSysBook-Slides-All-PPTX.zip — All 35 Vol I/II decks
- MLSysBook-Slides-Vol1-PPTX.zip — Volume I only
- MLSysBook-Slides-Vol2-PPTX.zip — Volume II only
Downloads — TinyML (HarvardX edX)
- MLSysBook-TinyML-All.zip — All slides, readings, and supplementary materials
- MLSysBook-TinyML-Slides.zip — 178 slide decks only
- MLSysBook-TinyML-Readings.zip — 127 readings only
Individual Vol I/II chapter files (PDF + PPTX) are also attached below.

Source
- Vol I/II LaTeX source: slides/
- TinyML courseware: tinyMLx/courseware
Downloads
- 00_course_overview.pdf
  1.1 MiB
  2026-04-27 16:22:26 -05:00
- 00_course_overview.pptx
  512 KiB
  2026-04-27 16:22:27 -05:00
- 01_introduction.pdf
  1.3 MiB
  2026-04-27 16:22:28 -05:00
- 01_introduction.pptx
  723 KiB
  2026-04-27 16:22:29 -05:00
- 02_compute_infrastructure.pdf
  1.2 MiB
  2026-04-27 16:22:30 -05:00
- 02_compute_infrastructure.pptx
  560 KiB
  2026-04-27 16:22:31 -05:00
- 02_ml_systems.pdf
  1.2 MiB
  2026-04-27 16:22:31 -05:00
- 02_ml_systems.pptx
  647 KiB
  2026-04-27 16:22:32 -05:00
- 03_ml_workflow.pdf
  1.2 MiB
  2026-04-27 16:22:33 -05:00
- 03_ml_workflow.pptx
  571 KiB
  2026-04-27 16:22:34 -05:00
- 03_network_fabrics.pdf
  1.3 MiB
  2026-04-27 16:22:35 -05:00
- 03_network_fabrics.pptx
  828 KiB
  2026-04-27 16:22:36 -05:00
- 04_data_engineering.pdf
  1.2 MiB
  2026-04-27 16:22:38 -05:00
- 04_data_engineering.pptx
  569 KiB
  2026-04-27 16:22:39 -05:00
- 04_data_storage.pdf
  1.2 MiB
  2026-04-27 16:22:40 -05:00
- 04_data_storage.pptx
  516 KiB
  2026-04-27 16:22:42 -05:00
- 05_distributed_training.pdf
  1.2 MiB
  2026-04-27 16:22:45 -05:00
- 05_distributed_training.pptx
  616 KiB
  2026-04-27 16:22:46 -05:00
- 05_nn_computation.pdf
  1.3 MiB
  2026-04-27 16:22:49 -05:00
- 05_nn_computation.pptx
  929 KiB
  2026-04-27 16:22:50 -05:00
- 06_collective_communication.pdf
  1.2 MiB
  2026-04-27 16:22:53 -05:00
- 06_collective_communication.pptx
  632 KiB
  2026-04-27 16:22:55 -05:00
- 06_nn_architectures.pdf
  1.2 MiB
  2026-04-27 16:22:59 -05:00
- 06_nn_architectures.pptx
  561 KiB
  2026-04-27 16:23:02 -05:00
- 07_fault_tolerance.pdf
  1.3 MiB
  2026-04-27 16:23:05 -05:00
- 07_fault_tolerance.pptx
  690 KiB
  2026-04-27 16:23:07 -05:00
- 07_frameworks.pdf
  1.2 MiB
  2026-04-27 16:23:09 -05:00
- 07_frameworks.pptx
  910 KiB
  2026-04-27 16:23:11 -05:00
- 08_fleet_orchestration.pdf
  1.2 MiB
  2026-04-27 16:23:13 -05:00
- 08_fleet_orchestration.pptx
  839 KiB
  2026-04-27 16:23:14 -05:00
- 08_training.pdf
  1.3 MiB
  2026-04-27 16:23:16 -05:00
- 08_training.pptx
  1.2 MiB
  2026-04-27 16:23:17 -05:00
- 09_data_selection.pdf
  1.2 MiB
  2026-04-27 16:23:18 -05:00
- 09_data_selection.pptx
  499 KiB
  2026-04-27 16:23:19 -05:00
- 09_inference.pdf
  1.2 MiB
  2026-04-27 16:23:20 -05:00
- 09_inference.pptx
  591 KiB
  2026-04-27 16:23:21 -05:00
- 10_model_compression.pdf
  1.3 MiB
  2026-04-27 16:23:23 -05:00
- 10_model_compression.pptx
  709 KiB
  2026-04-27 16:23:27 -05:00
- 10_performance_engineering.pdf
  1.2 MiB
  2026-04-27 16:23:27 -05:00
- 10_performance_engineering.pptx
  720 KiB
  2026-04-27 16:23:29 -05:00
- 11_edge_intelligence.pdf
  1.2 MiB
  2026-04-27 16:23:30 -05:00
- 11_edge_intelligence.pptx
  569 KiB
  2026-04-27 16:23:31 -05:00
- 11_hw_acceleration.pdf
  1.7 MiB
  2026-04-27 16:23:31 -05:00
- 11_hw_acceleration.pptx
  876 KiB
  2026-04-27 16:23:32 -05:00
- 12_benchmarking.pdf
  1.2 MiB
  2026-04-27 16:23:33 -05:00
- 12_benchmarking.pptx
  767 KiB
  2026-04-27 16:23:34 -05:00
- 12_ops_scale.pdf
  1.2 MiB
  2026-04-27 16:23:34 -05:00
- 12_ops_scale.pptx
  454 KiB
  2026-04-27 16:23:36 -05:00
- 13_model_serving.pdf
  1.4 MiB
  2026-04-27 16:23:37 -05:00
- 13_model_serving.pptx
  897 KiB
  2026-04-27 16:23:38 -05:00
- 13_security_privacy.pdf
  1.2 MiB
  2026-04-27 16:23:39 -05:00
- 13_security_privacy.pptx
  787 KiB
  2026-04-27 16:23:40 -05:00
- 14_ml_ops.pdf
  1.1 MiB
  2026-04-27 16:23:41 -05:00
- 14_ml_ops.pptx
  527 KiB
  2026-04-27 16:23:41 -05:00
- 14_robust_ai.pdf
  1.2 MiB
  2026-04-27 16:23:42 -05:00
- 14_robust_ai.pptx
  710 KiB
  2026-04-27 16:23:44 -05:00
- 15_responsible_engr.pdf
  1.4 MiB
  2026-04-27 16:23:45 -05:00
- 15_responsible_engr.pptx
  921 KiB
  2026-04-27 16:23:47 -05:00
- 15_sustainable_ai.pdf
  1.2 MiB
  2026-04-27 16:23:49 -05:00
- 15_sustainable_ai.pptx
  465 KiB
  2026-04-27 16:23:49 -05:00
- 16_conclusion.pdf
  1.1 MiB
  2026-04-27 16:23:50 -05:00
- 16_conclusion.pptx
  427 KiB
  2026-04-27 16:23:51 -05:00
- 16_responsible_ai.pdf
  1.2 MiB
  2026-04-27 16:23:54 -05:00
- 16_responsible_ai.pptx
  561 KiB
  2026-04-27 16:23:55 -05:00
- 17_conclusion.pdf
  1.1 MiB
  2026-04-27 16:23:58 -05:00
- 17_conclusion.pptx
  591 KiB
  2026-04-27 16:24:00 -05:00
- MLSysBook-Slides-All-PDF.zip
  44 MiB
  2026-04-27 16:24:11 -05:00
- MLSysBook-Slides-All-PPTX.zip
  24 MiB
  2026-04-27 16:24:16 -05:00
- MLSysBook-Slides-Vol1-PDF.zip
  22 MiB
  2026-04-27 16:24:20 -05:00
- MLSysBook-Slides-Vol1-PPTX.zip
  13 MiB
  2026-04-27 16:24:27 -05:00
- MLSysBook-Slides-Vol2-PDF.zip
  21 MiB
  2026-04-27 16:24:30 -05:00
- MLSysBook-Slides-Vol2-PPTX.zip
  11 MiB
  2026-04-27 16:24:34 -05:00
- MLSysBook-TinyML-All.zip
  249 MiB
  2026-04-27 16:25:06 -05:00
- MLSysBook-TinyML-Readings.zip
  87 MiB
  2026-04-27 16:25:16 -05:00
- MLSysBook-TinyML-Slides.zip
  159 MiB
  2026-04-27 16:25:34 -05:00
- Source Code (ZIP)
- Source Code (TAR.GZ)
tinytorch-v0.1.9 6a28376b0a

Compare
Stable

GiteaMirror released this 2026-02-17 18:11:17 -06:00 | 5073 commits to dev since this release
📅 Originally published on GitHub: Wed, 18 Feb 2026 00:19:13 GMT
🏷️ Git tag created: Wed, 18 Feb 2026 00:11:17 GMT

TinyTorch v0.1.9

Computed values across all ABOUT.md docs, VS Code extension thin client, progressive disclosure improvements, and community contributions.

✨ New Features
- Computed Values in ABOUT.md: Converted all 20 module ABOUT.md files to MyST Markdown Notebooks with inline Python-computed values via {glue:text} — eliminates hardcoded arithmetic errors and ensures all numerical claims are always correct
- VS Code Extension: New thin client architecture over Tito CLI with notebook editor support, build tree, and module explorer
- Version Badge: Auto-updating version badge in site navbar, refreshed on every release via CI
📖 Content Improvements
- Progressive Disclosure: Enforced scaffolding across 9 modules — solution blocks decomposed for pedagogical consistency
- Function Decomposition: Standardized naming conventions and formatting across all 20 modules
- Module 15 (Quantization): Corrected INT8 zero-point values in quantization docs
- Module 16 (Compression): Fixed sparsity percentage bugs
- Module 19 (Benchmarking): Aligned MLPerf box-drawing characters and tree indentation
- EmbeddingBackward: Moved from Module 06 to Module 11 where it belongs conceptually
🐛 Bug Fixes
- Windows Install: Fixed install issues on Windows/Git Bash by @adil-mubashir-ch in #1169
- SocratiQ Typo: Fixed typo in SocratiQ introduction by @BunningsWarehouseOfficial in #1170
- Google Auth iframe: Fixed Google auth and slow index.html loading by @kai4avaya in #1172
- Notebook Filenames: Aligned notebook filenames with Tito convention across all docs (fixes #1176 — thanks @sotoblanco)
- Missing Exports: Added missing #| export directives across 10 modules
- PDF Build: Capped Mermaid figure sizes and fixed nested code fences for LaTeX output
- VS Code Extension: Fixed notebooks opening in raw JSON instead of interactive editor
📚 Documentation
- Updated TITO reference docs to match actual CLI commands
- Fixed broken paths in CONTRIBUTING.md and INSTRUCTOR.md
- Added intra-module scaffolding subsection to progressive disclosure paper
🔧 CI/Infrastructure
- Slide decks download from release during deployment
- VS Code extension artifacts properly gitignored
👥 Contributors

Thanks to all contributors who made this release possible:
- @adil-mubashir-ch
- @BunningsWarehouseOfficial
- @kai4avaya
- @sotoblanco
- @harishb00a
- @profvjreddi
🆕 New Contributors
- @adil-mubashir-ch made their first contribution in #1169
- @sotoblanco reported #1176 (notebook filename mismatch)
- @harishb00a contributed documentation improvements
Full Changelog: https://github.com/harvard-edge/cs249r_book/compare/tinytorch-v0.1.8...tinytorch-v0.1.9

Website: https://mlsysbook.ai/tinytorch/
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
tinytorch-v0.1.8 2d9a5b46e5

Compare
Stable

GiteaMirror released this 2026-02-07 15:02:14 -06:00 | 5125 commits to dev since this release
📅 Originally published on GitHub: Sun, 08 Feb 2026 04:06:00 GMT
🏷️ Git tag created: Sat, 07 Feb 2026 21:02:14 GMT

TinyTorch v0.1.8

Content updates, website improvements, and community contributions.

✨ New Features
- Team Page: Auto-generated team page from .all-contributorsrc with reorganized Community section
- Slide Viewer: Embedded PDF slide viewer on all module pages for in-browser viewing
- Milestone Visualization: Step-by-step visualization for milestones by @AndreaMattiaGaravagno in #1151
- Site-Only Deploy: New workflow option to deploy website without version bump
🐛 Bug Fixes
- Attention Module: Corrected O(n²) complexity explanation and memory table bug — reported in #1150
- Activations Module: Fixed misleading GELU hint about 1.702 constant — reported in #1154
- Activations Module: Expanded GELU explanation with both approximation forms
- Layers Module: Corrected Xavier/Glorot initialization terminology
- Tito CLI: Resolved Jupyter kernel mismatch causing ModuleNotFoundError (#1147)
- Paper Build: Escaped special LaTeX characters breaking PDF build
- Milestones: Fixed bold cyan frame alignment by @AndreaMattiaGaravagno in #1152
- Content: Fixed small typo by @minhdang26403 in #1163
📚 Documentation
- Specify GenAI usage in slides by @AndreaMattiaGaravagno in #1149
- Added @oscarf189 and @Takosaga as contributors
🔧 CI/Infrastructure
- Download slide decks from release during deployment
- Fixed auto-label permissions for fork PRs (#1153)
- Handle branch names with slashes in fresh install test (#1158)
👥 Contributors

Thanks to all contributors who made this release possible:
- @AndreaMattiaGaravagno
- @minhdang26403
- @oscarf189
- @Takosaga
- @profvjreddi
🆕 New Contributors
- @AndreaMattiaGaravagno made their first contribution in #1149
- @minhdang26403 made their first contribution in #1163
- @oscarf189
- @Takosaga
Full Changelog: https://github.com/harvard-edge/cs249r_book/compare/tinytorch-v0.1.7...tinytorch-v0.1.8

Website: https://mlsysbook.ai/tinytorch/
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
tinytorch-v0.1.7 01fc7df6a0

Compare
Stable

GiteaMirror released this 2026-01-29 15:57:48 -06:00 | 5180 commits to dev since this release
📅 Originally published on GitHub: Thu, 29 Jan 2026 22:48:20 GMT
🏷️ Git tag created: Thu, 29 Jan 2026 21:57:48 GMT

TinyTorch v0.1.7

Critical fix for module exports that were silently failing in CI and some user environments.

🐛 Bug Fixes
- Export System: Uses nbdev Python API instead of CLI for reliable cross-platform exports
- Export System: Fixed directory detection when running from tinytorch/ directory
- Export System: Failures now show full error details for debugging - reported by @lalalostcode in #1146
- Milestones: Fixed Tensor class passing in MLPerf step functions
✨ Improvements
- Paper Link: Now links to arXiv with external link icon (↗) instead of download
- CLI: Invalid commands show helpful error messages
🔧 CI/Infrastructure
- Renamed Publish (Dev) → Preview (Dev) for clearer workflow naming
- All tests run on all platforms by default
- Test types aligned with CLI naming (--user-journey)
👥 Contributors

Thanks to all contributors who made this release possible:
- @lalalostcode
- @profvjreddi
Full Changelog: https://github.com/harvard-edge/cs249r_book/compare/tinytorch-v0.1.6...tinytorch-v0.1.7

Website: https://mlsysbook.ai/tinytorch/
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
tinytorch-v0.1.6 cb1062355e

Compare
Stable

GiteaMirror released this 2026-01-27 12:00:22 -06:00 | 5228 commits to dev since this release
📅 Originally published on GitHub: Tue, 27 Jan 2026 18:00:32 GMT
🏷️ Git tag created: Tue, 27 Jan 2026 18:00:22 GMT

Windows/Git Bash Support 🪟

The installer script now works on Windows via Git Bash!

Changes
- Platform detection for OS-specific guidance during installation
- More reliable pip invocation using $PYTHON_CMD -m pip
- Cross-platform line endings via .gitattributes
- Virtual environment activation works correctly on Windows
Contributors

Thanks to the community for Windows support:
- @Kobra299 - reported the Windows issue (#1078)
- @rnjema - developed Windows installation improvements (PR #1105)
- @joeswagson - developed PowerShell installer concept (PR #1083)
Installation

Windows (Git Bash):
```
curl -sSL mlsysbook.ai/tinytorch/install.sh | bash
cd tinytorch
source .venv/Scripts/activate
tito setup
```
macOS/Linux:
```
curl -sSL mlsysbook.ai/tinytorch/install.sh | bash
cd tinytorch
source .venv/bin/activate
tito setup
```
Full Changelog: https://github.com/harvard-edge/cs249r_book/blob/main/tinytorch/CHANGELOG.md
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
tinytorch-v0.1.5 da97497546

Compare
Stable

GiteaMirror released this 2026-01-27 07:55:07 -06:00 | 5251 commits to dev since this release
📅 Originally published on GitHub: Tue, 27 Jan 2026 14:00:21 GMT
🏷️ Git tag created: Tue, 27 Jan 2026 13:55:07 GMT

TinyTorch v0.1.5

This release includes Windows support, bug fixes, and documentation improvements.

✨ New Features
- Windows Support: Full Windows compatibility with Git Bash
  - Added PYTHONUTF8 and PYTHONIOENCODING for proper Unicode/emoji handling
  - Windows OS matrix support in CI for progressive testing
🐛 Bug Fixes
- Activations Module: Fixed Softmax forward pass implementation by @minhdang26403 in #1141
- Activations Module: Removed unnecessary Sigmoid clipping by @minhdang26403 in #1140
- Activations Module: Fixed typo and answer render error by @minhdang26403 in #1139
- Convolutions Module: Fixed computation example (Position 1,1: 8→7) - reported by @ngbolin in #1144
- Convolutions Module: Fixed pooling example element lists and averages
- Tensor Module: Fixed matrix multiplication docstring examples
- Profiling Module: Fixed convolution FLOPs calculation
- Optimizer: Fixed gradient bug and CI improvements by @profvjreddi in #1136
📝 Documentation
- Fixed broken chapter links in README by @BunningsWarehouseOfficial in #1132
- Fixed typos by @didier-durand in #1133
- Star button now links to GitHub stars explanation section
🔧 CI/Infrastructure
- Windows CI improvements (using windows-2022 for stability)
- Validate workflow now only runs on dev push, not main
- Updated workflow references to tinytorch-validate-dev
👥 Contributors

Thanks to all contributors who made this release possible:
- @minhdang26403
- @BunningsWarehouseOfficial
- @didier-durand
- @ngbolin
- @profvjreddi
🆕 New Contributors
- @BunningsWarehouseOfficial made their first contribution in #1132
Full Changelog: https://github.com/harvard-edge/cs249r_book/compare/tinytorch-v0.1.4...tinytorch-v0.1.5

Website: https://mlsysbook.ai/tinytorch/
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
tinytorch-slides-v0.1.0 cdec74a422

Compare
Stable

GiteaMirror released this 2026-01-25 11:10:01 -06:00 | 5362 commits to dev since this release

📅 Originally published on GitHub: Sun, 25 Jan 2026 17:51:59 GMT
🏷️ Git tag created: Sun, 25 Jan 2026 17:10:01 GMT

Initial release of TinyTorch lecture slides (PDF format) for all 18 modules.
Downloads
- 00_tinytorch.pdf
  12 MiB
  2026-05-08 09:57:34 -05:00
- 01_tensor.pdf
  3.5 MiB
  2026-05-08 09:57:35 -05:00
- 02_activations.pdf
  1.5 MiB
  2026-05-08 09:57:36 -05:00
- 03_layers.pdf
  1.7 MiB
  2026-05-08 09:57:37 -05:00
- 04_losses.pdf
  2.3 MiB
  2026-05-08 09:57:38 -05:00
- 05_dataloader.pdf
  2.3 MiB
  2026-05-08 09:57:40 -05:00
- 06_autograd.pdf
  1.5 MiB
  2026-05-08 09:57:41 -05:00
- 07_optimizers.pdf
  1.8 MiB
  2026-05-08 09:57:42 -05:00
- 08_training.pdf
  3.9 MiB
  2026-05-08 09:57:43 -05:00
- 09_cnn.pdf
  2.5 MiB
  2026-05-08 09:57:44 -05:00
- 10_tokenization.pdf
  1.6 MiB
  2026-05-08 09:57:45 -05:00
- 11_embeddings.pdf
  1.8 MiB
  2026-05-08 09:57:46 -05:00
- 12_attention.pdf
  2.0 MiB
  2026-05-08 09:57:46 -05:00
- 13_transformers.pdf
  8.1 MiB
  2026-05-08 09:57:48 -05:00
- 14_profiling.pdf
  1.9 MiB
  2026-05-08 09:57:48 -05:00
- 15_quantization.pdf
  3.3 MiB
  2026-05-08 09:57:50 -05:00
- 16_compression.pdf
  2.6 MiB
  2026-05-08 09:57:51 -05:00
- 17_acceleration.pdf
  2.1 MiB
  2026-05-08 09:57:53 -05:00
- 18_kvcache.pdf
  1.7 MiB
  2026-05-08 09:57:54 -05:00
- 19_benchmarking.pdf
  2.1 MiB
  2026-05-08 09:57:56 -05:00
- 20_capstone.pdf
  11 MiB
  2026-05-08 09:57:58 -05:00
- Source Code (ZIP)
- Source Code (TAR.GZ)

1 2 3

24 Releases 31 Tags

Stable

MLSys·im 0.1.1 — Paper Title Correction

📚 Documentation

🏗️ Packaging & Dependencies

Contributors

Install

Links

About MLSys·im

Stable

MLSys·im 0.1.0 — Initial Release

Install

Five-line quickstart

Highlights

Core framework

Hardware Registry (15+ accelerators)

Model Registry

Analytical models

CLI

Testing

Documentation

Known limitations & gotchas

Project links

Stable

TinyTorch v0.1.10

✨ New Features

🐛 Framework Bug Fixes

🔬 Tests

🔧 Engineering

📖 Content Improvements

🎨 Design and Typography

🐛 Lab Guide Bug Fixes

🔧 CI / Infrastructure

📚 Documentation

👥 Contributors

🆕 New Contributors

Stable

ML Systems Lecture Slides

Downloads — PDF

Downloads — PowerPoint

Downloads — TinyML (HarvardX edX)

Source

Stable

TinyTorch v0.1.9

✨ New Features

📖 Content Improvements

🐛 Bug Fixes

📚 Documentation

🔧 CI/Infrastructure

👥 Contributors

🆕 New Contributors

Stable

TinyTorch v0.1.8

✨ New Features

🐛 Bug Fixes

📚 Documentation

🔧 CI/Infrastructure

👥 Contributors

🆕 New Contributors

Stable

TinyTorch v0.1.7

🐛 Bug Fixes

✨ Improvements

🔧 CI/Infrastructure

👥 Contributors

Stable

Windows/Git Bash Support 🪟

Changes

Contributors

Installation

Stable

TinyTorch v0.1.5

✨ New Features

🐛 Bug Fixes

📝 Documentation

🔧 CI/Infrastructure

👥 Contributors

🆕 New Contributors

Stable