1 Commits

Author SHA1 Message Date
Vijay Janapa Reddi
5edb1df253 fix(mlsysim): round 10 fixes + laws-explained doc
- Fix goodput formula: use steady-state overhead model (checkpoint/interval +
  recovery/MTBF) instead of prob_fail-based formula that approaches zero at scale
- Fix speculative decode: draft model uses its own KV cache, not target's
- Clarify hierarchical AllReduce: document NCCL reduce-scatter design choice
- Add docs/laws-explained.md: plain-English explanation of all 22 walls + Iron Law
2026-04-01 18:15:49 -04:00