Commit Graph

8 Commits

Author SHA1 Message Date
Vijay Janapa Reddi
611de228d9 fix(mlsysim): align docs with *Model naming convention
The solver.py refactoring renamed most solver classes from *Solver to
*Model (e.g. DistributedSolver → DistributedModel). The docs still
referenced the old names, causing the Quarto site build to fail with:
  ImportError: cannot import name 'DistributedSolver' from 'mlsysim'

- Fix executable code cells in tutorials/distributed.qmd
- Update non-executable code examples across 10 doc files
- Rename 19 API reference files from *Solver.qmd to *Model.qmd
- SensitivitySolver and SynthesisSolver retain their names (correct)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-09 08:39:11 -04:00
Vijay Janapa Reddi
2bbe3e1a69 docs(mlsysim): redesign website, add 12 tutorials, and CLI entry points
Replace 9 old tutorials with 12 new numbered tutorials (00-11) covering
roofline through full-stack audit. Redesign landing page, add
models-and-solvers and extending-the-engine guides. Add __main__.py,
cli.py, and cli/ package for command-line interface.
2026-03-12 16:04:51 -04:00
Vijay Janapa Reddi
d594b4abd0 docs(mlsysim): expand to 22-wall taxonomy with paper rewrite and overview figure
Expand walls.py from 17 to 22 walls, adding Serving (4), Batching (5),
Streaming (6), Tail Latency (7), and Checkpoint (19). Update paper.tex
with rewritten abstract, concrete LLaMA-3 motivating example, competitive
positioning against Calculon/ASTRA-sim/Vidur, and new overview figure.
Rebrand docs and tutorials to match.
2026-03-12 16:04:50 -04:00
Vijay Janapa Reddi
7d807316ba docs(mlsysim): update website for 21 solvers, Cerebras, and new workload types
Add 4 new solver API pages (CheckpointSolver, ContinuousBatchingSolver,
WeightStreamingSolver, TailLatencySolver). Update solver-guide, architecture,
getting-started, index, and whitepaper to reflect 21 solvers, SSM/Diffusion
workloads, HuggingFace importer, and MLSYSIM as Infrastructure Modeling.
Restore deleted _quarto-html.yml config with updated quartodoc entries.
2026-03-12 16:04:50 -04:00
Vijay Janapa Reddi
289e018223 refactor(mlsysim): typed results, wall taxonomy, and engineering naming
- Add typed Pydantic result models (Layer A) replacing dict returns
- Add canonical Wall taxonomy registry (walls.py) as single source of truth
- Add Pipeline composer (Layer C) for solver chaining with explain()/run()
- Rename domains: Metabolism→Node, Skeleton→Data, Mind→Algorithm, World→Fleet, Meta→Analysis
- Rename MetabolismSolver→EfficiencySolver and MetabolismResult→EfficiencyResult
- Update all solver classes with walls tuple referencing canonical wall numbers
- Convert all dict access patterns to typed attribute access across codebase
2026-03-12 16:04:50 -04:00
Vijay Janapa Reddi
7b145803c3 docs(mlsysim): update API docs, tutorials, and whitepaper for new architecture
Rewrite API reference pages to match domain subpackage structure.
Add solver doc pages for CompressionSolver, DataSolver,
OrchestrationSolver, and ScalingSolver. Update whitepaper,
math reference, getting-started guide, and tutorial index.
Add extending tutorial for custom solvers.
2026-03-12 16:04:50 -04:00
Vijay Janapa Reddi
aed43c5b81 docs: clean up landing page and centralize math foundations
- Elevate 5-Layer Progressive Lowering mental model to architecture.qmd

- Clean up landing page copy to be a punchy one-liner

- Re-render architecture composition diagram as SVG for reliability

- Move math derivations out of tutorials and into math.qmd with citations

- Add DGX Spark to Silicon Zoo
2026-03-07 18:37:06 -05:00
Vijay Janapa Reddi
a78f1bd8b0 feat(mlsysim): add documentation site, typed registries, and 6-solver core
Complete MLSYSIM v0.1.0 implementation with:

- Documentation website (Quarto): landing page with animated hero
  and capability carousel, 4 tutorials (hello world, LLM serving,
  distributed training, sustainability), hardware/model/fleet/infra
  catalogs, solver guide, whitepaper, math foundations, glossary,
  and full quartodoc API reference
- Typed registry system: Hardware (18 devices across 5 tiers),
  Models (15 workloads), Systems (fleets, clusters, fabrics),
  Infrastructure (grid profiles, rack configs, datacenters)
- Core types: Pint-backed Quantity, Metadata provenance tracking,
  custom exception hierarchy (OOMError, SLAViolation)
- SimulationConfig with YAML/JSON loading and pre-validation
- Scenario system tying workloads to systems with SLA constraints
- Multi-level evaluation scorecard (feasibility, performance, macro)
- Examples, tests, and Jetson Orin NX spec fix (100 → 25 TFLOP/s)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-07 15:59:51 -05:00