5 Commits

Author SHA1 Message Date
Vijay Janapa Reddi
4206f3171b docs(mlsysim): add CLI instructions to getting started guide 2026-03-13 08:47:54 -04:00
Vijay Janapa Reddi
c9b09d5bf4 docs(root): add MLSysim to top-level ecosystem links 2026-03-13 08:26:06 -04:00
Vijay Janapa Reddi
a07a664185 refactor(mlsysim): overhaul solver API, results, and test suite
Restructure solver.py with prompt caching in ServingSolver, improve
results dataclass, update pipeline chaining, and modernize test suite.
Replace hardcoded hardware values with constants throughout.
2026-03-12 16:04:51 -04:00
Vijay Janapa Reddi
289e018223 refactor(mlsysim): typed results, wall taxonomy, and engineering naming
- Add typed Pydantic result models (Layer A) replacing dict returns
- Add canonical Wall taxonomy registry (walls.py) as single source of truth
- Add Pipeline composer (Layer C) for solver chaining with explain()/run()
- Rename domains: Metabolism→Node, Skeleton→Data, Mind→Algorithm, World→Fleet, Meta→Analysis
- Rename MetabolismSolver→EfficiencySolver and MetabolismResult→EfficiencyResult
- Update all solver classes with walls tuple referencing canonical wall numbers
- Convert all dict access patterns to typed attribute access across codebase
2026-03-12 16:04:50 -04:00
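
Note on the typed-results change above: the move from dict returns to typed attribute access might look roughly like the sketch below. It uses a stdlib frozen dataclass as a stand-in for the Pydantic models named in the commit. The class name `SketchResult`, its fields, and both functions are hypothetical and are not taken from mlsysim.

```python
from dataclasses import dataclass, FrozenInstanceError
from typing import Tuple


@dataclass(frozen=True)
class SketchResult:
    """Hypothetical typed result; field names are illustrative only."""
    latency_ms: float
    feasible: bool
    walls: Tuple[int, ...] = ()  # assumed: wall numbers the solver touches


def solve_as_dict() -> dict:
    # Dict style: callers index by string key, so a misspelled key
    # only fails at runtime with a KeyError.
    return {"latency_ms": 12.5, "feasible": True, "walls": (1, 2)}


def solve_as_typed() -> SketchResult:
    # Typed style: attribute access that type checkers and IDEs can verify.
    return SketchResult(latency_ms=12.5, feasible=True, walls=(1, 2))


old = solve_as_dict()
new = solve_as_typed()
```

With this shape, `old["latency_ms"]` becomes `new.latency_ms`, and the frozen dataclass rejects accidental mutation of a returned result.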
Vijay Janapa Reddi
a78f1bd8b0 feat(mlsysim): add documentation site, typed registries, and 6-solver core
Complete MLSYSIM v0.1.0 implementation with:

- Documentation website (Quarto): landing page with animated hero
  and capability carousel, 4 tutorials (hello world, LLM serving,
  distributed training, sustainability), hardware/model/fleet/infra
  catalogs, solver guide, whitepaper, math foundations, glossary,
  and full quartodoc API reference
- Typed registry system: Hardware (18 devices across 5 tiers),
  Models (15 workloads), Systems (fleets, clusters, fabrics),
  Infrastructure (grid profiles, rack configs, datacenters)
- Core types: Pint-backed Quantity, Metadata provenance tracking,
  custom exception hierarchy (OOMError, SLAViolation)
- SimulationConfig with YAML/JSON loading and pre-validation
- Scenario system tying workloads to systems with SLA constraints
- Multi-level evaluation scorecard (feasibility, performance, macro)
- Examples, tests, and Jetson Orin NX spec fix (100 → 25 TFLOP/s)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-07 15:59:51 -05:00
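
Note on the exception hierarchy above: a feasibility check built on `OOMError` and `SLAViolation` might look roughly like the following. Only those two exception names come from the commit message. The base class `SimulationError`, the function `check_feasibility`, and all parameters are assumptions made for illustration, not the library's actual API.

```python
class SimulationError(Exception):
    """Hypothetical common base class (name assumed for illustration)."""


class OOMError(SimulationError):
    """Raised when a workload does not fit in device memory."""


class SLAViolation(SimulationError):
    """Raised when predicted latency exceeds the SLA target."""


def check_feasibility(required_gb: float, available_gb: float,
                      latency_ms: float, sla_ms: float) -> bool:
    # Memory is checked first: a model that does not fit cannot be served,
    # so its latency is irrelevant.
    if required_gb > available_gb:
        raise OOMError(f"needs {required_gb} GB, device has {available_gb} GB")
    if latency_ms > sla_ms:
        raise SLAViolation(f"{latency_ms} ms exceeds SLA of {sla_ms} ms")
    return True
```

A shared base class lets callers catch every simulation failure with one `except` clause while still distinguishing memory failures from SLA failures when needed.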