cs249r_book

mirror of https://github.com/harvard-edge/cs249r_book.git synced 2026-05-08 02:28:25 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	542aaf95d2	cleanup(vault): release-ready Phase A — schema hardening + lint calibration + chain repair Closes the cleanup arc (A.1–A.10 in RESUME_PLAN_RELEASE.md). Every gate is now green: vault check --strict, vault lint, vault doctor, vault codegen --check, staffml validate-vault, Playwright (9/9), tsc. A.1 mobile-1962.svg: renamed `Edge` → `RegEdge` in graphviz source (`Edge` is a reserved keyword); SVG renders cleanly. Also fixed tinyml-1570.py (missing `import numpy as np`) which the new failure log surfaced. A.2 render_visuals.py: structured per-ID failure log written to `_validation_results/render_failures.json` on every run; non-zero exit on any per-item crash; new `--fail-fast` and `--failure-log` CLI options. Replaces the prior silent-failure mode. A.3 LinkML visual schema: typed as a structured sub-schema. New `VisualKind` enum (svg only — `mermaid` was reserved but never shipped, dropped to keep the enum honest). Path regex tightened to `^[a-z0-9-]+\.svg$`. Alt minimum length 10, caption required minimum length 5. TypeScript Visual interface + Question.visual field added to staffml-vault-types/index.ts. A.4 Pydantic Visual + Question validators: - Visual.kind hard-rejects anything but `svg` - Visual.path enforces the new regex - Visual.alt min 10 chars, caption required min 5 chars - Question.model_validator: visual.path MUST resolve to a real file under interviews/vault/visuals/<track>/. Skipped in production deploys where the working tree is absent. A.5 Registry repair + doctor split: - tools: repair_registry.py appended 5,269 missing IDs (the rename refactor at `8a5c3ff3c` left the append-only registry unsynced; this brings disk-coverage to 100%). Header block in id-registry.yaml documents the rebuild rationale. - doctor.py: split symmetric `registry-integrity` check into `disk-coverage` (HARD FAIL if any disk YAML id is unregistered) and `registry-history` (INFO ONLY for retired ids — the registry is by design an audit log, retired ids are normal). Pre-existing `_check_schema_version` bug (`versions == {1}` vs string `"1.0"`) fixed. A.6 Lint calibration via 4-expert consensus + bloom-canonical reclassification: - Spawned 4 experts (Vijay Reddi, Chip Huyen, Jeff Dean, education-reviewer) on 42 disputed (zone, level) pairs; consensus-builder aggregated to 15 valid / 19 invalid / 8 borderline. - User arbitrated 8 borderlines: 7 widen / 1 reclassify. - Built ZONE_BLOOM_AFFINITY matrix (Education-Reviewer's idea): every zone admits its dominant Bloom verb + adjacent verbs, rejects clear hierarchy violations. - reclassify_zone_bloom_mismatch.py applied 576 deterministic zone fixes via BLOOM_CANONICAL_ZONE mapping (e.g. fluency+analyze → analyze, recall+analyze → analyze, evaluation+apply → implement). - Question.model_validator(_zone_bloom_compatible): hard-rejects future zone-bloom mismatches at write time. Generated drafts can no longer ship a self-contradicting classification. - ZONE_LEVEL_AFFINITY widened per consensus + arbitration + post-reclassification adjustments. Lint warnings: 1,308 → 0. A.7 Chain integrity: - repair_chains.py: drops chain refs when a chain has <2 published members (chain ceases to exist), renumbers all members of any chain whose positions are non-sequential / duplicated / non-monotonic-by-level. Sort key: level ascending, then old position, then qid (deterministic). - validate-vault.py: relaxed sequential check to unique-positions check. Position gaps from mid-chain deletions are normal; what matters is uniqueness + bloom-monotonicity (vault check --strict enforces both from YAML source-of-truth). A.8 Practice page visual + zoom modal: - QuestionVisual.tsx: wraps the `<img>` in `<Zoom>` from react-medium-image-zoom (4 KB). Click image → fullscreen `<dialog data-rmiz-modal>`; ESC closes. Added test-id `question-visual-img` for stable selector. - New Playwright test: 9th in the suite, deep-links cloud-4492, asserts the dialog opens on click and closes on ESC. - TypeScript: removed `mermaid` from local Visual types in corpus.ts and corpus-vault.ts; tsc clean. A.9 All gates green: - vault check --strict: 0 errors / 0 invariant failures - vault lint: 0 errors / 0 warnings (was 1,308 warnings) - vault codegen --check: artifacts in sync (hash baseline updated) - vault doctor: 0 fails (registry-history info, git-state warn on uncommitted state-pre-this-commit) - staffml validate-vault: 0 errors / 0 warnings, deployment-ready - Playwright: 9/9 pass (was 8; +zoom modal test) - render_visuals: 0 errors (was 2 silent failures pre-A.2) - tsc: clean Distribution after reclassification: 9,544 published unchanged; 576 items moved zone via bloom-canonical mapping (full per-item report at /tmp/reclassify_changes.csv). Chain count 879 → 850 after orphan-singleton drops. release_hash updated. Carry-forward to next session (Phase B): - Priority gap closure for parallelism cells + global L4-L6+ (the run that produced this corpus did not close the targeted cells; B.3 needs specialized prompts per cell-class) - 120 NEEDS_FIX items from coverage_loop/20260425_150712/ still carry judge fix_suggestions; spawn fix-agent in Phase C	2026-04-25 15:12:51 -04:00
Vijay Janapa Reddi	d6c7fe5685	feat(vault): batched Gemini generator + coverage-gap analyzer Two new scripts and a schema/renderer cleanup: 1. analyze_coverage_gaps.py: quantifies imbalance across track × zone × level × competency-area, ranks weakest cells by priority weight, and emits both a Markdown report and a machine-readable JSON plan that the batched generator can consume. Critically, this surfaces gaps like tinyml/parallelism (15 vs ~100 expected), mobile/parallelism, global L4-L6+ (essentially empty), and the two missing visual archetypes (kv-cache-management, memory-hierarchy-design). 2. gemini_cli_generate_questions.py: refactored to BATCH cells per API call (default 12 cells/call, max 25 for visual). At 250 calls/day, this scales the generation budget from 250 q/day to 3,000 q/day while making auto-balanced selections across tracks × topics × zones × levels via round-robin. Replaces the wasteful 1-q-per-call pattern. 3. render_visuals.py: source format is now inferred from filesystem (presence of <id>.dot or <id>.py next to <id>.svg) rather than from a YAML field. The Pydantic schema is unchanged, so generated YAMLs stay valid. Plus the 9 visual question YAMLs are repaired: provenance set to 'llm-draft' (a valid enum value) and source_format dropped from the visual block (Pydantic forbids extra fields).	2026-04-25 09:06:49 -04:00
Vijay Janapa Reddi	612885a952	refactor(vault): visual schema aligns with website + 5 more Gemini-generated visuals Schema fix: visual.kind is always 'svg' (the format the website ships) and visual.path points to that asset. The build-pipeline format is recorded as optional metadata in visual.source_format ('dot' \| 'matplotlib' \| 'hand'), which the website ignores. This separates "what users render" from "how maintainers built it". Source files live next to the SVG by naming convention; the renderer infers the path from the YAML's source_format hint without a dedicated source field. Five new visual exemplars generated by Gemini 3.1 Pro Preview, covering diverse archetypes: - cloud-2849 (DOT): incast-bottleneck topology - cloud-2850 (DOT): leaf-spine fabric with 2:1 oversubscription - cloud-2851 (matplotlib): bandwidth bar chart for data pipeline diagnosis - cloud-2852 (matplotlib): checkpoint/recovery timeline with RPO/RTO - edge-0972 (matplotlib): Poisson vs bursty queueing curves Plus the four prior exemplars (cloud-2846, 2847, 2848, tinyml-0816) re-emitted under the new schema. cloud-visual-001 unchanged — already had the correct shape. ARCHITECTURE.md rewritten to document the simpler three-layer separation (website / build / authoring).	2026-04-25 08:57:26 -04:00
Vijay Janapa Reddi	38e5c99f17	feat(vault): multi-format visual question architecture (DOT + matplotlib + SVG) ARCHITECTURE.md establishes that visuals are a property of any question, not a separate category. Three supported formats let the layout engine do the work: DOT for graph topology, matplotlib for curves and Gantt charts, hand SVG for custom layouts. render_visuals.py is the single entry point that dispatches by visual.kind, runs the appropriate tool, and normalizes the rendered SVG to the book's font stack. It is idempotent and supports --dry-run. Three exemplars cover the three formats: - cloud-2846 (DOT): Tree AllReduce on 8 ranks — auto-laid-out topology - cloud-2847 (matplotlib): Queueing hockey-stick curve with SLO line - cloud-2848 (matplotlib): Pipeline-bubble Gantt for GPipe schedule All three are status:draft pending math review and promotion in a later batch. Existing cloud-visual-001 remains unchanged as the canonical hand-SVG exemplar.	2026-04-25 08:42:59 -04:00

4 Commits