cs249r_book

mirror of https://github.com/harvard-edge/cs249r_book.git synced 2026-05-07 18:18:42 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	c824ac6ed1	refactor(staffml): retire prod static-fallback; opt-in dev-only (#1598 ) The bundled corpus.json was serving as a prod safety net behind the Cloudflare Worker. Post-cutover the Worker has been the real data source, and the static path was silently degrading rather than helping (corpus.json is a generated artifact whose prose `details` are blank in corpus-summary.json). This change: - Stops emitting corpus.json in the publish-live workflow - Removes the Worker-error fallback in getQuestionFullDetail — errors now propagate to useFullQuestion and the UI shows a "details unavailable" banner instead of silently filling blanks - Drops the localhost auto-trigger in shouldUseStaticDetails — the static path now requires explicit NEXT_PUBLIC_VAULT_FALLBACK=static - Switches taxonomy.ts to corpus-summary.json (was corpus.json) - Rewrites the publish-live smoke tests against corpus-summary.json - Collapses validate-vault.py to sparse-only (per-question deep validation lives in `vault check --strict`) Static-fallback remains as an OPT-IN local-dev affordance: set NEXT_PUBLIC_VAULT_FALLBACK=static and run `vault build --legacy-json` to materialize corpus.json. The Function-constructor dynamic import keeps Turbopack from requiring corpus.json at build time. useFullQuestion hook signature changed from `Question \| undefined` to `{ question, status }`. Callers updated: practice and plans pages (both render an amber "details unavailable" banner when status is 'error'). Deleted dead cutover scaffolding: corpus-source.ts (router with no UI consumers), corpus-vault.ts (worker-only mirror, never wired up), useVaultQuestion.ts (unused migration hook), vault-fallback.ts (only consumer was corpus-source.ts). Deleted stale docs: staffml/scripts/DEPRECATED.md, vault-cli/docs/ CUTOVER_QA.md, three vault/docs/RESUME_PLAN_*.md. Verified locally: tsc clean, vitest 37/37, next build produces all 15 static routes. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 18:47:03 -04:00
Vijay Janapa Reddi	ece763b785	fix(ci,staffml): align validate-vault with corpus as build artifact - Drop staffml-validate-vault from pre-commit: full per-question checks need vault build --legacy-json; book and unrelated pre-commit runs no longer fail on a missing gitignored corpus.json. - validate-vault.py: sparse mode (taxonomy + manifest) when corpus.json is absent; full path unchanged when the bundle exists locally or after build. - staffml-validate-dev smoke job: install vault-cli, run vault build --legacy-json before validate-vault and corpus invariants (same contract as preview/publish), raise job timeout for the build step.	2026-04-26 11:12:00 -04:00
Vijay Janapa Reddi	542aaf95d2	cleanup(vault): release-ready Phase A — schema hardening + lint calibration + chain repair Closes the cleanup arc (A.1–A.10 in RESUME_PLAN_RELEASE.md). Every gate is now green: vault check --strict, vault lint, vault doctor, vault codegen --check, staffml validate-vault, Playwright (9/9), tsc. A.1 mobile-1962.svg: renamed `Edge` → `RegEdge` in graphviz source (`Edge` is a reserved keyword); SVG renders cleanly. Also fixed tinyml-1570.py (missing `import numpy as np`) which the new failure log surfaced. A.2 render_visuals.py: structured per-ID failure log written to `_validation_results/render_failures.json` on every run; non-zero exit on any per-item crash; new `--fail-fast` and `--failure-log` CLI options. Replaces the prior silent-failure mode. A.3 LinkML visual schema: typed as a structured sub-schema. New `VisualKind` enum (svg only — `mermaid` was reserved but never shipped, dropped to keep the enum honest). Path regex tightened to `^[a-z0-9-]+\.svg$`. Alt minimum length 10, caption required minimum length 5. TypeScript Visual interface + Question.visual field added to staffml-vault-types/index.ts. A.4 Pydantic Visual + Question validators: - Visual.kind hard-rejects anything but `svg` - Visual.path enforces the new regex - Visual.alt min 10 chars, caption required min 5 chars - Question.model_validator: visual.path MUST resolve to a real file under interviews/vault/visuals/<track>/. Skipped in production deploys where the working tree is absent. A.5 Registry repair + doctor split: - tools: repair_registry.py appended 5,269 missing IDs (the rename refactor at `8a5c3ff3c` left the append-only registry unsynced; this brings disk-coverage to 100%). Header block in id-registry.yaml documents the rebuild rationale. - doctor.py: split symmetric `registry-integrity` check into `disk-coverage` (HARD FAIL if any disk YAML id is unregistered) and `registry-history` (INFO ONLY for retired ids — the registry is by design an audit log, retired ids are normal). Pre-existing `_check_schema_version` bug (`versions == {1}` vs string `"1.0"`) fixed. A.6 Lint calibration via 4-expert consensus + bloom-canonical reclassification: - Spawned 4 experts (Vijay Reddi, Chip Huyen, Jeff Dean, education-reviewer) on 42 disputed (zone, level) pairs; consensus-builder aggregated to 15 valid / 19 invalid / 8 borderline. - User arbitrated 8 borderlines: 7 widen / 1 reclassify. - Built ZONE_BLOOM_AFFINITY matrix (Education-Reviewer's idea): every zone admits its dominant Bloom verb + adjacent verbs, rejects clear hierarchy violations. - reclassify_zone_bloom_mismatch.py applied 576 deterministic zone fixes via BLOOM_CANONICAL_ZONE mapping (e.g. fluency+analyze → analyze, recall+analyze → analyze, evaluation+apply → implement). - Question.model_validator(_zone_bloom_compatible): hard-rejects future zone-bloom mismatches at write time. Generated drafts can no longer ship a self-contradicting classification. - ZONE_LEVEL_AFFINITY widened per consensus + arbitration + post-reclassification adjustments. Lint warnings: 1,308 → 0. A.7 Chain integrity: - repair_chains.py: drops chain refs when a chain has <2 published members (chain ceases to exist), renumbers all members of any chain whose positions are non-sequential / duplicated / non-monotonic-by-level. Sort key: level ascending, then old position, then qid (deterministic). - validate-vault.py: relaxed sequential check to unique-positions check. Position gaps from mid-chain deletions are normal; what matters is uniqueness + bloom-monotonicity (vault check --strict enforces both from YAML source-of-truth). A.8 Practice page visual + zoom modal: - QuestionVisual.tsx: wraps the `<img>` in `<Zoom>` from react-medium-image-zoom (4 KB). Click image → fullscreen `<dialog data-rmiz-modal>`; ESC closes. Added test-id `question-visual-img` for stable selector. - New Playwright test: 9th in the suite, deep-links cloud-4492, asserts the dialog opens on click and closes on ESC. - TypeScript: removed `mermaid` from local Visual types in corpus.ts and corpus-vault.ts; tsc clean. A.9 All gates green: - vault check --strict: 0 errors / 0 invariant failures - vault lint: 0 errors / 0 warnings (was 1,308 warnings) - vault codegen --check: artifacts in sync (hash baseline updated) - vault doctor: 0 fails (registry-history info, git-state warn on uncommitted state-pre-this-commit) - staffml validate-vault: 0 errors / 0 warnings, deployment-ready - Playwright: 9/9 pass (was 8; +zoom modal test) - render_visuals: 0 errors (was 2 silent failures pre-A.2) - tsc: clean Distribution after reclassification: 9,544 published unchanged; 576 items moved zone via bloom-canonical mapping (full per-item report at /tmp/reclassify_changes.csv). Chain count 879 → 850 after orphan-singleton drops. release_hash updated. Carry-forward to next session (Phase B): - Priority gap closure for parallelism cells + global L4-L6+ (the run that produced this corpus did not close the targeted cells; B.3 needs specialized prompts per cell-class) - 120 NEEDS_FIX items from coverage_loop/20260425_150712/ still carry judge fix_suggestions; spawn fix-agent in Phase C	2026-04-25 15:12:51 -04:00
Vijay Janapa Reddi	ec26d2a25c	fix(vault): critical CI + paper accuracy fixes - Remove sync-vault.py calls from staffml-publish-live.yml and publish-all-live.yml (script was deleted, would break production deploy) - Fix vault export-paper: numareas=13 (was 87), numzones=11 (was 8), numedges=57 (was hardcoded 123) — now queries taxonomy/zones tables - Add areas, edges, zones_count to corpus_stats.json - Delete dead corpus-index.json (stale 8,053 count, no imports) - Fix validate-vault.py warning to reference vault build instead of deleted generate-manifest.py	2026-04-18 12:54:11 -04:00
Vijay Janapa Reddi	4d60732a8c	fix(staffml): handle list-type chain_ids and dict chain_positions New vault questions use a richer schema: chain_ids as list, chain_positions as dict mapping chain_id to position. Both generate-manifest.py and validate-vault.py now handle both old (scalar) and new (list/dict) formats.	2026-04-01 09:37:46 -04:00
Vijay Janapa Reddi	a0214cb8a1	staffml: add vault integrity validation to CI, disable old Quarto workflow - validate-vault.py: schema checks, uniqueness, taxonomy consistency, chain integrity, manifest sync, distribution sanity - Runs in both dev preview and live publish workflows before deploy - Errors block deployment, warnings are logged for review - Disabled old interviews-preview-dev Quarto trigger (no longer exists)	2026-03-25 15:25:57 -04:00

6 Commits