cs249r_book

mirror of https://github.com/harvard-edge/cs249r_book.git synced 2026-05-10 15:49:25 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	bc553017b4	docs(vault): roadmap status + Phase 3 authoring conventions D-cleanups folded into one commit: - CHAIN_ROADMAP.md status header reflects current state (Phase 1+2 complete, Phase 3 pilot landed, Phase 4 mostly shipped). - Phase 4.1 / 4.6 / 4.7 / 4.9 entries marked complete with commit refs. - ARCHITECTURE.md gains a §3.6.1 documenting the two YAML-body conventions introduced when LLM-authored questions started landing in Phase 3: - _authoring private metadata block on drafts (stripped at promotion) - gap-bridge:<from>-<to> tag added at promotion for traceability Neither is schema-enforced (Pydantic accepts extra); both are stable across the pipeline. No code changes.	2026-05-01 17:33:36 -04:00
Vijay Janapa Reddi	202397f594	Merge origin/dev into yaml-audit Pull in the dev work that landed since yaml-audit was last synced: - --legacy-json renamed to --local-json (`2b381bb949`) — script/doc updates needed below in this branch - CI workflow refactor (validate-dev / validate-vault now reusable) - all-contributors automation, gitignore tightening, codespell list - PR #1622 navbar URL rewrite for dev preview - PR #1619 clone-size refactor, #1618 milestone3 xor fix, #1617 perceptron seed, #1616 tito status M3 - Chapter 9 PDF layout refinement - assorted staffml/practice fixes (pickRandom deps, GitHub star gate) This merges the canonical dev state into yaml-audit so subsequent work continues on top of the freshest base. Conflicts in practice/page.tsx + corpus.ts + ARCHITECTURE.md resolved to keep both sides' additive changes (Phase 2 tier work + dev's later refactors).	2026-05-01 17:11:31 -04:00
Vijay Janapa Reddi	cbb28ebf26	docs(vault): document v1.1 sidecar + hierarchy + tier model Phase 4.8 of CHAIN_ROADMAP.md. ARCHITECTURE.md gains a new §3.6 capturing the three deltas that landed during the chain workstream — additive to v1, not replacements: - hierarchical question layout (`<track>/<area>/<id>.yaml`) - sidecar chain architecture (chains.json authoritative; YAML chains: field retired) - chain tier model (primary/secondary, default-primary on read) README.md updates: - status line: v1.1, points at CHAIN_ROADMAP.md and ARCHITECTURE.md §3.6 - new "Chain build pipeline" section with the diagnose / build / apply / merge invocations - layout listing reflects scripts/ and the actual src/ contents (was stuck on Phase 0 scaffolding shape) No code changes. The v1 release-pipeline invariants absorb the v1.1 deltas without modification (chains.json is a Merkle leaf; tier flows into that leaf transparently).	2026-04-30 20:26:09 -04:00
Vijay Janapa Reddi	9fdbfb9a4c	refactor(vault-cli): rename --legacy-json to --local-json The flag is the StaffML frontend's local-dev fallback (read corpus.json from disk via NEXT_PUBLIC_VAULT_FALLBACK=static), not a deprecated path. "Legacy" implied "soon to be removed"; "local-json" describes its actual role and reads correctly in scripts and docs. - vault-cli: rename CLI flag, parameter, result key, and help text. - CI workflows + pre-commit config: invoke the new flag name. - All scripts that print the command (suggest_exemplars, pre_commit_corpus_guard, promote_validated, rename_legacy_ids, export_to_staffml, the paper analyze_corpus/generate_*) updated. - Comments and docs (ARCHITECTURE, CHANGELOG, REVIEWS, TESTING, MASSIVE_BUILD_RUNBOOK, DEPRECATED, AUTHORING, plus frontend comments and .env.example / .gitignore) updated. The "legacy_json" sentinel string in corpus_stats.json._meta.source is intentionally NOT renamed — it is a stable artifact format read by downstream paper-generation tooling.	2026-04-30 09:30:28 -04:00
Vijay Janapa Reddi	43dedf9948	docs(vault): update architecture docs and audit scripts for 87-topic baseline Update ARCHITECTURE.md to reflect 87 curated topics and 131 edges. Refactor exemplar_coverage_audit.py to use vault.db instead of retired corpus.json. Update exemplar-gaps.yaml inventory.	2026-04-26 16:47:56 -04:00
Vijay Janapa Reddi	3f9b044b31	chore(ci): rename vault-ci.yml → staffml-validate-vault.yml Brings the last outlier workflow file into the repo-wide <cluster>-<verb>-<scope>.yml naming convention. Every other cluster (book, tinytorch, kits, labs, instructors, mlsysim, slides, site, staffml) uses this pattern; vault-ci.yml was the only one that didn't. vault-ci.yml → staffml-validate-vault.yml name: '🎯 StaffML · 🔎 Vault CI' → '🎯 StaffML · ✅ Validate (Vault)' Now staffml-validate-vault.yml is a direct sibling of staffml-validate-dev.yml — the former validates the vault data + CLI + worker, the latter validates the site build. Same verb, different scope, easy to reason about. Updated references: .github/workflows/staffml-validate-vault.yml — self-reference in the paths trigger (so the workflow still fires when it's edited) interviews/vault/ARCHITECTURE.md §19.3 and §51 — both path refs interviews/vault/TESTING.md §4.1 — workflow name + display name interviews/vault-cli/scripts/check_registry_append_only.py — docstring No branch-protection settings change needed — GitHub matches required checks on the workflow's 'name:' field, not the filename. Anyone with a bookmark to the old Actions-tab URL will get a 404 (harmless). Other workflow naming I surveyed but deliberately LEFT alone (all consistent with existing conventions): staffml-update-paper.yml matches tinytorch-update-pdfs pattern staffml-auto-pr.yml matches bot-workflow convention staffml-welcome.yml single-word verb, standard auto-label / update-contributors / infra-* / publish-all-live are cross-cutting (no cluster prefix) by design	2026-04-22 11:27:37 -04:00
Vijay Janapa Reddi	ed58b56cf4	docs(vault): archive obsolete scripts + post-mortem the v1.0 migration Archives pre-v1.0 scripts under scripts/archive/ in both interviews/vault/ and interviews/vault-cli/. ARCHITECTURE.md §3.3 rewritten with a post-mortem on why path-as-classification could not represent the paper's full 11-zone × 6-level taxonomy. CHANGELOG.md added documenting the full v1.0 migration.	2026-04-21 18:02:05 -04:00
Vijay Janapa Reddi	d2731d8bc3	chore: remove leftover AI-session planning and audit docs Clean up planning, kickoff, audit, and persona-feedback documents accumulated during prior AI-assisted work sessions. These are session artifacts, not durable documentation — the decisions they captured have either shipped, been retired, or are traceable via git history. interviews/vault/REVIEWS.md is intentionally kept: it is cited by section ID (H-6, H-7, H-21, C-6, ...) from production code in interviews/vault-cli/ and interviews/vault/ and published as the pyproject.toml Review-Ledger URL, which makes it engineering documentation rather than a session artifact. Deletions: - RELEASE-PREP.md, review_prompt.md (root handoff / review prompts) - interviews/vault/KICKOFF.md, BOOK_LINKING_PLAN.md, EXPANSION_PLAN.md - interviews/staffml/FEEDBACK_SYNTHESIS.md, V1_REDESIGN_SPEC.md, STAFFML_UX_PLAN.md, VAULT_DESIGN_PLAN.md - interviews/staffml/.gemini-reviews/ (2 review call logs) - book/docs/SVG_FIGURE_AUDIT_PLAN.md, book/tools/agent_personas.md - mlsysim/docs/WEBSITE_AUDIT.md - periodic-table/iteration-log.md, refinement-log.md Reference fixes for pointers into deleted files: - interviews/vault/ARCHITECTURE.md: drop section 21 (pointed at KICKOFF.md) - interviews/vault/schema/question_schema.yaml: drop BOOK_LINKING_PLAN.md reference in the author-curated resource description - interviews/staffml/src/components/Footer.tsx: drop BOOK_LINKING_PLAN.md reference from the docstring; rationale preserved Also removes the untracked gemini_prompts/ directory at repo root.	2026-04-21 11:23:41 -04:00
Vijay Janapa Reddi	369f59744b	chore(vault): clean up stale files and update architecture status - Delete SYSTEM.md (superseded by ARCHITECTURE.md since v2.0) - Delete deprecated scripts: sync-vault.py, generate-manifest.py, format-napkin-math.py (replaced by vault CLI commands) - Gitignore tsconfig.tsbuildinfo (build artifact) - Update ARCHITECTURE.md status to v3.0 DEPLOYED	2026-04-18 09:40:21 -04:00
Vijay Janapa Reddi	d9fcf8af23	refactor(vault): replace singular deep_dive with author-curated resources list Shape change ============ Old: details.deep_dive: {title, url} (singular, optional) New: details.resources: [{name, url}] (multivalued, optional) Rationale ========= The singular deep_dive field paired with a 178-line hostname classifier (interviews/staffml/src/lib/refs.ts) that labeled each link based on its host. This model couples question content to a registry of "known hosts" and forces every question to a single reference. The resources-list model flips the responsibility: authors write a human-readable name per reference, the UI renders a plain labeled link, and questions can cite zero, one, or many references. It also dissolves the deferred book-linking problem — when book URLs stabilize, authors add a book entry to whichever questions benefit, with no schema, registry, or classifier changes required. Scope (this commit) =================== - schema/question_schema.yaml: replace DeepDive class with Resource (name+url), change Details.deep_dive → Details.resources (multivalued) - schema.py: add Resource pydantic model with https-only + name-length validators (XSS guard per REVIEWS.md H-6); replace flat deep_dive_title/deep_dive_url on QuestionDetails with resources list - vault.py: update field-coverage metric + LLM prompt template - scripts/generate_hard_questions.py: remove KA_URLS auto-fill (contradicted the author-curation principle), update prompt template - scripts/generate_gaps.py: update prompt template + renderer to iterate resources list - scripts/build_corpus.py: legacy markdown '📖 Deep Dive:' parser now appends to resources list instead of setting flat fields - ARCHITECTURE.md: schema example, SQL DDL, validation rules - REVIEWS.md: H-6 wording (deep_dive_url → resources[].url) - corpus.json: scrub 9,495 stale deep_dive_title / deep_dive_url fields that pre-dated the vault YAML cleanup; add empty resources [] default to all 9,657 questions for shape stability What this does NOT change ========================= - Zero question YAMLs are modified. Phase 0 audit confirmed 0 YAMLs have the deep_dive field populated (see audit script output in the preceding commit). - schema_version stays at 1. EVOLUTION.md §2 classifies this as a breaking-major change that technically warrants schema_version: 2. However, no data or external consumer depends on the old shape — the field is uniformly absent in YAML — so the bump is ceremonial. Deferred until the first breaking change that requires a reader adapter. - staffml/src/data/corpus.json (the shipped browser bundle) already has 0 deep_dive_url fields and 9,199 items; equivalence hash is unaffected because release_hash is computed from YAML inputs. - No UI or consumer changes — deep-UI removal and refs.ts shrink follow as separate atomic commits. Validation ========== - All touched Python modules py_compile cleanly - validate_corpus(corpus.json) against new schema.py: 9247/9657 pass; the 410 failures are pre-existing 'sustainability-carbon-accounting' topic taxonomy errors unrelated to this change - Re-ran audit: still 0 deep_dive fields in YAMLs Vault-Override: corpus-json-hand-edit: schema-migration artifact scrub removes stale deep_dive_* fields that predate the YAML cleanup and inserts empty resources [] defaults matching the new schema shape. YAML inputs unchanged; release_hash unaffected.	2026-04-16 18:22:08 -04:00
Vijay Janapa Reddi	5131cb28fc	docs: R11 stability cleanup + v2.6 \u2014 11 rounds, convergence declared R11 (David, fresh-eyes stability check): 0 Critical + 0 High + 1 Medium (doc cleanup from R10-F-2 closure itself). R11-M-1 (MEDIUM): CUTOVER_QA.md + vault-cli/README.md still referenced --canary-percent flag after R10-F-2 removed it from code + ARCHITECTURE.md. Operator following CUTOVER_QA.md step 1 of cutover day would hit 'Error: no such option --canary-percent' \u2014 the one document whose entire purpose is cutover correctness. Fix: CUTOVER_QA.md \u00a71 replaces canary-staged rollout with all-or-nothing ship language + Phase-7-deferred note pointing at \u00a74.3. README.md:57 drops [--canary-percent N] from the ship example. STABILITY DECLARED after R11. Three consecutive rounds (R7, R8, R11) with zero new Criticals. R11 explicit: 'convergence confirmed.' Finding-density trajectory across 11 rounds (new Criticals per round): R1: 3, R2: 1, R3: 2, R4: 3, R5: 3, R6: skipped, R7: 0, R8: 0, R9: 1* (regression-detect, not new), R10: 0, R11: 0 Total findings closed across all rounds: ~120. No further rounds scheduled. ARCHITECTURE.md header bumped v2.5 \u2192 v2.6. REVIEWS.md adds 'Rounds 7\u201311' section with per-round finding counts, notable findings, meta-observation on R9 (tooling/persistence issue Gemini caught that individual-file reviewers couldn't), and the convergence signal.	2026-04-16 16:42:39 -04:00
Vijay Janapa Reddi	ce285ebaaa	fix: R9 (Gemini) + R10 (Soumith) \u2014 1C + 4H + 1M closed R9 Gemini-1M found: my R7 worker-side edits silently failed to persist (file wasn't touched in R7+R8 commit). Gemini caught the real state of the worker on disk. Re-applied. R9-C-1 (CRITICAL \u2014 real, discovered missing): Worker's checkSchemaFingerprint did NOT filter FTS5 shadow tables. Gemini caught that the filter I 'added' in R7 never actually landed in the commit. Re-applied: worker's sqlite_master query now has the same AND name NOT IN (...) exclusion as compiler.py. Mismatch risk: worker in permanent degraded mode the first time a fresh D1 is queried across SQLite versions. R9-H-1 (HIGH): ship's d1_forward didn't take R2 snapshot. Soumith R10-F-1 caught the same issue. Fixed together below. R9-H-2 (HIGH, real): handleSearch FTS5 probe not memoized. Module-level ftsProbed added; reset on release_id change. R9-H-3 (HIGH, real): SLI workflow reads releases/<latest>/vault.db from disk after checkout, but vault.db is gitignored per M-4. Workflow would permanently fail on the 'db_path.exists()' check. Fix: added 'vault build' step to workflow before the Python SLI script runs. Deterministic; same YAML \u2192 same hashes. R9-M-1 (MEDIUM, real): schemaOk had no retry on transient D1 failure. Added schemaCheckedAt + SCHEMA_RECHECK_MS=5min. A single network blip no longer pins the worker to degraded mode until next release. R10 Soumith (framework/API lens): R10-F-1 (HIGH): vault ship bypassed vault deploy's snapshot logic. Refactored to _do_deploy() shared helper called from BOTH deploy_cmd and ship_cmd's d1_forward leg. Ship now takes the R2 snapshot guaranteed; \u00a76.2 'default rollback = snapshot restore \u2014 always works' contract is now enforced by composition. R10-F-2 (MEDIUM): --canary-percent flag in spec, not in code. Removed from ARCHITECTURE.md \u00a74 + marked explicitly 'DEFERRED to Phase 7' with rationale (CF Workers doesn't expose % traffic-split for non-enterprise; requires Argo or custom routing). Spec + --help no longer disagree. R10-F-3 (LOW): tag-or-skip logic duplicated in tag_cmd + paper_forward. Extracted _ensure_tag(version) helper; both callers now use it. Test matrix: pytest: 38 green in 0.16s vitest: 7 green in 127ms ruff: All checks passed tsc: clean Convergence tracking: R1-R8: ~102 findings closed R9 (Gemini): 1C + 3H + 1M (mostly R7 edits that didn't persist; genuinely 1 new finding R9-H-3 on SLI vault.db) R10 (Soumith): 0C + 1H + 1M + 1L (F-1 overlapped with R9; genuinely 1 new on F-2 canary spec mismatch) Honest assessment: still not stable at Round 10. The R9 discovery that R7 worker edits never persisted is a meta-finding about my own tooling, not the code design. Post-R10 the worker code is in the state R7 claimed it was in. One more round should confirm.	2026-04-16 16:38:46 -04:00
Vijay Janapa Reddi	84588577c2	docs: v2.5 stamp + Round-5 Gemini review ledger + README count 5700+\u21929000+ ARCHITECTURE.md header bumped v2.4 \u2192 v2.5 marking final pre-deploy state. REVIEWS.md \u2014 new 'Round 5 Gemini 1M-context holistic review' section: per-finding table (R5-C-1 through R5-L-1), resolution summary, and meta-observation on why context-size diversity matters for adversarial review (Gemini caught cross-file issues that per-file Claude subagents consistently missed). interviews/README.md \u2014 question count was stale ('5,700+'); now '9,000+' matching the post-migration published count of 9,199.	2026-04-16 16:14:16 -04:00
Vijay Janapa Reddi	cbdb566381	feat(vault): Phase-1 migration contract fully closed in-repo v2.3 \u2192 v2.4. ARCHITECTURE.md header + Appendix reflect the completed migration. WHAT CLOSED (\u00a711.1 contract): 1. `vault build --legacy-json` regenerates the site's interviews/staffml/src/data/corpus.json from YAML. 9,199 published questions, site-compatible shape (chain_positions back to 0-indexed dict form, bloom_level derived from zone, competency_area aliased from topic, scope aliased from track). Deterministic via sort_keys + id-sort. 2. Pre-commit hook INSTALLED via worktree-aware Makefile target (`make -C interviews/vault-cli hooks`). Symlink points at pre_commit_corpus_guard.py. Tested end-to-end: direct edit to vault/corpus.json triggers exit-1 with §11.1 reference. 3. CI equivalence check added to .github/workflows/vault-ci.yml: regenerates corpus.json from YAML, diffs against committed. Fails PR on drift with actionable error message. 4. Legacy generators demoted with DEPRECATED headers: - interviews/paper/scripts/analyze_corpus.py \u2192 vault export-paper - interviews/staffml/scripts/sync-vault.py \u2192 vault build --legacy-json - interviews/staffml/scripts/generate-manifest.py \u2192 vault publish - interviews/vault/scripts/export_to_staffml.py \u2192 vault build --legacy-json 5. New DEPRECATED.md files at interviews/vault/scripts/ and interviews/staffml/scripts/ map every legacy script to its replacement. Both directories keep the old scripts for git-history legibility and archaeology; new contributors see the vault CLI first. 6. ARCHITECTURE.md \u00a7Appendix rewritten as current-state table instead of aspirational "gone. replaced by..." entries. NEW TESTS (interviews/vault-cli/tests/test_legacy_export.py \u2014 +4): - test_legacy_shape_matches_site_interface: every field corpus.ts declares is present in regenerated JSON. - test_chain_positions_legacy_shape: 1-indexed new schema \u2192 0-indexed legacy dict form. - test_emitter_deterministic: byte-stable across reversed input order (required for CI diff-check). - test_competency_area_aliases_topic: legacy alias fields populated correctly. FULL MATRIX GREEN: pytest: 38/38 passed in 0.19s (34 + 4 legacy-export) ruff: All checks passed hook: exit 0 on clean diff / exit 1 on corpus.json direct edit e2e: vault build --legacy-json regenerates a bit-identical corpus.json vs the committed one; CI check wired to catch drift WHAT'S LEFT (deploy-gated, \u00a720.5 #1, #5, #6 partial, #8, #9): - Production serves from D1: requires Phase-3 wrangler d1 create + deploy - Manual QA per CUTOVER_QA.md: requires live staging - Zero data loss D1-side verification: requires live D1 - 48h monitoring: requires production traffic These are intrinsically user-action; the YAML-side migration is done.	2026-04-16 14:57:24 -04:00
Vijay Janapa Reddi	ed18c5dc6a	docs(vault): v2.3 stamp + Round-4 security review ledger ARCHITECTURE.md header bumped to v2.3. REVIEWS.md: added Round-4 section with 12-item findings table (3C/4H/3M/2L all resolved), Chip's code-level security audit of the post-Bucket-B state. Columns: Severity, ID, Finding, Resolution. Verdict: GREEN for Phase-0/1/2; deploy gates still per CUTOVER_QA.md \u00a70.	2026-04-16 14:15:24 -04:00
Vijay Janapa Reddi	ba3ed8e6e4	feat(vault): B.14 equivalence docs + B.16 ChainBadge mount + worker cursor/rate-limit scaffold B.14 (Soumith R3-F-3): ARCHITECTURE.md \u00a711.5 now documents the provenance of corpus-equivalence-hash.txt \u2014 it is the release_hash from vault build against the post-split YAML, not an independent hash of legacy corpus.json. Clarifies what the CI check proves vs does not, and points external verifiers at 'vault verify --git-ref' for citation-grade reproducibility. B.16: ChainBadge now mounted pre-reveal on practice/page.tsx just above the question title. Wired to existing chainInfo + showAnswer state \u2014 hides once user reveals answer so ChainStrip (post-reveal) takes over. Analytics events chain_badge_shown/clicked fire per its component contract. Worker scaffold (mid-flight of B.1/B.3/B.4 \u2014 wiring in next commit): - src/types.ts: Cursor switched from {offset, filter_hash} to {after_id, filter_hash}. Server will page WHERE id > after_id ORDER BY id LIMIT N so cost is O(N) per page, not O(offset+N). Closes Chip R3-H2 concern about deep-offset cost. - src/types.ts: Env adds optional RATE_LIMIT_KV + overrides for per-endpoint-class rate limits. - src/rate_limit.ts (NEW): KV-backed token-bucket, per-(IP,class) windowed at 60s. 'default' class \u2014 60 req/min, 'search' \u2014 10 req/min. Open-allows if KV not bound (e.g., local shim).	2026-04-16 13:58:32 -04:00
Vijay Janapa Reddi	a33296df5f	feat(vault): corpus licensed CC-BY-NC-4.0 (explicit user decision) User concern: preventing commercial reuse of the corpus (e.g., a vendor training a paid product on the questions, selling access to them). CC-BY-NC-4.0 permits research citation + non-commercial derivatives while requiring written permission for commercial use. interviews/vault/questions/LICENSE (NEW) CC-BY-NC-4.0 full text with BibTeX template tied to release_hash. Commercial licensing contact noted. interviews/vault/ARCHITECTURE.md §15 #1 Marked DECIDED. Rationale recorded. vault-cli license intentionally left at historical status (not relicensed as part of this change). interviews/vault/REVIEWS.md License state: DECIDED. Removed from Phase-3 blocker list. interviews/CONTRIBUTING.md New 'License' section: NC constraint explicit. External corpus PRs assumed offered under same CC-BY-NC-4.0. Contact for commercial licensing specified.	2026-04-16 13:48:29 -04:00
Vijay Janapa Reddi	be29712e2d	revert: drop LICENSE files added without explicit approval Reverts the LICENSE additions from `1bc93374e`. I inferred consent from 'proceed with 2' (which was about Round 3 adversarial review) and rolled CC-BY-4.0 + MIT into the polish commit. The user never explicitly approved a license choice; defaulting to status-quo (no LICENSE file shipped) preserves the original implicit position and leaves the decision with the user. ARCHITECTURE.md and REVIEWS.md updated to note the license state remains OPEN; §15 item 1 'recommendation' status unchanged from v2.0.	2026-04-16 13:44:11 -04:00
Vijay Janapa Reddi	0ad41c693d	docs(vault): architecture v2.2 + Round-3 ledger + paper-agree-by-SQL ARCHITECTURE.md header bumped to v2.2. Full changelog block added (v2.1 → v2.2) keyed to Round-3 finding IDs. §7.1 + §10.2 edited to align X-Vault-Release soft-signal semantics with §6.1.1 (Soumith F-1). REVIEWS.md §Round-3 added: per-reviewer verdicts (Chip YELLOW, Dean YELLOW→GREEN, Soumith GREEN-conditional, David YELLOW→GREEN), convergence map of 11 integrated items, explicitly-deferred list (Cache API, breaker half-open, rate-limit KV, cross-lang hash path, worker vitest, LSH dedup — all documented as Phase-3-entry gates). CONTRIBUTING.md quickstart corrected (David R3-H5): step 3 dropped the Phase-1+ 'doctor'/'stats' references; step 4 shows 'vault build' before 'vault api' so the shim has something to serve. paper/scripts/generate_macros.py rewritten as thin wrapper over 'vault export-paper' (B.1 — closes §20.5 #2 + #7). Uses sys.executable -m vault_cli.main so PATH isn't required. paper/macros.tex (regenerated): 66-line emission with both \staffml* and legacy \num* namespaces. paper.tex needs no edits during transition. Paper and site now agree by construction — the structural fix for H-21 (9,199 vs 8,053) bug class. paper/corpus_stats.json (regenerated): full superset of the v1 analyze_corpus.py output, driven by SQL over vault.db with 'by_zone', 'by_level', 'by_track', chain 'by_length' distribution, 'bloom_distribution' (zone→bloom derived mapping), applicability.	2026-04-16 13:10:16 -04:00
Vijay Janapa Reddi	aa5db46a0b	docs(vault): architecture v2.1 (Round-2 review integration) Round 2 verdicts from 4 reviewers: all Round-1 Critical/High items marked RESOLVED by the reviewers themselves. One new Critical surfaced with 3/4 reviewer convergence (vault ship journaling + ordered commit protocol); ~10 new High items, mostly convergent across 2-3 reviewers. Round 3 DECLINED per §18 'explicit engineering decision documented inline' clause — new items are surgical specifications, not architectural rework, and reviewer consensus was GREEN after a small v2.1 pass. User may override this decision. v2.1 changes (keyed to REVIEWS.md R2 IDs): §6.1.1 (new) — vault ship commit protocol - Ordered legs: D1 → Next.js → paper-last (last leg must have hardest rollback; paper tag is remote-durable and cannot be force-rolled). - Journal at releases/<v>/.ship-journal.json; --resume primitive for interrupted ships. - Per-leg rollback matrix: D1 → R2 snapshot restore; Next.js → wrangler rollback; paper-leg → manual, never force-push, remediate via forward-fix release. - Paging triggers defined: pager on paper-leg failure (manual review required) and on any rollback-of-rollback failure. - Resolves Chip N-C1 + Dean N-1 + Soumith H-NEW-1 (convergent Critical). §6.1.1 + §10.2 — X-Vault-Release header demoted from hard-reject to informational SLI. Correctness boundary is release-keyed Cache API key, not the header. 10-min cross-release grace window at the worker (Dean N-3, Soumith H-NEW-2). §10.1 — schema_fingerprint verified against actual sqlite_master at worker cold start. On mismatch: degraded read-only mode (Cache API only, banner header), not 5xx outage. Fixes Chip N-H1 + Dean N-4. §7.1 — Service-worker cache release-keyed; skipWaiting + claim on release change; TTL 7d. Phase-4 rollback drill explicitly tests stale-SW scenario. Fixes Chip N-H2 + Dean N-6 + David N-4. §3.5 — release_hash Merkle now includes __policy__ and __canon_version__ leaves. Nested-dict canonicalization test fixture asserts key-order invariance. Chip N-H5 + Soumith M-NEW-4. §11.5 — CI equivalence compares Merkle release_hash to corpus-equivalence-hash.txt, not 28MB byte-diff. CI budget ≤2min stated. David N-1 + Chip N-H4. §3.3 — ID collision recovery: vault renumber command bumps seq, renames file, updates id field, updates chain refs. vault check --strict enforces registry/file consistency. Dean N-2 + David N-2. §10.6 — FTS5 Phase-4 entry now gates on ≤500 D1 row-reads/query in addition to latency, not just p99. Dean N-5. §13 — Codegen 'who runs it' contract: PR author runs vault codegen locally; CI runs vault codegen --check; CI never pushes follow-up commits. TS package via pnpm workspace protocol. Soumith H-NEW-3 + M-NEW-1. §14 Phase 0 — exemplar-coverage audit produces vault/exemplar-gaps.yaml. Chip N-H3. §7.1 — Static fallback retention extended to 'first post-cutover schema-major bump OR 2 releases, whichever is later'. Dean N-10. §4.3 — canary soak = max(15min, ≥100 sessions observed). David N-5. §4.4 — vault mark-exemplar + vault renumber added to CLI surface. David N-9. §4.2 — vault verify exit code reconciled from 2 → 1 (integrity failure per §4.6). Soumith M-NEW-3. §12.6 / §4.4 — vault promote --reviewed-by CI-enforced against committer email (closes spoofing vector). Soumith L-NEW-1 + David dup. §13 — Removed duplicated Observability subsection. David N-8 cleanup. REVIEWS.md updated with: - Per-reviewer Round-1 resolution status (all RESOLVED). - Round-2 aggregated severity table. - Cross-reviewer convergence map. - v2.1 resolution table (R2-3). - Round-3 decline rationale (R2-4). - Readiness assessment GREEN with 3 documented conditions (license, EVOLUTION.md, measurement gates). Remaining OPEN items for user: - License decision (CC-BY-4.0 corpus + MIT vault-cli recommended); BLOCKS PHASE 3. - Round 3 override if desired.	2026-04-15 18:03:27 -04:00
Vijay Janapa Reddi	c51ca7bae7	docs(vault): architecture v2 (Round-1 review integration) Resolves all 8 Critical and 21 High items from REVIEWS.md Round 1. Defers two Lows with rationale. Licensing open question remains but now has a recommended default (CC-BY-4.0) blocking Phase 3. Load-bearing structural changes: §3.3 Per-question YAML - Content-addressed IDs (topic + short-hash + dedup suffix); registry is append-only, merge-conflicts are semantic (C-5). - Structured 'chain: {id, position}'; legacy compact form rejected on main (H-4). - Closed 'provenance' enum replacing free-text 'generated_by' (H-5). - Content-format rules per field: plaintext / restricted-Markdown / HTTPS-only URLs (H-6). - Path components enum-validated and lowercase-enforced (H-9). - Optional 'authors' list populated from git config (M-15). §3.5 Content hashing (new) - Per-question SHA-256 over canonical JSON of whitelisted semantic fields; release hash is a Merkle root. Hash inputs, never the SQLite binary (which is not reproducible) (C-3). - 'vault verify' reconstructs from YAML — citation-grade verification. §4 CLI decomposed into primitives + composed products - 'vault ship' as atomic release verb coordinating D1 + Next.js + paper push with auto-rollback (C-4). - 'vault publish' staged via POSIX rename(2); '--resume' safe (C-7). - 'vault api' localhost Worker-surface shim for contributors (H-17). - 'vault verify', 'vault restore', 'vault move --dry-run', typed confirmations on destructive commands, stable exit codes (H-11). §5 Invariants hardened - Fast tier adds YAML-size cap, depth cap, alias rejection, URL scheme allowlist, path lowercase enforcement (H-7, H-9). - Scenario-dedup moved to LSH-blocked tier; MinHash + embedding complement to Jaro-Winkler (M-6, M-7). §6 Release workflow - Primary rollback is R2 snapshot restore, not inverse SQL (C-1). - Schema-changing releases gated; require hand-authored up/down. - Pre-deploy R2 snapshot synchronous (C-8, M-5). §7 Website integration - corpus.json retained for 2 releases post-cutover; rollback is a feature flag, not a file delete (C-1). - SWR with retry/backoff/circuit-breaker; service worker cache; top-200 questions inlined for offline resilience (M-10, M-12). - Bundle-claim gated on measured FCP/TTI Lighthouse CI gates (M-21). - §7.4 search UX spec added (M-13). §8 Chain discoverability - Cut to single pre-reveal indicator + instrumentation. Additional interventions gated on measured engagement (H-19). §10 D1 + Worker - Cache keys include release_id for atomic POP invalidation (H-14). - Cursor pagination + ETag + SWR-friendly Cache-Control (H-20). - Admin endpoint removed from phases 0-6 (H-10). - Cost forecast recomputed with 150 calls/session + row-reads accounting: paid tier budgeted from day one (H-13). - §10.5 data-plane SLIs for silent-corruption detection (H-15). - §10.6 FTS5 performance gate before Phase 4 entry (H-12). §11 Workflow continuity — full rewrite - YAML is sole authoring surface from Day 1 of Phase 1 (C-2). - corpus.json becomes generated-only; pre-commit hook enforces; CI equivalence check on every PR. - Single release-policy predicate enforced by import-graph (H-21). §12 vault generate hardened - vault/exemplars/ curated human-only pool; corpus never feeds its own generation (C-6). - Cost ceiling ledger; secrets from ~/.config/vault/secrets.toml; dry-run default; hard cap on --count (H-8). - Full prompt stored in vault/generation-log/ (L-2). §13 Security - Consolidated v2 rules; shared-types contract via LinkML codegen with CI drift check (H-2). - Schema evolution pointed at schema/EVOLUTION.md (H-1). - License recommendation: CC-BY-4.0 corpus, MIT vault-cli (L-10). §14 Phases - Timeline 18-22 days (was 11); explicit overrun policy per phase (H-18). - Phase 0 adds vault-cli/README.md, CONTRIBUTING.md, EVOLUTION.md, JSON_OUTPUT.md, EXIT_CODES.md (H-17, L-7). - Phase 3 gated on license decision (L-10) and FTS5 load test. - Phase 4 gated on Lighthouse CI and rollback drill. §19 Testing - Rollback-symmetry property test per release (M-1). - Commit release manifest, not binary vault.db (M-4). Full finding-by-finding response map in REVIEWS.md §3. Round 2 reviewers will be sent v2 + REVIEWS.md and asked 'does this resolve your Round-1 concerns; anything new you see now?'	2026-04-15 17:52:05 -04:00
Vijay Janapa Reddi	a1fae272fd	docs(vault): add review protocol, testing plan skeleton, autonomous gates Extends ARCHITECTURE.md with three new sections and a standalone kickoff file so a fresh Claude Code session can pick this up cleanly: §18 Review & iteration protocol — 2–3 rounds of adversarial expert review (chip-huyen, jeff-dean, soumith-chintala, student-david) with a severity-ranked findings table committed to REVIEWS.md after each round. §19 Testing plan skeleton — 10-layer test inventory (unit / integration / contract / migration / export-parity / worker-contract / e2e / smoke / load / rollback), CI workflow spec, cutover QA checklist, observability protocol. Full spec lives in a future TESTING.md written during Stage 2. §20 Autonomous mode — explicit gate between "plan hardened" and "execute." Pre-autonomous checklist, per-phase commit/push/ checkpoint rules, stop conditions, 9-point success definition. §21 Pointer to KICKOFF.md. KICKOFF.md is the copy-paste prompt for starting the next session. Self-contained, tells the operator: read context, run Stage 1 review, write Stage 2 testing plan, WAIT for user green-light before Stage 4 autonomous execution. Scoped directories, commit style, stop conditions. No implementation work begins until the plan clears review and the user explicitly green-lights. Intentionally slow at the start to be fast and safe later.	2026-04-15 16:53:50 -04:00
Vijay Janapa Reddi	b4d4dfcbc1	docs(vault): add comprehensive architecture plan for post-release migration 17-section design document covering the full migration from monolithic corpus.json (19 MB inlined per page bundle) to: - Per-question YAML files as the authoring source of truth - SQLite (vault.db) as the built artifact - Cloudflare D1 + Worker as production distribution - Typer + Rich CLI (`vault new`, `vault build`, `vault publish`, ...) - Single release gate that enforces consistency between site + paper Supersedes the stale SYSTEM.md (which still references 5,786 questions and 839 concepts from a year-old state). Captures three bugs found in the current pipeline (paper/site filter disagreement, orphan topics, no release enforcement) and the fix for each. Includes sections on: chain discoverability UX, About-page paper prominence, workflow continuity during migration, LLM-assisted generation via `vault generate`, security/safety/rollback, and a 6-phase rollout plan totaling ~11 working days. Does not yet implement any of this. Phase 0 scheduled for after the 2026-04-22 MIT Press copyedit deadline.	2026-04-15 16:06:57 -04:00

23 Commits