Checkpoint the branch-wide content/config revisions together with workbench enhancements so chapter rendering and developer workflows stay aligned. This captures the current validation-driven formatting and parallel build/debug improvements in one commit.
Remove redundant ml_ prefix from ml_workflow chapter files and update all
Quarto config references. Consolidate custom scripts into native binder
subcommands and archive obsolete tooling.
Unify Quarto metadata into shared base/format/volume fragments, and carry through the chapter path, asset, and tooling updates so the repository stays consistent and easier to maintain.
Academic Improvements:
- Added 16 foundational citations to bibliography (GFS, HDFS, CAP theorem, tail latency, Young-Daly, Bigtable, Spark, Berkeley view of AI systems)
- Cited Ghemawat et al. (2003) for Google File System
- Cited Dean and Barroso (2013) for tail latency analysis
- Cited Brewer (2000) and Gilbert & Lynch (2002) for CAP theorem
- Cited Young (1974) and Daly (2006) for checkpoint interval formula
- Cited Shvachko et al. (2010) for HDFS architecture
- Cited Chang et al. (2008) for Bigtable
- Cited Zaharia et al. (2016) for Apache Spark
- Cited Stoica et al. (2017) for Berkeley AI systems view
- Cited AWS (2020) for S3 consistency update
Quality Assurance:
- Verified chapter builds successfully with all citations
- Maintained existing narrative flow and structure
- Preserved all technical content and examples
- No changes to Key Takeaways (already accurate)
This polish addresses the previously empty bibliography and ensures proper academic attribution for major technical concepts and systems discussed throughout the chapter.
- Renamed vol2/advanced_intro to vol2/introduction for consistency
- Updated all scripts and configs to use vol1/ instead of core/
- Updated pre-commit config to check all of contents/, not just vol1/
- Updated path references in Lua filters, Python scripts, and configs
* Restructure: Move book content to book/ subdirectory
- Move quarto/ → book/quarto/
- Move cli/ → book/cli/
- Move docker/ → book/docker/
- Move socratiQ/ → book/socratiQ/
- Move tools/ → book/tools/
- Move scripts/ → book/scripts/
- Move config/ → book/config/
- Move docs/ → book/docs/
- Move binder → book/binder
Git history fully preserved for all moved files.
Part of repository restructuring to support MLSysBook + TinyTorch.
Pre-commit hooks are bypassed for this commit because their paths still need updating.
* Update pre-commit hooks for book/ subdirectory
- Update all quarto/ paths to book/quarto/
- Update all tools/ paths to book/tools/
- Update config/linting to book/config/linting
- Update project structure checks
Pre-commit hooks will now work with the new directory structure.
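For illustration only, the kind of prefix rewrite described above could be sketched as follows. This is not the script used for this commit; `MOVED_DIRS` and `rebase_paths` are hypothetical names, and the directory list is an assumption drawn from the moves listed earlier.

```python
import re

# Hypothetical list of top-level directories that moved under book/.
MOVED_DIRS = ("quarto", "tools", "config", "cli", "docker")

# Match a moved directory only where it starts a path, i.e. where it is
# not preceded by a word character, '/', '.', or '-'. This skips paths
# that already carry the book/ prefix, so the rewrite is idempotent.
_PATTERN = re.compile(r"(?<![\w/.\-])(" + "|".join(MOVED_DIRS) + r")/")


def rebase_paths(text: str) -> str:
    """Prefix every top-level reference to a moved directory with book/."""
    return _PATTERN.sub(r"book/\1/", text)
```

Because already-prefixed paths are left alone, running the sketch twice over the same config should give the same result as running it once.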
* Update .gitignore for book/ subdirectory structure
- Update quarto/ paths to book/quarto/
- Update assets/ paths to book/quarto/assets/
- Maintain all existing ignore patterns
* Update GitHub workflows for book/ subdirectory
- Update all quarto/ paths to book/quarto/
- Update cli/ paths to book/cli/
- Update tools/ paths to book/tools/
- Update docker/ paths to book/docker/
- Update config/ paths to book/config/
- Maintain all workflow functionality
* Update CLI config to support book/ subdirectory
- Check for book/quarto/ path first
- Fall back to quarto/ for backward compatibility
- Maintain full CLI functionality
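A minimal sketch of the lookup order described above, assuming the CLI resolves the Quarto directory relative to the repository root. The function name `resolve_quarto_dir` is hypothetical and may not match the actual CLI code.

```python
from pathlib import Path


def resolve_quarto_dir(repo_root: Path) -> Path:
    """Return book/quarto/ if it exists, otherwise the legacy quarto/."""
    candidates = (
        repo_root / "book" / "quarto",  # new layout, checked first
        repo_root / "quarto",           # legacy layout, kept as fallback
    )
    for candidate in candidates:
        if candidate.is_dir():
            return candidate
    raise FileNotFoundError(
        f"No quarto directory found under {repo_root} "
        "(looked for book/quarto/ and quarto/)"
    )
```

Checking the new path first means a checkout that still contains a stale quarto/ directory resolves to book/quarto/ once the move has landed.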
* Create new root and book READMEs for dual structure
- Add comprehensive root README explaining both projects
- Create book-specific README with quick start guide
- Document repository structure and navigation
- Prepare for TinyTorch integration