8428 Commits

Author SHA1 Message Date
Vijay Janapa Reddi
8a036c13d3 Adds citations and expands AGI discussion
Adds several missing citations to the frontiers.bib file.

Expands upon the discussion of AGI, including scaling
hypotheses, neurosymbolic systems, embodied intelligence,
and multi-agent systems.

Clarifies limitations of current models and explores
potential future directions in AI research.
2025-11-05 18:22:30 -05:00
Vijay Janapa Reddi
f9feaf8504 Removes DOI verification reports
Removes obsolete reports related to DOI verification and definition placement audits.
These reports are no longer needed as the tasks are completed and documented elsewhere.
2025-11-05 18:21:22 -05:00
Vijay Janapa Reddi
49a803e5da fix(deploy): make PDF and EPUB links work in dev preview
Addresses #1016

Changes:
- Changed PDF/EPUB navbar links from absolute URLs (mlsysbook.ai) to
  relative URLs (/pdf, /epub) so they work on both main and dev sites
- Updated deploy-preview workflow to download PDF and EPUB artifacts
  in addition to HTML artifact
- Added step to copy PDF and EPUB files to assets/downloads directory
- Added _redirects file to dev deployment for proper routing

This ensures dev preview site serves its own PDF/EPUB versions rather
than redirecting users to the main production site.
2025-11-05 16:04:53 -05:00
Vijay Janapa Reddi
37e40dee36 fix(quizzes): correct MCQ answer explanations and add validation (#1035)
Addresses #1034

Fixed 47 instances across 20 quiz files where MCQ answer explanations
incorrectly referenced the correct option as one of the incorrect options.

Changes:
1. Fixed all quiz JSON files with incorrect option references
   - Fixed patterns like 'Options A, C, and D' when A is correct
   - Fixed patterns like 'Option C is incorrect' when C is correct
   - Fixed patterns like 'Option A describes...' when A is correct

2. Created fix_mcq_answer_explanations.py script
   - Automatically detects and fixes incorrect option references
   - Handles plural and singular patterns
   - Can be run on all quiz files or specific files

3. Enhanced quizzes.py with validation and opt-in redistribution
   - Added validate_mcq_option_references() function
   - Validation runs during quiz generation to catch LLM errors
   - MCQ redistribution now requires --redistribute-mcq flag (opt-in)
   - Prevents bug from being reintroduced during answer shuffling

All 445 MCQ questions validated across 35 quiz files.
2025-11-05 15:58:54 -05:00
Vijay Janapa Reddi
7deafc9781 Merge revert-container-default-changes into dev
Restoring default build method to 'both' to re-enable baremetal builds.
Container-only default will be reconsidered for future deprecation.
2025-11-05 13:50:10 -05:00
Vijay Janapa Reddi
876ead061d Revert "ci(workflows): default validate-dev to container builds only"
This reverts commit 905bfda9eb.
2025-11-05 13:31:31 -05:00
Vijay Janapa Reddi
905bfda9eb ci(workflows): default validate-dev to container builds only
Change default build method from 'both' to 'container' to improve
efficiency and reduce unnecessary baremetal builds. Baremetal and both
options remain available for manual workflow dispatch when needed.

Changes:
- Set default build_method to 'container'
- Auto-triggered builds now use container only
- Manual dispatch still supports all three options
- Updated summary messages to reflect container default
2025-11-05 12:34:37 -05:00
Vijay Janapa Reddi
805434671e chore(docs): remove completed missing definitions analysis
The missing_definitions_analysis.md working document has served its purpose.
All Tier 1 and Tier 2 recommendations have been successfully implemented:
- 6 definitions added (Tensor, Overfitting, Transfer Learning, Distributed
  Training, Quantization, Batch Processing)
- Plus 2 additional critical definitions (Gradient Descent, Backpropagation)

The definition_placement_audit.md remains as the permanent quality
documentation showing all 47 definitions are optimally placed and ready
for publication.
2025-11-05 09:50:17 -05:00
Vijay Janapa Reddi
711db0f0bc Merge feature/canonical-definitions into dev
Complete overhaul of all 47 textbook definitions to canonical university standard:

STANDARDIZATION (39 existing definitions):
- Reduced to 1-2 sentence format with 3-6 strategic italics
- Removed enumeration and list-heavy structures
- Eliminated leading articles for consistent grammar
- Standardized title format (title="Term" not "Definition of Term")
- Enhanced academic tone (e.g., dramatically → substantially)

NEW DEFINITIONS (8 added):
- Gradient Descent (dl_primer.qmd)
- Backpropagation (dl_primer.qmd)
- Tensor (frameworks.qmd)
- Overfitting (dl_primer.qmd)
- Transfer Learning (workflow.qmd)
- Distributed Training (training.qmd)
- Quantization (optimizations.qmd)
- Batch Processing (training.qmd)

PLACEMENT OPTIMIZATION:
- All 47 definitions audited for optimal pedagogical placement
- Moved Overfitting to precede first usage
- All definitions now follow: context → definition → usage pattern
- 100% placement quality score

DOCUMENTATION:
- Created missing_definitions_analysis.md (Tier 1-3 missing terms)
- Created definition_placement_audit.md (complete quality audit)

QUALITY METRICS:
- 47 canonical definitions (39 standardized + 8 new)
- 100% optimal pedagogical placement
- 100% consistent formatting
- 100% academic tone compliance
- Ready for university textbook publication

Files modified: 21 chapters + 2 documentation files
Total commits: 17 commits on feature branch
2025-11-05 09:46:55 -05:00
Vijay Janapa Reddi
7250385e8e docs(definitions): add comprehensive placement audit report
Complete audit of all 47 definitions confirms 100% optimal placement quality.
All definitions follow textbook best practices:
- Positioned after motivating context
- Placed before substantive usage
- Located in dedicated sections
- Supporting optimal pedagogical flow

Only one repositioning required (Overfitting), now fixed.
All definitions ready for academic review.
2025-11-05 09:43:16 -05:00
Vijay Janapa Reddi
079aa6c135 fix(definitions): move Overfitting definition to optimal pedagogical location
Moved Overfitting definition from after its first mention (line 2044) to the
start of 'Convergence and Stability Considerations' section (line 2036). This
follows textbook best practice: definitions should appear BEFORE the concept
is used extensively, not after.

Placement now follows optimal pattern:
- Section header and brief introduction
- Formal definition callout
- Detailed explanation and usage

All 8 new definitions now properly placed at section starts after motivating
context and before substantive usage.
2025-11-05 09:31:57 -05:00
Vijay Janapa Reddi
0920e6e03e feat(definitions): add Tier 1 and Tier 2 missing definitions
Added 6 critical definitions identified in missing_definitions_analysis.md:
- Tensor (frameworks chapter)
- Overfitting (dl_primer chapter)
- Transfer Learning (workflow chapter)
- Distributed Training (training chapter)
- Quantization (optimizations chapter)
- Batch Processing (training chapter)

All definitions follow the canonical format: single sentence, 3-6 strategic
italics, no enumeration, no leading articles. These foundational concepts
are now formally defined to match university textbook standards.

Addresses missing_definitions_analysis.md Tier 1 and Tier 2 items.
2025-11-05 09:26:15 -05:00
Vijay Janapa Reddi
4b701aba1f feat(definitions): add Backpropagation and Gradient Descent definitions
Added 2 critical Tier 1 definitions following canonical standard:

1. Backpropagation (dl_primer.qmd): Algorithm for computing gradients via chain rule
2. Gradient Descent (dl_primer.qmd): Iterative optimization algorithm for minimizing loss

Both definitions follow canonical format:
- Single sentence
- 3-4 strategic italics
- Academic tone
- Technically precise

Progress: 41/47 definitions complete (39 original + 2 new)
Remaining: 6 more from Tier 1 + Tier 2
2025-11-05 09:22:44 -05:00
Vijay Janapa Reddi
3cfde6346c refactor(definitions): standardize definition title format
Changed all definition titles from inconsistent formats to simple term names:
- 'Definition of X' → 'X'
- 'X Definition' → 'X'
- 'Definition of the X' → 'X'

Affected 39 definitions across 18 files for consistent callout-definition presentation.

Examples:
- 'Definition of Deep Learning' → 'Deep Learning'
- 'Security Definition' → 'Security'
- 'Definition of the Machine Learning Lifecycle' → 'Machine Learning Lifecycle'
2025-11-05 09:20:57 -05:00
Vijay Janapa Reddi
e063aeefaa refine(definitions): enhance academic tone in CNNs definition
Changed 'dramatically fewer' to 'substantially fewer' for maximum academic formality.

Rubio's academic tone audit identified this as the only instance requiring refinement
across all 39 definitions. All definitions now maintain consistent formal academic voice.
2025-11-05 09:09:16 -05:00
Vijay Janapa Reddi
9e62f0f670 docs(definitions): revise benchmarking and AGI definitions to canonical form
Phase 8 - Benchmarking (6):
- ML Benchmarking: Removed problematic phrasing, reduced to 1 sentence
- ML Algorithmic Benchmarks: Reduced from 3 to 1 sentence, cleaner focus
- ML System Benchmarks: Reduced from 3 to 1 sentence, clear infrastructure focus
- ML Data Benchmarks: Reduced from 3 to 1 sentence, emphasized quality assessment
- ML Training Benchmarks: Reduced from 3 to 1 sentence, focused on training phase
- ML Inference Benchmarks: Reduced from 4 to 1 sentence, deployment focus clear

Phase 9 - Forward-Looking (1):
- AGI: Changed 'refers to' to 'represents', reduced from 4 to 1 sentence

All follow canonical standard. Phases 8-9 complete. IMPLEMENTATION: 39/39 definitions (100%).
2025-11-05 09:01:27 -05:00
Vijay Janapa Reddi
f23193c631 docs(definitions): revise Responsible AI suite definitions to canonical form
- Responsible AI: Reduced from 3 to 1 sentence, focused on transformation process
- Sustainable AI: Reduced from 3 to 1 sentence, emphasized first-class constraint
- Resilient AI: Changed 'refers to ability' to 'describes systems', reduced to 1 sentence
- Security: Removed 'in machine learning systems', reduced from 3 to 1 sentence
- Privacy: Removed 'in machine learning systems', reduced from 3 to 1 sentence
- AI for Good: Changed 'refers to' to 'is', reduced from 2 to 1 sentence

All follow canonical standard. Phase 7 (6/6) complete. Progress: 32/39 definitions (82%).
2025-11-05 09:00:13 -05:00
Vijay Janapa Reddi
a667083c88 docs(definitions): revise deployment paradigm definitions to canonical form
ML Systems (5):
- Cloud ML: Reduced to 1 sentence, emphasized capacity vs latency trade-off
- Edge ML: Reduced to 1 sentence, highlighted localized processing benefits
- Mobile ML: Changed 'execution' to 'deployment', standardized structure
- Tiny ML: Changed 'execution' to 'deployment', emphasized long battery life
- Hybrid ML: Reduced to 1 sentence, focused on synergistic integration

On-Device Learning (2):
- On-Device Learning: Reduced from 4 to 1 sentence, clearer constraint statement
- Federated Learning: Changed 'where' to 'in which', reduced from 4 to 1 sentence

All follow canonical standard. Phase 6 (7/7) complete. Progress: 26/39 definitions (67%).
2025-11-05 08:58:52 -05:00
Vijay Janapa Reddi
1f09140615 docs(definitions): revise hardware & optimization definitions to canonical form
- ML Accelerators: Removed article, reduced from 3 to 1 sentence, focused on efficiency
- Mapping: Reduced from 3 to 1 sentence, emphasized three core dimensions
- Model Optimization: Reduced from 4 to 1 sentence, highlighted efficiency vs performance trade-off
- Pruning: Reduced from 2 to 1 sentence, cleaner redundancy removal statement
- ML System Efficiency: Reduced from 3 to 1 sentence, three optimization dimensions clear

All follow canonical standard. Phase 5 (5/5) complete. Progress: 19/39 definitions.
2025-11-05 08:57:22 -05:00
Vijay Janapa Reddi
171f175a1c docs(definitions): revise lifecycle & operations definitions to canonical form
- ML Lifecycle: Removed 'The', reduced from 3 to 1 sentence, focused on iterative nature
- MLOps: Reduced from 3 to 1 sentence, highlighted unique ML challenges
- Training Systems: Reduced from 4 to 1 sentence, emphasized iterative optimization
- ML Frameworks: Removed article, reduced from 2 to 1 sentence, focused on bridging role
- Data Engineering: Added 'systematic', reduced from 2 to 1 sentence, clearer transformation

All follow canonical standard. Phase 4 (5/5) complete. Progress: 14/39 definitions.
2025-11-05 08:55:57 -05:00
Vijay Janapa Reddi
da81f3d81d docs(definitions): revise neural architecture definitions to canonical form
- MLPs: Reduced from 4 sentences to 1, focused on fully-connected nature and trade-offs
- CNNs: Reduced from 3 sentences to 1, emphasized spatial structure exploitation
- RNNs: Reduced from 4 sentences to 1, highlighted sequential processing trade-off
- Attention: Reduced from 3 sentences to 1, focused on content-dependent relationships
- Transformers: Reduced from 3 sentences to 1, emphasized parallelization advantage

All definitions now follow canonical standard. Phase 3 (6/6) complete.
2025-11-05 08:54:50 -05:00
Vijay Janapa Reddi
781faabf81 docs(definitions): revise Deep Learning definition to canonical form
- Reduced from 3 sentences to 1 sentence
- Focused on core distinguishing characteristic: hierarchical representations
- Removed application examples (belong in body text)
- Reduced italics from 11 to 3 focused concepts

Phase 3 (1/6) complete.
2025-11-05 08:53:47 -05:00
Vijay Janapa Reddi
6ffff44752 docs(definitions): revise foundational definitions to canonical form
- AI & ML: Reduced to single sentences, removed 'goal' language, clearer distinction
- Machine Learning System: From 2 sentences to 1, removed numbered list, emphasizes interdependency
- AI Engineering: From 2 sentences to 1, focused on systems-level integration vs enumeration

All definitions now follow canonical standard: single sentence, 3-6 strategic italics,
no articles, technically precise. Phase 2 of 39-definition standardization complete.
2025-11-05 08:53:00 -05:00
github-actions[bot]
b0f06bd57f Update contributors list [skip ci] 2025-11-05 13:29:39 +00:00
Vijay Janapa Reddi
8a7cda3685 Merge pull request #1031 from foundingnimo/foundingnimo-patch-7
Expand on data drift and model adaptation challenges
2025-11-05 08:25:38 -05:00
Vijay Janapa Reddi
b7537359a4 Merge pull request #1032 from foundingnimo/foundingnimo-patch-8
Small changes to address formatting
2025-11-05 08:24:35 -05:00
Vijay Janapa Reddi
87b977c1d2 Clarifies model adaptability constraints
Explains the limitations of model adaptation during deployment.

Highlights that models apply fixed learned distributions,
emphasizing the importance of retraining for adapting to data
drift rather than runtime modification.
2025-11-05 08:19:21 -05:00
Nimo
9e01b3ba8a Small changes to address formatting
Fixed formatting.

Suggested grammar change to clarify sentence and structure
2025-11-06 00:18:07 +13:00
Nimo
6cfd6ddb20 Expand on data drift and model adaptation challenges
Discuss the challenges of data drift and distribution shift in ML systems, emphasizing the limitations of fixed model parameters and the need for retraining.
2025-11-05 17:36:25 +13:00
github-actions[bot]
a1065ce166 Update contributors list [skip ci] 2025-11-05 01:43:06 +00:00
Vijay Janapa Reddi
f13af1ab7b Merge pull request #1030 from GabrielAmazonas/patch-2
CI: Fix "Update contributors list" action failing on non-fast-forward
2025-11-04 20:37:59 -05:00
Gabriel Amazonas
3f1b37d822 removes the final line 2025-11-04 23:53:01 +00:00
Gabriel Amazonas
4bde03679b Fix missing newline at end of update-contributors.yml 2025-11-04 20:20:00 -03:00
Gabriel Amazonas
b1994ab093 Fix git add command in update-contributors.yml 2025-11-04 19:58:17 -03:00
github-actions[bot]
d4385276e7 Update contributors list [skip ci] 2025-11-04 14:05:15 +00:00
Vijay Janapa Reddi
325dadc1fe Merge pull request #1029 from GabrielAmazonas/patch-1
Update DOI for article in introduction.bib
2025-11-04 09:00:24 -05:00
Gabriel Amazonas
d7b714957f Update publication year and journal details in introduction.bib 2025-11-04 10:49:39 -03:00
Gabriel Amazonas
019a08af56 Update DOI for article in introduction.bib 2025-11-04 10:44:24 -03:00
github-actions[bot]
a019b1e40b Update contributors list [skip ci] 2025-11-04 13:05:37 +00:00
Vijay Janapa Reddi
0c68dc4ed0 Merge pull request #1028 from foundingnimo/foundingnimo-patch-6
Tidy up sentence structure in ML code->data shift
2025-11-04 08:01:10 -05:00
Nimo
f3e62ac18c Suggest idiomatic use of phrase
Clarified the description of cloud-based ML systems and their operational complexities.
2025-11-04 23:21:02 +13:00
Nimo
4ee2f00f38 Tidy up sentence structure in ML code->data shift 2025-11-04 20:31:40 +13:00
Vijay Janapa Reddi
4a9674d587 Merge remote-tracking branch 'origin/dev' into dev 2025-11-03 23:27:00 -05:00
kai
1c32b2b0ce Updated bundle.js, removing other js files. Updated search to be just '/' 2025-11-03 23:01:49 -05:00
Vijay Janapa Reddi
fffd3e3832 fix(bibliography): verify and clean invalid DOIs
- Ran betterbib update on all bibliography files (synced 443/460 entries)
- Removed 2 papers with invalid/fabricated DOIs:
  * Kannan2023chiplet (placeholder DOI 10.1109/MM.2022.1234567)
  * chen2019edge (unverifiable DOI 10.1109/SEC.2019.00035)
- Fixed DOI typo in taylor2022 (changed -01331-1 to -01331-9)
- Removed corresponding citations from .qmd files
- Created verification reports documenting all findings

17 DOIs returned 404 errors. 2 removed, 1 fixed, 14 need further investigation.
Famous papers (ZeRO, Eyeriss, MYCIN) confirmed to exist but have incorrect DOIs.
2025-11-03 22:57:20 -05:00
Vijay Janapa Reddi
44ae290324 Merge remote-tracking branch 'origin/dev' into dev 2025-11-03 22:54:18 -05:00
github-actions[bot]
374d74be00 Update contributors list [skip ci] 2025-11-04 02:41:45 +00:00
Vijay Janapa Reddi
57923478f7 chore: remove docs/releases directory
Release notes are now managed directly on GitHub releases.
All releases have been updated with actual release dates in their
descriptions. Future releases will be handled by publish-live workflow.
2025-11-03 21:35:26 -05:00
Vijay Janapa Reddi
7c2b2c1270 Merge pull request #1027 from foundingnimo/foundingnimo-patch-5
Refine grammar of ImageNet breakthrough explanation
2025-11-03 21:33:42 -05:00
Vijay Janapa Reddi
1786354220 docs(release): archive release notes in docs/releases/
Move release notes files from root to docs/releases/ for better
organization and discoverability. These files serve as local backups
of GitHub release notes.
2025-11-03 21:20:54 -05:00