TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-03-09 13:42:01 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	f09759a476	Fix Transformer gradient flow with EmbeddingBackward and proper residual connections - Imported and attached EmbeddingBackward to Embedding.forward() - Fixed residual connections to use tensor addition instead of Tensor(x.data + y.data) - Adjusted convergence thresholds for Transformer complexity (12% loss decrease) - Relaxed weight update criteria to accept LayerNorm tiny updates (60% threshold) - All 19 Transformer parameters now receive gradients and update properly - Transformer learning verification test now passes	2025-11-22 17:33:28 -05:00
Vijay Janapa Reddi	857ab221d8	Fix CNN gradient flow with Conv2dBackward and MaxPool2dBackward - Implemented Conv2dBackward class in spatial module for proper gradient computation - Implemented MaxPool2dBackward to route gradients through max pooling - Fixed reshape usage in CNN test to preserve autograd graph - Fixed conv gradient capture timing in test (before zero_grad) - All 6 CNN parameters now receive gradients and update properly - CNN learning verification test now passes with 74% accuracy and 63% loss decrease	2025-11-22 17:29:20 -05:00
Vijay Janapa Reddi	d05daeb83b	Add comprehensive milestone learning verification tests - Created test suite that verifies actual learning (gradient flow, weight updates, loss convergence) - Fixed MLP Digits (1986): increased training epochs from 15 to 25 - Added requires_grad=True to Conv2d weights (partial fix) - Identified gradient flow issues in Conv2d, Embedding, and Attention layers - Comprehensive documentation of issues and fixes needed	2025-11-22 17:02:10 -05:00
Vijay Janapa Reddi	b7c32d9878	Remove archived and unnecessary files from git tracking - Remove COMMIT_LOG.txt (already in .gitignore) - Remove archived competition module (20_competition_ARCHIVED) - Remove missing text files (ISSUES_DIAGRAM.txt, REVIEW_SUMMARY.txt)	2025-11-19 22:06:29 -05:00
Vijay Janapa Reddi	7d82bca242	Clean up Module 18: Remove unused warnings import	2025-11-19 08:54:10 -05:00
Vijay Janapa Reddi	13e56f2506	Clean up Module 20: Remove unused time and matplotlib imports	2025-11-19 08:54:05 -05:00
Vijay Janapa Reddi	41b5f7e65f	Clean up Module 17: Remove unused time import	2025-11-19 08:54:02 -05:00
Vijay Janapa Reddi	cb2059c06f	Clean up Module 05: Remove unused sys and os imports	2025-11-19 08:54:00 -05:00
Vijay Janapa Reddi	42470e64d8	Clean up Module 03: Remove unused sys and os imports	2025-11-19 08:53:58 -05:00
Vijay Janapa Reddi	f31865560e	Add enumitem package to fix itemize formatting The itemize environment parameters [leftmargin=*, itemsep=1pt, parsep=0pt] were appearing as visible text in the PDF because the enumitem package wasn't loaded. This fix adds \usepackage{enumitem} to the preamble. All itemized lists now format correctly with proper spacing and margins. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-19 08:43:41 -05:00
Vijay Janapa Reddi	fbe91d4c5e	Configure natbib for standard academic citation format Changes: - Reverted invalid natbib options (maxcitenames/maxbibnames are biblatex-only) - natbib with plainnat already uses "et al." for in-text citations with 3+ authors - Bibliography shows full author lists (standard academic practice) - Restored full author lists in references.bib for proper attribution Current behavior: - In-text: "Reddi et al. (2020)" for papers with many authors - Bibliography: Shows all authors (e.g., all 51 authors for MLPerf paper) To truncate bibliography author lists to "10 + et al.", would need: 1. Custom .bst bibliography style file, OR 2. Switch from natbib to biblatex package Compiled successfully: paper.pdf (22 pages) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-18 17:54:44 -05:00
Vijay Janapa Reddi	9dfa8ae6ae	Add sustainable AI and systems citations to future work section Added citations for sustainable ML, energy-efficient computing, mixed precision training, and TinyML benchmarking to strengthen the future work discussion. New citations: - Strubell et al. (2019): Energy and Policy Considerations for Deep Learning in NLP - foundational work on ML carbon footprint - Patterson et al. (2021): Carbon Emissions and Large Neural Network Training - comprehensive analysis of energy use in large models - Micikevicius et al. (2018): Mixed Precision Training - ICLR paper on FP16/FP32 training techniques - Banbury et al. (2021): Benchmarking TinyML Systems - TinyMLPerf benchmarking framework for edge AI Citations integrated into: - Roofline Models section (mixed precision advantages) - Energy and Power Profiling section (sustainable ML and edge AI) These citations ground the future work proposals in established research on green AI, energy-efficient ML, and edge deployment. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-18 17:31:21 -05:00
Vijay Janapa Reddi	3d14d67955	Update development files: streamline benchmarking and capstone dev modules - Clean up benchmarking_dev.py implementation - Refine capstone_dev.py development workflow	2025-11-13 10:46:14 -05:00
Vijay Janapa Reddi	5024c29ad5	Improve module implementations: code quality and functionality updates - Enhance tensor operations and autograd functionality - Improve activation functions and layer implementations - Refine optimizer and training code - Update spatial operations and transformer components - Clean up profiling, quantization, and compression modules - Streamline benchmarking and acceleration code	2025-11-13 10:42:49 -05:00
Vijay Janapa Reddi	65c973fac1	Update module documentation: enhance ABOUT.md files across all modules - Improve module descriptions and learning objectives - Standardize documentation format and structure - Add clearer guidance for students - Enhance module-specific context and examples	2025-11-13 10:42:47 -05:00
Vijay Janapa Reddi	57111ea139	Fix failing module tests - Fix 14_profiling: Replace Tensor with Linear model in test_module, fix profile_forward_pass calls - Fix 15_quantization: Increase error tolerance for INT8 quantization test, add export marker for QuantizedLinear - Fix 19_benchmarking: Return Tensor objects from RealisticModel.parameters(), handle memoryview in pred_array.flatten() - Fix 20_capstone: Make imports optional (MixedPrecisionTrainer, QuantizedLinear, compression functions) - Fix 20_competition: Create Flatten class since it doesn't exist in spatial module - Fix 16_compression: Add export markers for magnitude_prune and structured_prune All modules now pass their inline tests.	2025-11-12 14:19:33 -05:00
Vijay Janapa Reddi	5bbf2a1a37	Module improvements: Advanced modules (16-20) - Update memoization module and notebook - Enhance acceleration module - Improve benchmarking module - Refine capstone module - Update competition module	2025-11-11 19:05:02 -05:00
Vijay Janapa Reddi	1f581f5bf0	Module improvements: Core modules (01-08) - Update tensor module notebook - Enhance activations module - Expand layers module functionality - Improve autograd implementation - Add optimizers enhancements - Update training module - Refine dataloader notebook	2025-11-11 19:05:00 -05:00
Vijay Janapa Reddi	69abbe8754	Add systems analysis: Autograd profiling - Add memory profiling with tracemalloc - Add backward pass performance benchmarking - Add computational complexity analysis - Demonstrates autograd overhead and performance characteristics	2025-11-11 19:04:59 -05:00
Vijay Janapa Reddi	cb5ad9ccf1	Cleanup: Remove old/unused files - Remove datasets analysis and download scripts (replaced by updated README) - Remove archived book development documentation - Remove module review reports (16_compression, 17_memoization)	2025-11-11 19:04:56 -05:00
Vijay Janapa Reddi	ae33298805	Fix NBGrader metadata for Modules 15 and 16 Module 15 (Quantization): - Added locked=true to test_module cell (line 1523) - Added NBGrader metadata to systems-thinking markdown cell (line 1751) - Added schema_version: 3 to both cells Module 16 (Compression): - Added NBGrader metadata to 6 solution cells: * measure-sparsity (line 380) * magnitude-prune (line 511) * structured-prune (line 675) * low-rank-approx (line 843) * distillation (line 1013) * compress-model-comprehensive (line 1234) - Added NBGrader metadata to 6 test cells: * test-measure-sparsity (line 427) - 5 points * test-magnitude-prune (line 567) - 10 points * test-structured-prune (line 733) - 10 points * test-low-rank (line 888) - 10 points * test-distillation (line 1133) - 15 points * test-compression-integration (line 1300) - 20 points - Total: 70 points for Module 16 Result: - Module 15: 0 P0-BLOCKER, 0 P1-IMPORTANT (was 1 P0 + 1 P1) - Module 16: 0 P0-BLOCKER, 0 P1-IMPORTANT (was 12 P0) - Both modules now production-ready for NBGrader deployment(https://claude.com/claude-code)	2025-11-11 14:50:37 -05:00
Vijay Janapa Reddi	78d0ca6afc	Remove redundant review documentation Removed redundant and superseded review reports: - Module 15: COMPREHENSIVE_REVIEW_REPORT.md, FINAL_VALIDATION_REPORT.md, REVIEW_SUMMARY.md - Docs: RESTRUCTURING_VERIFICATION.md, book-development/CLEANUP_SUMMARY.md Also removed untracked files: - Module 11: REVIEW_REPORT_FINAL.md (superseded by REVIEW_REPORT.md) - Module 12: REVIEW_SUMMARY.md (redundant with REVIEW_REPORT.md) - Module 20: COMPLIANCE_CHECKLIST.md (redundant with REVIEW_REPORT.md) - Module 6, 8, 14, 18: COMPLIANCE_SUMMARY.md and QUICK_SUMMARY.md files Retained comprehensive REVIEW_REPORT.md files which contain the most complete QA documentation.	2025-11-11 12:15:36 -05:00
Vijay Janapa Reddi	ac6f88ec0b	Remove temporary analysis and fix documentation Removed 31 temporary markdown files that documented completed work: - Module-specific fix reports (Module 07, 16, 17, 19-20) - Hasattr audit files (completed audit) - Module progression review reports (completed) - Infrastructure analysis reports (completed) - Renumbering and restructuring summaries (completed) Retained valuable documentation: - All REVIEW_REPORT.md files (comprehensive QA documentation) - All COMPLIANCE_SUMMARY.md files (quick reference) - COMPREHENSIVE_MODULE_REVIEW_STATUS.md (tracking) - MODULE_DEPENDENCY_MAP.md and MODULE_PROGRESSION_GUIDE.md (guides)	2025-11-11 12:09:31 -05:00
Vijay Janapa Reddi	d1fe4d2f8e	Remove temporary analysis files from modules Cleaned up temporary AI-generated analysis files: - modules/15_quantization/FIXES_APPLIED.md - modules/15_quantization/FIXES_TO_APPLY.md - modules/16_compression/FIXES_REQUIRED.md - modules/17_memoization/FIXES_APPLIED.md - Plus other untracked analysis files These were temporary debugging/review artifacts. Now covered by .gitignore patterns to prevent future accumulation.	2025-11-10 19:50:43 -05:00
Vijay Janapa Reddi	2725e31f90	Add module metadata for competition module Added module.yaml for Module 20 (Competition & Validation): - Module configuration and learning objectives - Prerequisites and skill development tracking - Test coverage and connection documentation This module brings together all optimization techniques learned in modules 14-18 for competition preparation.	2025-11-10 19:44:06 -05:00
Vijay Janapa Reddi	ec7168dc90	Add module development files to new structure Added all module development files to modules/XX_name/ directories: Module notebooks and scripts: - 18 modules with .ipynb and .py files (01-20, excluding some gaps) - Moved from modules/source/ to direct module directories - Includes tensor, autograd, layers, transformers, optimization modules Module README files: - Added README.md for modules with additional documentation - Complements ABOUT.md files added earlier This completes the module restructuring: - Before: modules/source/XX_name/_dev.{py,ipynb} - After: modules/XX_name/_dev.{py,ipynb} All development happens directly in numbered module directories now.	2025-11-10 19:43:36 -05:00
Vijay Janapa Reddi	d03435c5c3	Update documentation for site/ migration and restructuring Documentation updates across the codebase: Root documentation: - README.md: Updated references from book/ to site/ - CONTRIBUTING.md: Updated build and workflow instructions - .shared-ai-rules.md: Updated AI assistant rules for new structure GitHub configuration: - Issue templates updated for new module locations - Workflow references updated from book/ to site/ docs/ updates: - STUDENT_QUICKSTART.md: New paths and structure - module-rules.md: Updated module development guidelines - NBGrader documentation: Updated for module restructuring - Archive documentation: Updated references Module documentation: - modules/17_memoization/README.md: Updated after reordering All documentation now correctly references: - site/ instead of book/ - modules/XX_name/ instead of modules/source/	2025-11-10 19:42:48 -05:00
Vijay Janapa Reddi	d25861c68e	Remove modules/source/ directory structure Completed restructuring: modules/source/XX_name/ → modules/XX_name/ All module development files moved to their numbered directories: - modules/01_tensor/tensor_dev.{py,ipynb} - modules/02_activations/activations_dev.{py,ipynb} - ... (modules 03-20) Removed obsolete source structure: - modules/source/01_tensor/ through modules/source/20_capstone/ - modules/source/20_competition/ (legacy competition module) - 43 files total (21 modules × 2 files each + 1 module.yaml) This simplifies the module structure and makes development files easier to find alongside their ABOUT.md and README.md files.	2025-11-10 19:41:24 -05:00
Vijay Janapa Reddi	a2e4586f18	Update documentation after module reordering All module references updated to reflect new ordering: - Module 15: Quantization (was 16) - Module 16: Compression (was 17) - Module 17: Memoization (was 15) Updated by module-developer and website-manager agents: - Module ABOUT files with correct numbers and prerequisites - Cross-references and "What's Next" chains - Website navigation (_toc.yml) and content - Learning path progression in LEARNING_PATH.md - Profile milestone completion message (Module 17) Pedagogical flow now: Profile → Quantize → Prune → Cache → Accelerate	2025-11-10 19:37:41 -05:00
Vijay Janapa Reddi	a71e0eded5	Reorder modules for better pedagogical flow Moved memoization (KV-cache) after compression to align with optimization tier milestones. Changes: - Module 15: Quantization (was 16) - Module 16: Compression (was 17) - Module 17: Memoization (was 15) Pedagogical Rationale: This creates clear alignment with the optimization milestone structure: - M06 (Profiling): Module 14 - M07 (Compression): Modules 15-16 (Quantization + Compression) - M08 (Acceleration): Modules 17-18 (Memoization/KV-cache + Acceleration) Before: Students learned KV-cache before understanding why models are slow After: Students profile → compress → then optimize with KV-cache Updated milestone reference in profile_kv_cache.py: Module 15 → Module 17	2025-11-10 19:29:10 -05:00
Vijay Janapa Reddi	caca0e3903	Fix Module 16 quantization syntax and imports Fix misplaced triple-quote causing syntax error and add Sequential import	2025-11-10 07:30:40 -05:00
Vijay Janapa Reddi	cf3cb87bd4	Fix Module 15 memoization: Add optional mask parameter to MockTransformerBlock forward method	2025-11-10 07:26:11 -05:00
Vijay Janapa Reddi	dd622bb5ae	Fix Module 12 attention: Correct masking logic to use 0 for masked positions instead of negative values	2025-11-10 07:26:09 -05:00
Vijay Janapa Reddi	ca9198875c	Fix Module 06 optimizers: Use duck typing for Tensor validation and extract grad data properly in AdamW	2025-11-10 07:26:07 -05:00
Vijay Janapa Reddi	bec5f5ce45	Remove internal restructuring documentation - Delete modules/source/14_profiling/RESTRUCTURING_SUMMARY.md - Internal implementation notes no longer needed after refactoring completion	2025-11-09 17:03:43 -05:00
Vijay Janapa Reddi	474016e91f	Remove outdated kvcaching module files - Delete kvcaching_dev.py (superseded by memoization_dev.py) - Delete kvcaching_dev.ipynb (superseded by memoization_dev.ipynb) - memoization_dev files are the current versions with complete content	2025-11-09 17:03:31 -05:00
Vijay Janapa Reddi	fb77c327f1	Remove outdated development reports - Delete MODULE_14_COMPLETION_REPORT.md - Delete MODULE_14_REVIEW.md - Delete RESTRUCTURE_COMPLETE.md - Delete OPTIMIZATION_TIER_RESTRUCTURE_PLAN.md - Delete PROGRESS_SUMMARY.md - Delete PROJECT_STATUS.md - Delete SCAFFOLDING_COMPLIANCE_REPORT.md - Delete modules/COMPLIANCE_REPORT_FINAL.md - Delete modules/GOLD_STANDARD_ANALYSIS.md - Delete modules/MODULES_14-20_AUDIT.md	2025-11-09 16:56:08 -05:00
Vijay Janapa Reddi	40b7fb8290	Remove obsolete backup files - Delete tinytorch/core/training.py.bak - Delete tinytorch/core/optimizers.py.bak - Delete modules/source/14_profiling/profiling_dev.py.backup	2025-11-09 16:55:49 -05:00
Vijay Janapa Reddi	0ed16a1553	Update release documentation and advanced modules - Updated release checklist and December 2024 release notes - Updated student version tooling documentation - Modified modules 15-19 (memoization, quantization, compression, benchmarking) - Added milestone dashboard and progress tracking - Added compliance reports and module audits - Added checkpoint tests for modules 15-20 - Added activation script and book configuration	2025-11-09 16:51:55 -05:00
Vijay Janapa Reddi	bbaa449da6	build: add generated memoization notebook Generated from memoization_dev.py after module restructuring	2025-11-09 14:41:24 -05:00
Vijay Janapa Reddi	1c299cddb0	docs: add comprehensive docstrings to optimization modules 16-19 - Add Args/Returns/Example/Hints to key functions - Improve documentation for compare_model_sizes (16) - Enhance function documentation in compression (17) - Add docstring details for acceleration (18) - Improve benchmarking function docs (19)	2025-11-09 14:38:44 -05:00
Vijay Janapa Reddi	a6e57ff379	docs: add Args/Returns docstrings to quantization functions	2025-11-09 13:03:43 -05:00
Vijay Janapa Reddi	a272030037	build: regenerate profiling notebook from updated dev file	2025-11-09 13:03:30 -05:00
Vijay Janapa Reddi	9e22c3caf6	refactor: Remove old module and chapter files after reorganization Cleanup of renamed files: - Deleted old module source files (14_kvcaching, 15_profiling, 16_acceleration, etc.) - Deleted old chapter markdown files - These have been replaced by reorganized versions in previous commits	2025-11-09 12:26:47 -05:00
Vijay Janapa Reddi	cbd275e4aa	refactor(modules): Reorganize optimization tier structure (14-19) Module renaming and reordering: - 15_profiling → 14_profiling (now first in optimization tier) - 14_kvcaching → 15_memoization (renamed to emphasize pattern) - 17_quantization → 16_quantization - 18_compression → 17_compression - 16_acceleration → 18_acceleration (moved after compression) - 19_benchmarking (unchanged) All module metadata updated (numbers, prerequisites, connection maps)	2025-11-09 12:26:13 -05:00
Vijay Janapa Reddi	ef1a5ec7fd	feat(modules): Add profiling motivation sections to optimization modules - Quantization: Shows FP32 memory usage, motivates precision reduction - Compression: Shows weight distribution, motivates pruning - Acceleration: Shows CNN compute bottleneck, motivates vectorization Each module now follows pattern: Profile → Discover → Fix	2025-11-09 12:26:03 -05:00
Vijay Janapa Reddi	976f0ed278	feat(memoization): Add profiling motivation section - Shows O(n²) latency growth in transformer generation - Demonstrates problem before teaching solution - Prepares module for reorganization to Module 15	2025-11-09 09:16:08 -05:00
Vijay Janapa Reddi	b52b762545	feat(profiler): Add helper functions for optimization modules - Add quick_profile() for simplified profiling interface - Add analyze_weight_distribution() for compression module - Both functions will be used by modules 15-18	2025-11-09 09:15:13 -05:00
Vijay Janapa Reddi	16660d921d	Implement MLPerf Edu Competition module (Module 20) Complete capstone competition implementation: - Two division tracks: Closed (optimize) and Open (innovate) - Baseline CNN model for CIFAR-10 - Validation and submission generation system - Integration with Module 19 normalized scoring - Honor code and GitHub repo submission workflow - Worked examples and student templates Module 20 is now a pedagogically sound capstone that applies all Optimization Tier techniques in a fair competition format.	2025-11-07 20:04:57 -05:00
Vijay Janapa Reddi	3cefcf192e	Add normalized scoring and MLPerf principles to Module 19 Enhancements to benchmarking module: - Added calculate_normalized_scores() for fair hardware comparison - Implemented speedup, compression ratio, accuracy delta metrics - Added MLPerf principles section to educational content - Updated module to support competition fairness These changes enable Module 20 competition to work across different hardware.	2025-11-07 20:04:46 -05:00

1 2 3 4 5 ...

548 Commits