TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-03 15:38:47 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	8546e3e694	🤖 Fix transformer module exports and milestone 05 imports Module export fixes: - Add #|default_exp models.transformer directive to transformers module - Add imports (MultiHeadAttention, GELU, etc.) to export block - Export dataloader module (08_dataloader) - All modules now properly exported to tinytorch package Milestone 05 fixes: - Correct import paths (text.embeddings, data.loader, models.transformer) - Fix Linear.weight vs Linear.weights typo - Fix indentation in training loop - Call .forward() explicitly on transformer components Status: Architecture test mode works, model builds successfully TODO: Fix TransformerBlock/MultiHeadAttention signature mismatch in module 13	2025-10-27 16:17:55 -04:00
Vijay Janapa Reddi	95274448bd	feat: Add Milestone 04 (CNN Revolution 1998) + Clean spatial imports Milestone 04 - CNN Revolution: ✅ Complete 5-Act narrative structure (Challenge → Reflection) ✅ SimpleCNN architecture: Conv2d → ReLU → MaxPool → Linear ✅ Trains on 8x8 digits dataset (1,437 train, 360 test) ✅ Achieves 84.2% accuracy with only 810 parameters ✅ Demonstrates spatial operations preserve structure ✅ Beautiful visual output with progress tracking Key Features: - Conv2d (1→8 channels, 3×3 kernel) detects local patterns - MaxPool2d (2×2) provides translation invariance - 100× fewer parameters than equivalent MLP - Training completes in ~105 seconds (50 epochs) - Sample predictions table shows 9/10 correct Module 09 Spatial Improvements: - Removed ugly try/except import pattern - Clean imports: 'from tinytorch.core.tensor import Tensor' - Matches PyTorch style (simple and professional) - No fallback logic needed All 4 milestones now follow consistent 5-Act structure!	2025-09-30 17:04:41 -04:00
Vijay Janapa Reddi	828c3d9081	feat: Add CrossEntropyLoss autograd support + Milestone 03 MLP on digits Key Changes: - Implemented CrossEntropyBackward for gradient computation - Integrated CrossEntropyLoss into enable_autograd() patching - Created comprehensive loss gradient test suite - Milestone 03: MLP digits classifier (77.5% accuracy) - Shipped tiny 8x8 digits dataset (67KB) for instant demos - Updated DataLoader module with ASCII visualizations Tests: - All 3 losses (MSE, BCE, CrossEntropy) now have gradient flow - MLP successfully learns digit classification (6.9% → 77.5%) - Integration tests pass Technical: - CrossEntropyBackward: softmax - one_hot gradient - Numerically stable via log-softmax - Works with raw class labels (no one-hot needed)	2025-09-30 16:22:09 -04:00
Vijay Janapa Reddi	de3b837bee	Fix nbdev export system across all 20 modules PROBLEM: - nbdev requires #| export directive on EACH cell to export when using # %% markers - Cell markers inside class definitions split classes across multiple cells - Only partial classes were being exported to tinytorch package - Missing matmul, arithmetic operations, and activation classes in exports SOLUTION: 1. Removed # %% cell markers INSIDE class definitions (kept classes as single units) 2. Added #| export to imports cell at top of each module 3. Added #| export before each exportable class definition in all 20 modules 4. Added __call__ method to Sigmoid for functional usage 5. Fixed numpy import (moved to module level from __init__) MODULES FIXED: - 01_tensor: Tensor class with all operations (matmul, arithmetic, shape ops) - 02_activations: Sigmoid, ReLU, Tanh, GELU, Softmax classes - 03_layers: Linear, Dropout classes - 04_losses: MSELoss, CrossEntropyLoss, BinaryCrossEntropyLoss classes - 05_autograd: Function, AddBackward, MulBackward, MatmulBackward, SumBackward - 06_optimizers: Optimizer, SGD, Adam, AdamW classes - 07_training: CosineSchedule, Trainer classes - 08_dataloader: Dataset, TensorDataset, DataLoader classes - 09_spatial: Conv2d, MaxPool2d, AvgPool2d, SimpleCNN classes - 10-20: All exportable classes in remaining modules TESTING: - Test functions use 'if __name__ == "__main__"' guards - Tests run in notebooks but NOT on import - Rosenblatt Perceptron milestone working perfectly RESULT: ✅ All 20 modules export correctly ✅ Perceptron (1957) milestone functional ✅ Clean separation: development (modules/source) vs package (tinytorch)	2025-09-30 11:21:04 -04:00
Vijay Janapa Reddi	cc7c7526c8	Clean up module imports: convert tinytorch.core to sys.path style - Remove circular imports where modules imported from themselves - Convert tinytorch.core imports to sys.path relative imports - Only import dependencies that are actually used in each module - Preserve documentation imports in markdown cells - Use consistent relative path pattern across all modules - Remove hardcoded absolute paths in favor of relative imports Affected modules: 02_activations, 03_layers, 04_losses, 06_optimizers, 07_training, 09_spatial, 12_attention, 17_quantization	2025-09-30 08:58:58 -04:00
Vijay Janapa Reddi	f8104f726a	Restructure TinyTorch into three-part learning journey (17 modules) - Part I: Foundations (Modules 1-5) - Build MLPs, solve XOR - Part II: Computer Vision (Modules 6-11) - Build CNNs, classify CIFAR-10 - Part III: Language Models (Modules 12-17) - Build transformers, generate text Key changes: - Renamed 05_dense to 05_networks for clarity - Moved 08_dataloader to 07_dataloader (swap with attention) - Moved 07_attention to 13_attention (Part III) - Renamed 12_compression to 16_regularization - Created placeholder dirs for new language modules (12,14,15,17) - Moved old modules 13-16 to temp_holding for content migration - Updated README with three-part structure - Added comprehensive documentation in docs/three-part-structure.md This structure gives students three natural exit points with concrete achievements at each level.	2025-09-22 09:50:48 -04:00
Vijay Janapa Reddi	89566d26f3	Update generated notebooks and package exports - Regenerate all .ipynb files from fixed .py modules - Update tinytorch package exports with corrected implementations - Sync package module index with current 16-module structure These generated files reflect all the module fixes and ensure consistent .py ↔ .ipynb conversion with the updated module implementations.	2025-09-18 16:42:57 -04:00
Vijay Janapa Reddi	3883c369c4	Add tito grade command for simplified NBGrader interface Implement comprehensive grading workflow wrapped behind tito CLI: • tito grade setup - Initialize NBGrader course structure • tito grade generate - Create instructor version with solutions • tito grade release - Create student version without solutions • tito grade collect - Collect student submissions • tito grade autograde - Automatically grade submissions • tito grade manual - Open manual grading interface • tito grade feedback - Generate student feedback • tito grade export - Export grades to CSV This allows users to only learn tito commands without needing to understand NBGrader's complex interface. All grading functionality is accessible through simple, consistent tito commands.	2025-09-17 19:22:02 -04:00
Vijay Janapa Reddi	3a7c6a1701	Complete north star validation and demo pipeline - Export all modules with CIFAR-10 and checkpointing enhancements - Create demo_cifar10_training.py showing complete pipeline - Fix module issues preventing clean imports - Validate all components work together - Confirm students can achieve 75% CIFAR-10 accuracy goal Pipeline validated: ✅ CIFAR-10 dataset downloading ✅ Model creation and training ✅ Checkpointing for best models ✅ Evaluation tools ✅ Complete end-to-end workflow	2025-09-17 00:32:13 -04:00
Vijay Janapa Reddi	cec401af65	🧹 Remove Jupyter notebooks from modules/source - Python-first workflow - Delete all 15 .ipynb files from modules/source directories - Align with TinyTorch's Python-first development philosophy - .py files are the source of truth, .ipynb files are temporary outputs - Prevents version control conflicts with notebook metadata - Students work directly with .py files using Jupytext format - Notebooks can be regenerated when needed via 'tito nbdev generate' Removed files: - All *_dev.ipynb files across modules 01-15 - Keeps repository clean and focused on source code	2025-07-20 08:41:26 -04:00
Vijay Janapa Reddi	6c2b253751	refactor: Implement learner-focused module progression with better naming ✅ Renamed modules for clearer pedagogical flow: - 05_networks → 05_dense (multi-layer dense/fully connected networks) - 06_cnn → 06_spatial (convolutional networks for spatial patterns) - 06_attention → 07_attention (attention mechanisms for sequences) ✅ Shifted remaining modules down by 1: - 07_dataloader → 08_dataloader - 08_autograd → 09_autograd - 09_optimizers → 10_optimizers - 10_training → 11_training - 11_compression → 12_compression - 12_kernels → 13_kernels - 13_benchmarking → 14_benchmarking - 14_mlops → 15_mlops - 15_capstone → 16_capstone ✅ Updated module metadata (module.yaml files): - Updated names, descriptions, dependencies - Fixed prerequisite chains and enables relationships - Updated export paths to match new names New learner progression: Foundation → Individual Layers → Dense Networks → Spatial Networks → Attention Networks → Training Pipeline Perfect pedagogical flow: Build one layer → Stack dense layers → Add spatial patterns → Add attention mechanisms → Learn to train them all.	2025-07-18 00:12:50 -04:00

11 Commits
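
Commit 828c3d9081 above summarizes its cross-entropy backward pass as "softmax - one_hot gradient", computed stably via log-softmax and taking raw class labels. The following is a minimal pure-Python sketch of that idea, not the TinyTorch implementation: the function names (`log_softmax`, `cross_entropy`, `cross_entropy_grad`) and the choice to average over the batch are assumptions made for illustration.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax for one row of logits."""
    m = max(logits)  # subtracting the max keeps exp() from overflowing
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - log_sum for z in logits]

def cross_entropy(batch_logits, labels):
    """Mean cross-entropy over a batch; labels are raw class indices."""
    rows = [log_softmax(row) for row in batch_logits]
    return -sum(r[y] for r, y in zip(rows, labels)) / len(labels)

def cross_entropy_grad(batch_logits, labels):
    """Gradient of the mean loss w.r.t. the logits: (softmax - one_hot) / N."""
    n = len(labels)
    grads = []
    for row, y in zip(batch_logits, labels):
        probs = [math.exp(lp) for lp in log_softmax(row)]
        grads.append([(p - (1.0 if k == y else 0.0)) / n
                      for k, p in enumerate(probs)])
    return grads
```

Because softmax probabilities sum to one and the one-hot vector has a single 1, each row of the gradient sums to zero, which makes a quick sanity check when comparing against a finite-difference estimate.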