TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-04 22:42:16 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	95274448bd	feat: Add Milestone 04 (CNN Revolution 1998) + Clean spatial imports Milestone 04 - CNN Revolution: ✅ Complete 5-Act narrative structure (Challenge → Reflection) ✅ SimpleCNN architecture: Conv2d → ReLU → MaxPool → Linear ✅ Trains on 8x8 digits dataset (1,437 train, 360 test) ✅ Achieves 84.2% accuracy with only 810 parameters ✅ Demonstrates spatial operations preserve structure ✅ Beautiful visual output with progress tracking Key Features: - Conv2d (1→8 channels, 3×3 kernel) detects local patterns - MaxPool2d (2×2) provides translation invariance - 100× fewer parameters than equivalent MLP - Training completes in ~105 seconds (50 epochs) - Sample predictions table shows 9/10 correct Module 09 Spatial Improvements: - Removed ugly try/except import pattern - Clean imports: 'from tinytorch.core.tensor import Tensor' - Matches PyTorch style (simple and professional) - No fallback logic needed All 4 milestones now follow consistent 5-Act structure!	2025-09-30 17:04:41 -04:00
Vijay Janapa Reddi	828c3d9081	feat: Add CrossEntropyLoss autograd support + Milestone 03 MLP on digits Key Changes: - Implemented CrossEntropyBackward for gradient computation - Integrated CrossEntropyLoss into enable_autograd() patching - Created comprehensive loss gradient test suite - Milestone 03: MLP digits classifier (77.5% accuracy) - Shipped tiny 8x8 digits dataset (67KB) for instant demos - Updated DataLoader module with ASCII visualizations Tests: - All 3 losses (MSE, BCE, CrossEntropy) now have gradient flow - MLP successfully learns digit classification (6.9% → 77.5%) - Integration tests pass Technical: - CrossEntropyBackward: softmax - one_hot gradient - Numerically stable via log-softmax - Works with raw class labels (no one-hot needed)	2025-09-30 16:22:09 -04:00
Vijay Janapa Reddi	82fd89d5b3	Remove unnecessary matplotlib import from losses module Issue: xor_crisis.py was failing with ImportError on matplotlib architecture mismatch Root cause: losses_dev.py imported matplotlib.pyplot but never used it Fix: - ✅ Removed unused imports: matplotlib.pyplot, time - ✅ Re-exported module 04_losses to update tinytorch package - ✅ Verified both milestone 02 scripts now run successfully The matplotlib import was causing failures on M2 Macs where matplotlib was installed for wrong architecture (x86_64 vs arm64). Since it was never used, removing it eliminates the dependency entirely. Tested: - ✅ milestones/02_xor_crisis_1969/xor_crisis.py (49% accuracy - expected failure) - ✅ milestones/02_xor_crisis_1969/xor_solved.py (100% accuracy - perfect!)	2025-09-30 14:16:42 -04:00
Vijay Janapa Reddi	af1c313d16	Reset package and export modules 01-07 only (skip broken spatial module)	2025-09-30 13:41:00 -04:00
Vijay Janapa Reddi	eeb308a691	WIP: Manual edits to tinytorch (WRONG APPROACH - needs revert) WARNING: I incorrectly edited files in tinytorch/ directly: - tinytorch/core/autograd.py - added enable_autograd() manually - tinytorch/core/activations.py - tried to add gradient tracking - tinytorch/core/losses.py - restored from git CORRECT APPROACH: 1. Make ALL changes in modules/source/XX_*/YY_dev.py 2. Add #\| export directives for classes to export 3. Run: tito export XX_module 4. NEVER edit tinytorch/ files directly Next steps: - Revert tinytorch/ manual edits - Add proper exports to source modules - Export cleanly	2025-09-30 13:31:31 -04:00
Vijay Janapa Reddi	0015a8cab1	WIP: Add SigmoidBackward and BCEBackward classes to autograd Added: - SigmoidBackward class to modules/source/05_autograd/autograd_dev.py with #\| export - BCEBackward class to modules/source/05_autograd/autograd_dev.py with #\| export - Both classes exported to tinytorch/core/autograd.py - Updated Sigmoid activation to track gradients using SigmoidBackward - Updated BCE loss to track gradients using BCEBackward ISSUE: Training still not learning - gradients not flowing properly - Loss stays constant at 0.7911 - Weights don't update - Sigmoid.forward() code looks correct but a.requires_grad stays False - Need to investigate why gradient tracking isn't working through activations	2025-09-30 13:23:56 -04:00
Vijay Janapa Reddi	76da686ce0	Update loss function examples to use PyTorch-style callable API Updated docstring examples to use cleaner callable syntax: - loss_fn(predictions, targets) instead of loss_fn.forward(predictions, targets) Applied to: - MSELoss - CrossEntropyLoss - BinaryCrossEntropyLoss Demonstrates proper usage with __call__ methods for cleaner, more Pythonic code.	2025-09-30 12:36:27 -04:00
Vijay Janapa Reddi	17cb8049c6	Add __call__ methods to enable PyTorch-style API Enable cleaner API usage by adding __call__ methods to all activation, layer, and loss classes. This allows students to write: - relu(x) instead of relu.forward(x) - layer(x) instead of layer.forward(x) - loss_fn(pred, target) instead of loss_fn.forward(pred, target) Changes: - Module 02 (Activations): Add __call__ to ReLU, Tanh, GELU, Softmax * Sigmoid already had __call__ - Module 03 (Layers): Add __call__ to Dropout * Linear already had __call__ - Module 04 (Losses): Add __call__ to MSELoss, CrossEntropyLoss, BinaryCrossEntropyLoss This matches PyTorch's API convention where model(x) calls model.__call__(x) which internally calls model.forward(x). Makes code more Pythonic and intuitive for students familiar with PyTorch. Expected impact: Test pass rates should improve significantly as tests expect PyTorch-style callable API.	2025-09-30 12:33:45 -04:00
Vijay Janapa Reddi	de3b837bee	Fix nbdev export system across all 20 modules PROBLEM: - nbdev requires #\| export directive on EACH cell to export when using # %% markers - Cell markers inside class definitions split classes across multiple cells - Only partial classes were being exported to tinytorch package - Missing matmul, arithmetic operations, and activation classes in exports SOLUTION: 1. Removed # %% cell markers INSIDE class definitions (kept classes as single units) 2. Added #\| export to imports cell at top of each module 3. Added #\| export before each exportable class definition in all 20 modules 4. Added __call__ method to Sigmoid for functional usage 5. Fixed numpy import (moved to module level from __init__) MODULES FIXED: - 01_tensor: Tensor class with all operations (matmul, arithmetic, shape ops) - 02_activations: Sigmoid, ReLU, Tanh, GELU, Softmax classes - 03_layers: Linear, Dropout classes - 04_losses: MSELoss, CrossEntropyLoss, BinaryCrossEntropyLoss classes - 05_autograd: Function, AddBackward, MulBackward, MatmulBackward, SumBackward - 06_optimizers: Optimizer, SGD, Adam, AdamW classes - 07_training: CosineSchedule, Trainer classes - 08_dataloader: Dataset, TensorDataset, DataLoader classes - 09_spatial: Conv2d, MaxPool2d, AvgPool2d, SimpleCNN classes - 10-20: All exportable classes in remaining modules TESTING: - Test functions use 'if __name__ == "__main__"' guards - Tests run in notebooks but NOT on import - Rosenblatt Perceptron milestone working perfectly RESULT: ✅ All 20 modules export correctly ✅ Perceptron (1957) milestone functional ✅ Clean separation: development (modules/source) vs package (tinytorch)	2025-09-30 11:21:04 -04:00
Vijay Janapa Reddi	b428b63b81	feat: implement professional selective export pattern across all modules BREAKING CHANGE: Refactor from whole-module exports to selective function/class exports What Changed: - Separate development utilities from production exports - Each function/class gets individual #\| export directive - Clean Prerequisites & Setup sections in all modules - Development helpers (import_previous_module) not exported Module Export Summary: - 01_tensor: Tensor class only - 02_activations: Sigmoid, ReLU, Tanh, GELU, Softmax only - 03_layers: Linear, Dropout only - 04_losses: MSELoss, CrossEntropyLoss, BinaryCrossEntropyLoss, log_softmax only - 05_autograd: Function class only - 06_optimizers: SGD, Adam, AdamW only Benefits: ✅ Clean public API (matches PyTorch/TensorFlow patterns) ✅ No development utilities in final package ✅ Professional software education standards ✅ Clear separation of concerns ✅ Educational clarity for students This matches industry standards for educational ML frameworks.	2025-09-30 09:48:47 -04:00
Vijay Janapa Reddi	e82ec44e6a	feat: standardize integration testing with import helpers - Add import_previous_module() helper function to all core modules (01-07) - Standardize cross-module imports for integration testing - Add clear Prerequisites & Setup sections explaining module dependencies - Update integration tests to use standardized import pattern - Maintain clean separation between development and production code This provides a consistent, educational approach to module integration while keeping the codebase maintainable and student-friendly.	2025-09-30 09:42:58 -04:00
Vijay Janapa Reddi	cc7c7526c8	Clean up module imports: convert tinytorch.core to sys.path style - Remove circular imports where modules imported from themselves - Convert tinytorch.core imports to sys.path relative imports - Only import dependencies that are actually used in each module - Preserve documentation imports in markdown cells - Use consistent relative path pattern across all modules - Remove hardcoded absolute paths in favor of relative imports Affected modules: 02_activations, 03_layers, 04_losses, 06_optimizers, 07_training, 09_spatial, 12_attention, 17_quantization	2025-09-30 08:58:58 -04:00

12 Commits