TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-05-05 14:12:33 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	43ea5f9a65	Fix MLPerf milestone metrics: FLOPs calculation, quantization compression ratio, pruning delta sign - Fixed FLOPs calculation to handle models with .layers attribute (not just Sequential) - Fixed quantization compression ratio to calculate theoretical INT8 size (1 byte per element) - Fixed pruning accuracy delta sign to correctly show +/- direction - Added missing export directives for Tensor and numpy imports in acceleration module Results now correctly show: - FLOPs: 4,736 (was incorrectly showing 64) - Quantization: 4.0x compression (was incorrectly showing 1.0x) - Pruning delta: correct +/- sign based on actual accuracy change	2025-12-03 09:36:10 -08:00
Vijay Janapa Reddi	d3a126235c	Restructure: Separate developer source (src/) from learner notebooks (modules/) Major directory restructure to support both developer and learner workflows: Structure Changes: - NEW: src/ directory for Python source files (version controlled) - Files renamed: tensor.py → 01_tensor.py (matches directory naming) - All 20 modules moved from modules/ to src/ - CHANGED: modules/ now holds generated notebooks (gitignored) - Generated from src/.py using jupytext - Learners work in notebooks, developers work in Python source - UNCHANGED: tinytorch/ package (still auto-generated from notebooks) Workflow: src/.py → modules/.ipynb → tinytorch/.py Command Updates: - Updated export command to read from src/ and generate to modules/ - Export flow: discovers modules in src/, converts to notebooks in modules/, exports to tinytorch/ - All 20 modules tested and working Configuration: - Updated .gitignore to ignore modules/ directory - Updated README.md with new three-layer architecture explanation - Updated export.py source mappings and paths Benefits: - Clean separation: developers edit Python, learners use notebooks - Better version control: only Python source committed, notebooks generated - Flexible learning: can work in notebooks OR Python source - Maintains backward compatibility: tinytorch package unchanged Tested: - Single module export: tito export 01_tensor ✅ - All modules export: tito export --all ✅ - Package imports: from tinytorch.core.tensor import Tensor ✅ - 20/20 modules successfully converted and exported	2025-11-25 00:02:21 -05:00
Vijay Janapa Reddi	763cdd2bf2	Implement Tensor slicing with progressive disclosure and fix embedding gradient flow WHAT: Added Tensor.__getitem__ (slicing) following progressive disclosure principles MODULE 01 (Tensor): - Added __getitem__ method for basic slicing operations - Clean implementation with NO gradient mentions (progressive disclosure) - Supports all NumPy-style indexing: x[0], x[:3], x[1:4], x[:, 1] - Ensures scalar results are wrapped in arrays MODULE 05 (Autograd): - Added SliceBackward function for gradient computation - Implements proper gradient scatter: zeros everywhere except sliced positions - Added monkey-patching in enable_autograd() for __getitem__ - Follows same pattern as existing operations (add, mul, matmul) MODULE 11 (Embeddings): - Updated PositionalEncoding to use Tensor slicing instead of .data - Fixed multiple .data accesses that broke computation graphs - Removed Tensor() wrapping that created gradient-disconnected leafs - Uses proper Tensor operations to preserve gradient flow TESTING: - All 6 component tests PASS (Embedding, Attention, FFN, Residual, Forward, Training) - 19/19 parameters get gradients (was 18/19 before) - Loss dropping better: 1.54→1.08 (vs 1.62→1.24 before) - Model still not learning (0% accuracy) - needs fresh session to test monkey-patching WHY THIS MATTERS: - Tensor slicing is FUNDAMENTAL - needed by transformers for position embeddings - Progressive disclosure maintains educational integrity - Follows existing TinyTorch architecture patterns - Enables position embeddings to potentially learn (pending verification) DOCUMENTS CREATED: - milestones/05_2017_transformer/TENSOR_SLICING_IMPLEMENTATION.md - milestones/05_2017_transformer/STATUS.md - milestones/05_2017_transformer/FIXES_SUMMARY.md - milestones/05_2017_transformer/DEBUG_REVERSAL.md - tests/milestones/test_reversal_debug.py (component tests) ARCHITECTURAL PRINCIPLE: Progressive disclosure is not just nice-to-have, it's CRITICAL for educational systems. Don't expose Module 05 concepts (gradients) in Module 01 (basic operations). Monkey-patch when features are needed, not before.	2025-11-22 18:26:12 -05:00
Vijay Janapa Reddi	96880b3133	Update tinytorch and tito with module exports Re-exported all modules after restructuring: - Updated _modidx.py with new module locations - Removed outdated autogeneration headers - Updated all core modules (tensor, autograd, layers, etc.) - Updated optimization modules (quantization, compression, etc.) - Updated TITO commands for new structure Changes include: - 24 tinytorch/ module files - 24 tito/ command and core files - Updated references from modules/source/ to modules/ All modules re-exported via nbdev from their new locations.	2025-11-10 19:42:03 -05:00
Vijay Janapa Reddi	3830fb2038	feat: update profiler with helper functions and module path - Update module path from 15_profiling to 14_profiling - Add quick_profile helper for quick bottleneck discovery - Add analyze_weight_distribution for pruning analysis - Export new helper functions in __all__	2025-11-09 13:02:57 -05:00
Vijay Janapa Reddi	9a5b7ad05b	Module 15: Export ProfilerComplete and create KV cache profiling demo - Added ProfilerComplete class to profiling_dev.py with all measurement methods - Exported ProfilerComplete to tinytorch/profiling/profiler.py - Created profile_kv_cache.py milestone demonstrating scientific performance measurement - Demo shows 19x speedup from KV caching with detailed profiling metrics - Validates Module 14 KV cache optimization impact quantitatively	2025-11-06 14:21:22 -05:00
Vijay Janapa Reddi	ba6bd79a67	Reset package and export modules 01-07 only (skip broken spatial module)	2025-09-30 13:41:00 -04:00
Vijay Janapa Reddi	1f23035a1e	Add exported package files and cleanup This commit includes: - Exported tinytorch package files from nbdev (autograd, losses, optimizers, training, etc.) - Updated activations.py and layers.py with __call__ methods - New module exports: attention, spatial, tokenization, transformer, etc. - Removed old _modidx.py file - Cleanup of duplicate milestone directories These are the generated package files that correspond to the source modules we've been developing. Students will import from these when using TinyTorch.	2025-09-30 12:38:56 -04:00
Vijay Janapa Reddi	8be87d0add	Fix nbdev export system across all 20 modules PROBLEM: - nbdev requires #| export directive on EACH cell to export when using # %% markers - Cell markers inside class definitions split classes across multiple cells - Only partial classes were being exported to tinytorch package - Missing matmul, arithmetic operations, and activation classes in exports SOLUTION: 1. Removed # %% cell markers INSIDE class definitions (kept classes as single units) 2. Added #| export to imports cell at top of each module 3. Added #| export before each exportable class definition in all 20 modules 4. Added __call__ method to Sigmoid for functional usage 5. Fixed numpy import (moved to module level from __init__) MODULES FIXED: - 01_tensor: Tensor class with all operations (matmul, arithmetic, shape ops) - 02_activations: Sigmoid, ReLU, Tanh, GELU, Softmax classes - 03_layers: Linear, Dropout classes - 04_losses: MSELoss, CrossEntropyLoss, BinaryCrossEntropyLoss classes - 05_autograd: Function, AddBackward, MulBackward, MatmulBackward, SumBackward - 06_optimizers: Optimizer, SGD, Adam, AdamW classes - 07_training: CosineSchedule, Trainer classes - 08_dataloader: Dataset, TensorDataset, DataLoader classes - 09_spatial: Conv2d, MaxPool2d, AvgPool2d, SimpleCNN classes - 10-20: All exportable classes in remaining modules TESTING: - Test functions use 'if __name__ == "__main__"' guards - Tests run in notebooks but NOT on import - Rosenblatt Perceptron milestone working perfectly RESULT: ✅ All 20 modules export correctly ✅ Perceptron (1957) milestone functional ✅ Clean separation: development (modules/source) vs package (tinytorch)	2025-09-30 11:21:04 -04:00
Vijay Janapa Reddi	62aace8718	cleanup: remove empty tinytorch package directories - Remove 14 empty/unused directories from tinytorch/ package - Keep only essential directories: core/, datasets/, configs/ - All directories removed contained only empty __init__.py files or were completely empty - CLI functionality preserved and tested working - Cleaner package structure for development	2025-07-10 22:59:32 -04:00
Vijay Janapa Reddi	5fc55f8cbe	Been refactoring the structure, got setup working	2025-07-10 11:13:45 -04:00
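
Commit 763cdd2bf2 describes the slice backward pass as a gradient scatter: zeros everywhere except the sliced positions. The sketch below illustrates that idea only. It is not TinyTorch's SliceBackward; the function name slice_backward, its signature, and the restriction to 1-D Python lists (to stay dependency-free) are all assumptions made for illustration.

```python
def slice_backward(input_len, key, grad_output):
    """Scatter grad_output into a zero gradient shaped like the 1-D input.

    Hypothetical helper, not TinyTorch's API. `key` is the int or slice
    that was used in the forward pass x[key].
    """
    # Start from zeros: positions that were not selected receive no gradient.
    grad_input = [0.0] * input_len

    if isinstance(key, slice):
        # slice.indices() resolves None and negative bounds against the length.
        positions = range(*key.indices(input_len))
        grads = grad_output
    else:
        if not -input_len <= key < input_len:
            raise IndexError("index out of range")
        positions = [key % input_len]  # normalise a negative index
        grads = [grad_output]          # a scalar index yields a scalar gradient

    # Route each upstream gradient back to the position it was read from.
    for pos, g in zip(positions, grads):
        grad_input[pos] += g
    return grad_input


# Forward pass x[1:4] on a length-5 input, upstream gradient [1, 2, 3].
print(slice_backward(5, slice(1, 4), [1.0, 2.0, 3.0]))
```

A real implementation over NumPy arrays would also need to handle multi-dimensional keys such as x[:, 1], which this sketch leaves out.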

11 Commits
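
Commit 43ea5f9a65 mentions two small metric fixes: a theoretical INT8 size of 1 byte per element (giving 4.0x compression) and a pruning accuracy delta shown with the correct +/- sign. A minimal sketch of both calculations follows. The function names are hypothetical, and the 4-byte (float32) baseline is an assumption inferred from the reported 4.0x figure rather than taken from the repository.

```python
def int8_compression_ratio(num_elements, original_bytes_per_element=4):
    """Ratio of original size to theoretical INT8 size (1 byte per element).

    Hypothetical helper; the 4-byte default assumes float32 weights.
    """
    if num_elements <= 0:
        raise ValueError("num_elements must be positive")
    original_bytes = num_elements * original_bytes_per_element
    int8_bytes = num_elements * 1  # theoretical: exactly 1 byte per element
    return original_bytes / int8_bytes


def format_accuracy_delta(baseline_acc, pruned_acc):
    """Signed change in accuracy after pruning, with an explicit +/- sign."""
    delta = pruned_acc - baseline_acc  # negative means accuracy dropped
    return f"{delta:+.1f}"


print(int8_compression_ratio(1000))
print(format_accuracy_delta(90.0, 88.5))
```

Computing the delta as pruned minus baseline keeps the sign aligned with the direction of the actual accuracy change, which is the behaviour the commit message describes.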