mirror of
https://github.com/MLSysBook/TinyTorch.git
synced 2026-03-09 17:41:59 -05:00
- Created test suite that verifies actual learning (gradient flow, weight updates, loss convergence)
- Fixed MLP Digits (1986): increased training epochs from 15 to 25
- Added requires_grad=True to Conv2d weights (partial fix)
- Identified gradient flow issues in Conv2d, Embedding, and Attention layers
- Comprehensive documentation of issues and fixes needed
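
For illustration only, the three checks named in the first item (gradient flow, weight updates, loss convergence) could be sketched as below. This is a minimal stand-in written with plain NumPy and hand-derived gradients, not the test suite from this commit and not the TinyTorch API. The names `train_linear_model` and `check_learning`, the synthetic data, and the learning rate are all hypothetical; only the epoch count of 25 echoes the commit message.

```python
import numpy as np


def train_linear_model(epochs=25, lr=0.1, seed=0):
    """Fit y = X @ w + b by gradient descent and record diagnostics.

    Hypothetical helper: a NumPy stand-in for a framework training loop.
    """
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(64, 3))
    y = X @ np.array([1.5, -2.0, 0.5]) + 0.3  # synthetic targets

    w = np.zeros(3)
    b = 0.0
    w_initial = w.copy()
    losses, grad_norms = [], []

    for _ in range(epochs):
        err = X @ w + b - y
        losses.append(float(np.mean(err ** 2)))  # mean squared error

        # Hand-derived gradients of the MSE loss.
        grad_w = 2.0 * X.T @ err / len(y)
        grad_b = 2.0 * float(np.mean(err))
        grad_norms.append(float(np.linalg.norm(grad_w)))

        w = w - lr * grad_w
        b = b - lr * grad_b

    return {
        "losses": losses,
        "grad_norms": grad_norms,
        "w_initial": w_initial,
        "w_final": w,
    }


def check_learning(result):
    """Assert the three signs of actual learning (hypothetical helper)."""
    # Gradient flow: the first backward pass yields a non-zero gradient.
    assert result["grad_norms"][0] > 0.0
    # Weight updates: parameters moved away from their initial values.
    assert not np.allclose(result["w_initial"], result["w_final"])
    # Loss convergence: the final loss is lower than the initial loss.
    assert result["losses"][-1] < result["losses"][0]


if __name__ == "__main__":
    check_learning(train_linear_model())
```

A layer whose weights never receive gradients would fail the second check even if the forward pass runs without error. That is the kind of problem the Conv2d, Embedding, and Attention items describe.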