mirror of
https://github.com/MLSysBook/TinyTorch.git
synced 2026-04-28 02:38:06 -05:00
- Change np.dot to np.matmul for proper batched 3D tensor multiplication - Add requires_grad preservation in transpose() operation - Fixes attention mechanism gradient flow issues Regression tests added in tests/regression/test_gradient_flow_fixes.py