Mirror of https://github.com/MLSysBook/TinyTorch.git
Synced 2026-03-12 09:43:35 -05:00
- Change np.dot to np.matmul for proper batched 3D tensor multiplication
- Add requires_grad preservation in transpose() operation
- Fixes attention mechanism gradient flow issues

Regression tests added in tests/regression/test_gradient_flow_fixes.py
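
For context, a minimal sketch of the two behaviours this commit touches. The shapes, the variable names, and the ToyTensor class are assumptions made for illustration; they are not taken from the TinyTorch source. The first part shows why np.dot is unsuitable for batched 3D inputs, and the second shows one way a transpose() could carry requires_grad forward.

```python
import numpy as np

# Assumed shapes for illustration: batch=2, seq_len=3, head_dim=4.
rng = np.random.default_rng(0)
q = rng.standard_normal((2, 3, 4))
k = rng.standard_normal((2, 3, 4))
k_t = np.swapaxes(k, -2, -1)  # shape (2, 4, 3)

# np.dot contracts the last axis of q with the second-to-last axis of k_t
# and keeps every other axis, so each batch of q meets each batch of k_t.
dot_scores = np.dot(q, k_t)

# np.matmul treats leading axes as batch axes and multiplies batch i with batch i.
matmul_scores = np.matmul(q, k_t)

print(dot_scores.shape)     # (2, 3, 2, 3)
print(matmul_scores.shape)  # (2, 3, 3)

# The per-batch products sit on the "diagonal" batches of the np.dot result.
same = all(
    np.allclose(dot_scores[i, :, i, :], matmul_scores[i]) for i in range(2)
)


class ToyTensor:
    """Hypothetical stand-in for a tensor class; not TinyTorch's actual API."""

    def __init__(self, data, requires_grad=False):
        self.data = np.asarray(data)
        self.requires_grad = requires_grad

    def transpose(self, dim0=-2, dim1=-1):
        # Copy the flag so the result is still tracked for gradients.
        return ToyTensor(
            np.swapaxes(self.data, dim0, dim1),
            requires_grad=self.requires_grad,
        )


t = ToyTensor(k, requires_grad=True).transpose()
print(t.requires_grad)  # True
```

If transpose() returned a tensor with the default requires_grad=False, the keys in an attention score computation would drop out of gradient tracking, which is the kind of gradient flow issue the commit describes.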