TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-04-28 21:02:45 -05:00

Files

Vijay Janapa Reddi e384e8827c fix(module-02): Rewrite Softmax to use Tensor operations

- Preserve computation graph by using Tensor arithmetic (x - x_max, exp / sum)
- No more .data extraction that breaks gradient flow
- Numerically stable with max subtraction before exp

Required for transformer attention softmax gradient flow

2025-10-27 20:29:35 -04:00

activations_dev.ipynb

fix(module-02): Rewrite Softmax to use Tensor operations

2025-10-27 20:29:35 -04:00

activations_dev.py

fix(module-02): Rewrite Softmax to use Tensor operations

2025-10-27 20:29:35 -04:00