Files
TinyTorch/tinytorch
Vijay Janapa Reddi 4fa00b51b3 fix(module-02): Rewrite Softmax to use Tensor operations
- Preserve computation graph by using Tensor arithmetic (x - x_max, exp / sum)
- No more .data extraction that breaks gradient flow
- Numerically stable with max subtraction before exp

Required for transformer attention softmax gradient flow
2025-10-27 20:29:35 -04:00
..