TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-07-24 01:30:52 -05:00

Files

T

Vijay Janapa Reddi a512c09e82 Clean up gradient broadcasting logic - more pedagogical

Refactored gradient accumulation to use clearer two-step approach:
1. Remove extra leading dimensions (batch dims)
2. Sum over dimensions that were size-1 (broadcast dims)

Benefits:
- Clearer intent: while loop for variable dims, for loop for fixed dims
- Better comments with concrete examples
- Easier for students to understand broadcasting in backprop
- Matches how you'd explain it verbally

Same functionality, cleaner code.

2025-09-30 13:53:05 -04:00

source

Clean up gradient broadcasting logic - more pedagogical

2025-09-30 13:53:05 -04:00