TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-04 04:36:12 -05:00

Files

Vijay Janapa Reddi 8cff435db9 fix(module-11): Fix Embedding and PositionalEncoding gradient flow

- Embedding.forward() now preserves requires_grad from weight tensor
- PositionalEncoding.forward() uses Tensor addition (x + pos) instead of .data
- Critical for transformer input embeddings to have gradients

Both changes ensure gradient flows from loss back to embedding weights

2025-10-27 20:30:03 -04:00

__init__.py

Add exported package files and cleanup

2025-09-30 12:38:56 -04:00

embeddings.py

fix(module-11): Fix Embedding and PositionalEncoding gradient flow

2025-10-27 20:30:03 -04:00

tokenization.py

feat: Complete transformer integration with milestones

2025-10-19 12:46:58 -04:00