mirror of
https://github.com/MLSysBook/TinyTorch.git
synced 2026-04-25 15:07:30 -05:00
Module 13 (Transformers) fixes: - Remove all try/except fallback implementations (clean imports only) - Fix MultiHeadAttention signature (2 args: x, mask) - Add GELU() class instance to MLP (not standalone function) - Clean imports: Tensor, Linear, MultiHeadAttention, Embedding, PositionalEncoding, GELU Milestone 05 status: ✅ Architecture test passes ✅ Model builds successfully (67M parameters) ✅ Forward pass works ✅ Shakespeare dataset loads and tokenizes ✅ DataLoader creates batches properly Ready for training and text generation cd /Users/VJ/GitHub/TinyTorch && PYTHONPATH=/Users/VJ/GitHub/TinyTorch: python3 milestones/05_2017_transformer/vaswani_shakespeare.py --test-only --quick-test 2>&1 | tail -15