TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-02 08:32:31 -05:00

Files

Vijay Janapa Reddi 12f37fb5db feat: Add comprehensive attention module (06_attention)

- Implement scaled dot-product attention with masking support
- Build multi-head attention with learnable projections
- Create sinusoidal positional encoding for sequence understanding
- Add layer normalization for training stability
- Complete transformer block with residual connections
- Include self-attention wrapper and utility functions
- Full inline testing with 100% pass rate
- Educational content explaining attention mechanisms
- Foundation for modern AI architectures (GPT, BERT, etc.)

This module bridges classical ML (tensors, layers, networks) with
modern transformer architectures that power ChatGPT and contemporary AI.

2025-07-17 22:58:19 -04:00

01_setup

refactor: Replace "Master" with "Reflect" in learning framework

2025-07-16 11:48:28 -04:00

02_tensor

docs: Clean up whitespace and formatting in module READMEs

2025-07-16 11:50:23 -04:00

03_activations

docs: Clean up whitespace and formatting in module READMEs

2025-07-16 11:50:23 -04:00

04_layers

refactor: Implement YAML-based difficulty and time system