mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-03 15:45:51 -05:00

Files

Vijay Janapa Reddi dd706227fd Adds module development documentation

Introduces documentation for TinyTorch module development, including guides for developers and AI assistants.

Provides comprehensive resources for creating high-quality, educational modules, focusing on real-world applications and systems thinking.

2025-07-11 18:38:48 -04:00

README.md

Adds module development documentation

2025-07-11 18:38:48 -04:00

README.md

Transformer Module

Status: 🚧 Coming Soon

Overview

The Transformer module will be a lightweight implementation of transformer architecture, teaching students how modern attention-based models work from the ground up.

Learning Goals

Understand attention mechanisms and their computational complexity
Implement multi-head attention from scratch
Learn about positional encoding and layer normalization
Explore transformer architecture design patterns
Understand memory and computational optimization for attention

Module Dependencies

This module builds on:

tensor - For all computations
layers - For feed-forward networks
networks - For composing transformer blocks
autograd - For training attention models
training - For training transformer models

Planned Components

Attention mechanism implementation
Multi-head attention
Positional encoding
Layer normalization
Transformer blocks
Complete transformer architecture
Memory optimization techniques
Attention visualization tools

Systems Focus

Memory management for attention matrices
Computational complexity analysis
Parallelization of multi-head attention
Optimization techniques (sparse attention, linear attention)
Scaling considerations for large sequences

This module will be implemented after the core modules (tensor, layers, networks, autograd, training) are complete.