
🔥 Module: Activations

📊 Module Info

  • Difficulty: Intermediate
  • Time Estimate: 3-4 hours
  • Prerequisites: Tensor module
  • Next Steps: Layers module

Welcome to the Activations module! This is where you'll implement the mathematical functions that give neural networks their power to learn complex patterns. Without activation functions, neural networks are just stacked linear transformations; with them, networks can approximate virtually any function. The short check below makes the "just linear" claim concrete.
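To see why, a quick NumPy check (illustrative only, not part of the module's code) shows that stacking linear layers without an activation collapses into a single linear layer:

import numpy as np

# Two stacked "layers" with no activation in between: y = W2 @ (W1 @ x)
rng = np.random.default_rng(0)
W1 = rng.standard_normal((4, 3))
W2 = rng.standard_normal((2, 4))
x = rng.standard_normal(3)

two_layers = W2 @ (W1 @ x)
one_layer = (W2 @ W1) @ x  # a single equivalent linear map

# Identical outputs: the extra "depth" added no expressive power
assert np.allclose(two_layers, one_layer)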

🎯 Learning Objectives

By the end of this module, you will be able to:

  • Understand the critical role of activation functions in enabling neural networks to learn non-linear patterns
  • Implement the two essential activation functions, ReLU and Softmax, with proper numerical stability
  • Apply mathematical reasoning to understand function properties, ranges, and appropriate use cases
  • Debug and test activation implementations using both automated tests and visual analysis
  • Connect theory to practice by understanding when and why to use each activation function

🧠 Build → Use → Analyze

This module follows TinyTorch's Build → Use → Analyze framework:

  1. Build: Implement ReLU and Softmax activation functions with numerical stability
  2. Use: Apply these functions in testing scenarios and visualize their mathematical behavior
  3. Analyze: Understand why these two functions power 90% of modern deep learning

📚 What You'll Build

🎯 STREAMLINED: Focus on What Matters

# ReLU: The workhorse of deep learning
relu = ReLU()
output = relu(Tensor([-2, -1, 0, 1, 2]))  # [0, 0, 0, 1, 2]

# Softmax: Multi-class probability distribution
softmax = Softmax()
output = softmax(Tensor([1.0, 2.0, 3.0]))  # [0.09, 0.24, 0.67] (sums to 1.0)

ReLU (Rectified Linear Unit) - The Default for Hidden Layers

  • Formula: f(x) = max(0, x)
  • Properties: Simple, sparse, fast, mitigates vanishing gradients
  • Why Essential: Powers modern CNNs and ResNets; close variants drive Transformers
  • Use Cases: Hidden layers in the vast majority of architectures (see the sketch below)
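A minimal NumPy sketch of the idea (the module's actual ReLU class wraps this logic in TinyTorch's Tensor API, so treat these names as illustrative):

import numpy as np

def relu(x: np.ndarray) -> np.ndarray:
    # Elementwise max(0, x): negatives clamp to zero, positives pass through
    return np.maximum(0.0, x)

print(relu(np.array([-2.0, -1.0, 0.0, 1.0, 2.0])))  # [0. 0. 0. 1. 2.]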

Softmax - Multi-Class Classification

  • Formula: f(x_i) = e^(x_i) / Σ_j e^(x_j)
  • Properties: Outputs are positive and sum to 1.0, giving a probability interpretation
  • Why Essential: Final layer for classification, attention weights
  • Use Cases: Classification output, attention mechanisms (see the sketch below)
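A minimal sketch, assuming plain NumPy arrays (the module version operates on TinyTorch Tensors): subtracting the maximum logit before exponentiating leaves the result mathematically unchanged but keeps exp() from overflowing.

import numpy as np

def softmax(x: np.ndarray) -> np.ndarray:
    # Stable softmax: e^(x - c) / Σ e^(x - c) equals e^x / Σ e^x for any constant c
    shifted = x - np.max(x)
    exps = np.exp(shifted)
    return exps / np.sum(exps)

print(softmax(np.array([1.0, 2.0, 3.0])))  # ~[0.09 0.24 0.67], sums to 1.0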

🧠 Why Just Two Functions?

  • ReLU: Solves vanishing gradients, enables deep networks, computationally efficient
  • Softmax: Converts logits to probabilities, differentiable, temperature control (illustrated just after this list)
  • 90% Coverage: These two functions appear in virtually every modern architecture
  • Simplicity: Focus on mastering essential concepts rather than memorizing many variants
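The temperature point deserves a quick illustration: dividing logits by a temperature T flattens (T > 1) or sharpens (T < 1) the resulting distribution. A hypothetical helper, building on the softmax sketch above:

import numpy as np

def softmax_with_temperature(x: np.ndarray, temperature: float = 1.0) -> np.ndarray:
    # Scale logits by 1/T, then apply a numerically stable softmax
    scaled = x / temperature
    exps = np.exp(scaled - np.max(scaled))
    return exps / np.sum(exps)

logits = np.array([1.0, 2.0, 3.0])
print(softmax_with_temperature(logits, 1.0))  # ~[0.09 0.24 0.67]
print(softmax_with_temperature(logits, 5.0))  # flatter, closer to uniform
print(softmax_with_temperature(logits, 0.5))  # sharper, winner takes most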

🚀 Getting Started

Prerequisites

Ensure you have completed the tensor module and understand basic tensor operations:

# Activate TinyTorch environment
source bin/activate-tinytorch.sh

# Verify tensor module is working
tito test --module tensor

Development Workflow

  1. Open the development file: modules/source/03_activations/activations_dev.py
  2. Implement functions progressively: Start with ReLU, then tackle Softmax with its numerical stability trick
  3. Test each implementation: Use inline tests for immediate feedback
  4. Visualize function behavior: Leverage plotting sections for mathematical understanding
  5. Export and verify: tito export --module activations && tito test --module activations

🧪 Testing Your Implementation

Comprehensive Test Suite

Run the full test suite to verify mathematical correctness:

# TinyTorch CLI (recommended)
tito test --module activations

# Direct pytest execution
python -m pytest tests/ -k activations -v

Test Coverage Areas

  • Mathematical Correctness: Verify function outputs match expected mathematical formulas
  • Numerical Stability: Test with extreme values and edge cases
  • Shape Preservation: Ensure input and output tensors have identical shapes
  • Range Validation: Confirm outputs fall within expected ranges
  • Integration Testing: Verify compatibility with tensor operations (a pytest-style sketch of these checks follows)
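As a sketch of what such tests look like (hypothetical, self-contained NumPy checks rather than the module's actual test suite):

import numpy as np

def test_relu_range_and_shape():
    x = np.array([[-1e6, -1.0, 0.0, 1.0, 1e6]])
    out = np.maximum(0.0, x)
    assert out.shape == x.shape   # shape preservation
    assert np.all(out >= 0.0)     # range validation: [0, ∞)

def test_softmax_stability_and_sum():
    x = np.array([1000.0, 1001.0, 1002.0])  # naive exp(x) would overflow
    exps = np.exp(x - np.max(x))            # stable shift by the max logit
    out = exps / np.sum(exps)
    assert np.isfinite(out).all()           # numerical stability
    assert np.isclose(np.sum(out), 1.0)     # outputs sum to 1.0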

Inline Testing & Visualization

The module includes comprehensive educational feedback:

# Example inline test output
🔬 Unit Test: ReLU activation...
✅ ReLU handles negative inputs correctly
✅ ReLU preserves positive inputs
✅ ReLU output range is [0, ∞)
📈 Progress: ReLU ✓

# Visual feedback with plotting
📊 Plotting ReLU behavior across range [-5, 5]...
📈 Function visualization shows expected behavior

Manual Testing Examples

from tinytorch.core.tensor import Tensor
from activations_dev import ReLU, Softmax

# Test with various inputs
x = Tensor([[-2.0, -1.0, 0.0, 1.0, 2.0]])

relu = ReLU()
softmax = Softmax()

print("Input:", x.data)
print("ReLU:", relu(x).data)       # [0, 0, 0, 1, 2]
print("Softmax:", softmax(x).data) # [0.01, 0.03, 0.09, 0.23, 0.64] (sums to 1.0)

🎯 Key Concepts

Real-World Applications

  • Computer Vision: ReLU activations enable CNNs to learn hierarchical features (like those in ResNet, VGG)
  • Natural Language Processing: Sigmoid/Tanh functions power LSTM and GRU gates for memory control
  • Recommendation Systems: Sigmoid activations provide probability outputs for binary predictions
  • Generative Models: Different activations shape the output distributions in GANs and VAEs

Mathematical Properties Comparison

The two functions you'll build are compared here with sigmoid and tanh, two classic activations covered for context:

| Function | Input Range | Output Range | Zero Point | Key Property |
|----------|-------------|--------------|------------|--------------|
| ReLU | (-∞, ∞) | [0, ∞) | f(0) = 0 | Sparse, unbounded |
| Softmax | (-∞, ∞) per logit | (0, 1), components sum to 1 | uniform (1/n) at x = 0 | Probability distribution |
| Sigmoid | (-∞, ∞) | (0, 1) | f(0) = 0.5 | Probabilistic |
| Tanh | (-∞, ∞) | (-1, 1) | f(0) = 0 | Zero-centered |

Numerical Stability Considerations

  • ReLU: No stability issues (a simple max operation)
  • Softmax: Naive exp() overflows for large logits; subtract the max logit first (demonstrated below)
  • Sigmoid: Requires careful implementation to prevent exp() overflow
  • Tanh: Generally stable; NumPy's implementation handles the edge cases
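To see the failure mode concretely, here is a minimal comparison of a naive softmax against the stabilized version on large logits (illustrative NumPy code, not the module's implementation):

import numpy as np

logits = np.array([1000.0, 1001.0, 1002.0])

# Naive: exp(1000) overflows float64 to inf, and inf/inf yields nan
with np.errstate(over="ignore", invalid="ignore"):
    naive = np.exp(logits) / np.sum(np.exp(logits))
print(naive)  # [nan nan nan]

# Stable: shifting by the max keeps every exponent <= 0
exps = np.exp(logits - np.max(logits))
stable = exps / np.sum(exps)
print(stable)  # ~[0.09 0.24 0.67], all finite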

Performance and Gradient Properties

  • ReLU: Fastest computation, sparse gradients, but can suffer from the "dying ReLU" problem
  • Sigmoid: Moderate computation, smooth gradients, but susceptible to vanishing gradients since its derivative never exceeds 0.25 (compare the derivatives in the sketch below)
  • Tanh: Moderate computation, stronger gradients than sigmoid, and its zero-centered output helps optimization
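A minimal derivative comparison in plain NumPy (ReLU's subgradient at 0 is taken as 0 here by convention):

import numpy as np

x = np.linspace(-5.0, 5.0, 5)  # [-5, -2.5, 0, 2.5, 5]

# ReLU derivative: 1 where x > 0, else 0; gradients switch off but never shrink
relu_grad = (x > 0).astype(float)

# Sigmoid derivative: s(x) * (1 - s(x)), which peaks at 0.25; multiplying
# many such factors across layers is the vanishing-gradient effect
s = 1.0 / (1.0 + np.exp(-x))
sigmoid_grad = s * (1.0 - s)

print(relu_grad)              # [0. 0. 0. 1. 1.]
print(sigmoid_grad.round(3))  # [0.007 0.07 0.25 0.07 0.007]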

🎉 Ready to Build?

The activations module is where neural networks truly come alive! You're about to implement the mathematical functions that transform simple linear operations into powerful pattern recognition systems.

Every major breakthrough in deep learning—from image recognition to language models—relies on the functions you're about to build. Take your time, understand the mathematics, and enjoy creating the foundation of intelligent systems!

::::{grid} 3
:gutter: 3
:margin: 2

:::{grid-item-card} 🚀 Launch Builder
:link: https://mybinder.org/v2/gh/VJProductions/TinyTorch/main?filepath=modules/source/03_activations/activations_dev.py
:class-title: text-center
:class-body: text-center

Interactive development environment
:::

:::{grid-item-card} 📓 Open in Colab
:link: https://colab.research.google.com/github/VJProductions/TinyTorch/blob/main/modules/source/03_activations/activations_dev.ipynb
:class-title: text-center
:class-body: text-center

Google Colab notebook
:::

:::{grid-item-card} 👀 View Source
:link: https://github.com/VJProductions/TinyTorch/blob/main/modules/source/03_activations/activations_dev.py
:class-title: text-center
:class-body: text-center

Browse the code on GitHub
:::
::::