mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-05-23 01:30:51 -05:00

Vijay Janapa Reddi d14f92a9b2 Simplify test discovery and clean up test function names across all modules

MAJOR IMPROVEMENT: Simplified test discovery logic
- Removed restrictive valid_patterns requirement from testing framework
- Any function starting with 'test_' is now automatically discovered
- Follows standard pytest conventions - no maintenance overhead
- Eliminates need to manually add patterns for new test functions

CLEANED UP: Test function names across all 10 modules
- Removed redundant '_comprehensive' suffix from all test functions
- Updated 40+ test function names to be more concise and readable:
  * 00_setup: 6 functions (test_personal_info, test_system_info, etc.)
  * 01_tensor: 4 functions (test_tensor_creation, test_tensor_properties, etc.)
  * 02_activations: 1 function (test_activations)
  * 03_layers: 3 functions (test_matrix_multiplication, test_dense_layer, etc.)
  * 04_networks: 4 functions (test_sequential_networks, test_mlp_creation, etc.)
  * 05_cnn: 3 functions (test_convolution_operation, test_conv2d_layer, etc.)
  * 06_dataloader: 4 functions (test_dataset_interface, test_dataloader, etc.)
  * 07_autograd: 6 functions (test_variable_class, test_add_operation, etc.)
  * 08_optimizers: 5 functions (test_gradient_descent_step, test_sgd_optimizer, etc.)
  * 09_training: 6 functions (test_mse_loss, test_crossentropy_loss, etc.)
  * 10_compression: 6 functions (already cleaned up)

VERIFICATION: All tests still pass
- All 10 modules tested successfully with new discovery logic
- Total test count maintained: 47 inline tests across all modules
- No functionality lost, only improved maintainability

RESULT: Much cleaner, more maintainable testing framework following standard conventions

2025-07-14 10:24:04 -04:00


README.md

Simplify export workflow: remove module_paths.txt, use dynamic discovery

2025-07-12 17:19:22 -04:00


🧱 Module 2: Layers - Neural Network Building Blocks

📊 Module Info

Difficulty: ⭐⭐ Intermediate
Time Estimate: 4-5 hours
Prerequisites: Tensor, Activations modules
Next Steps: Networks module

Build the fundamental transformations that compose into neural networks

🎯 Learning Objectives

After completing this module, you will:

Understand layers as functions that transform tensors: y = f(x)
Implement Dense layers with linear transformations: y = Wx + b (computed as y = xW + b when inputs are batches of row vectors)
Add activation functions for nonlinearity (ReLU, Sigmoid, Tanh)
See how neural networks are just function composition
Build intuition for neural network architecture before diving into training

🧱 Build → Use → Understand

This module follows the TinyTorch pedagogical framework:

Build: Dense layers and activation functions from scratch
Use: Transform tensors and see immediate results
Understand: How neural networks transform information

📚 What You'll Build

Dense Layer

layer = Dense(input_size=3, output_size=2)
x = Tensor([[1.0, 2.0, 3.0]])
y = layer(x)  # Shape: (1, 2)

Activation Functions

relu = ReLU()
sigmoid = Sigmoid()
tanh = Tanh()

x = Tensor([[-1.0, 0.0, 1.0]])
y_relu = relu(x)        # [[0.0, 0.0, 1.0]]
y_sigmoid = sigmoid(x)  # [[0.27, 0.5, 0.73]] (rounded)
y_tanh = tanh(x)        # [[-0.76, 0.0, 0.76]] (rounded)

Neural Networks

# 3 → 4 → 2 network
layer1 = Dense(input_size=3, output_size=4)
activation1 = ReLU()
layer2 = Dense(input_size=4, output_size=2)
activation2 = Sigmoid()

# Forward pass
x = Tensor([[1.0, 2.0, 3.0]])
h1 = layer1(x)
h1_activated = activation1(h1)
h2 = layer2(h1_activated)
output = activation2(h2)

🚀 Getting Started

Prerequisites

Complete Module 1: Tensor ✅
Understand basic linear algebra (matrix multiplication)
Be familiar with Python classes and methods

Quick Start

# Navigate to the layers module
cd modules/layers

# Work in the development notebook
jupyter notebook layers_dev.ipynb

# Or work in the Python file
code layers_dev.py

📖 Module Structure

modules/layers/
├── layers_dev.py           # Main development file (work here!)
├── layers_dev.ipynb        # Jupyter notebook version
├── tests/
│   └── test_layers.py      # Comprehensive tests
├── README.md               # This file
└── solutions/              # Reference implementations (if stuck)

🎓 Learning Path

Step 1: Dense Layer (Linear Transformation)

Understand y = Wx + b
Implement weight initialization
Handle matrix multiplication and bias addition
Test with single examples and batches
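
The points in Step 1 can be sketched with plain NumPy. This is a minimal illustration, not the module's reference solution: `DenseSketch` is a hypothetical stand-in for `Dense`, it works on NumPy arrays rather than the TinyTorch `Tensor`, and the zero bias and `seed` argument are assumptions made for the example.

```python
import numpy as np


class DenseSketch:
    """Hypothetical stand-in for the module's Dense layer, using NumPy arrays."""

    def __init__(self, input_size, output_size, seed=0):
        rng = np.random.default_rng(seed)
        # Xavier/Glorot uniform bound, the same formula shown under "Gradient Issues"
        limit = np.sqrt(6.0 / (input_size + output_size))
        # Stored as (input_size, output_size) so a batch of row vectors multiplies directly
        self.weights = rng.uniform(-limit, limit, (input_size, output_size))
        self.bias = np.zeros(output_size)  # assumption: bias starts at zero

    def __call__(self, x):
        # x: (batch, input_size) -> (batch, output_size); the bias broadcasts over the batch
        return x @ self.weights + self.bias


layer = DenseSketch(input_size=3, output_size=2)
single = layer(np.array([[1.0, 2.0, 3.0]]))  # one example
batch = layer(np.ones((5, 3)))               # five examples at once
print(single.shape, batch.shape)             # (1, 2) (5, 2)
```

Storing the weights with shape (input_size, output_size) means the same code handles a single example and a batch, because the batch dimension is simply the leading axis of `x`.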

Step 2: Activation Functions

Implement ReLU: max(0, x)
Implement Sigmoid: 1 / (1 + e^(-x))
Implement Tanh: tanh(x)
Understand why nonlinearity is crucial
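
The three formulas above translate almost directly into NumPy. The sketch below uses plain functions on arrays as an assumption for brevity; the module itself wraps these in `ReLU`, `Sigmoid`, and `Tanh` classes that operate on `Tensor` objects.

```python
import numpy as np


def relu(x):
    # max(0, x), applied elementwise
    return np.maximum(0.0, x)


def sigmoid(x):
    # 1 / (1 + e^(-x)), applied elementwise
    return 1.0 / (1.0 + np.exp(-x))


def tanh(x):
    # NumPy already provides the hyperbolic tangent
    return np.tanh(x)


x = np.array([[-1.0, 0.0, 1.0]])
print(relu(x))
print(np.round(sigmoid(x), 2))
print(np.round(tanh(x), 2))
```

Rounded to two decimals, the results match the values listed under "What You'll Build": ReLU zeroes the negative input, Sigmoid maps into (0, 1), and Tanh maps into (-1, 1).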

Step 3: Layer Composition

Chain layers together
Build complete neural networks
See how simple layers create complex functions
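
Chaining layers is ordinary function composition, which can be sketched as follows. The `compose` helper and the randomly drawn weights are hypothetical and exist only for this example; the Networks module may structure composition differently.

```python
from functools import reduce

import numpy as np


def compose(*layers):
    """Return a function that applies the given callables left to right."""
    def network(x):
        return reduce(lambda acc, layer: layer(acc), layers, x)
    return network


rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(3, 4)), np.zeros(4)  # 3 -> 4
W2, b2 = rng.normal(size=(4, 2)), np.zeros(2)  # 4 -> 2


def dense1(x):
    return x @ W1 + b1


def relu(x):
    return np.maximum(0.0, x)


def dense2(x):
    return x @ W2 + b2


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


# Input -> Dense -> ReLU -> Dense -> Sigmoid -> Output
network = compose(dense1, relu, dense2, sigmoid)
x = np.array([[1.0, 2.0, 3.0]])
output = network(x)
print(output.shape)  # (1, 2)
```

Calling `network(x)` gives the same result as writing `sigmoid(dense2(relu(dense1(x))))` by hand, which is the step-by-step forward pass shown earlier in this README.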

Step 4: Real-World Application

Build an image classification network
Understand how architecture affects capability
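
One rough way to compare architectures is to count their trainable parameters. The sketch below assumes a hypothetical classifier for 28x28 grayscale images (784 inputs) with 10 output classes; those sizes and the `count_parameters` helper are illustrative assumptions, not part of the module.

```python
def count_parameters(layer_sizes):
    """Total weights plus biases for dense layers connecting consecutive sizes."""
    return sum(
        n_in * n_out + n_out  # weight matrix plus bias vector
        for n_in, n_out in zip(layer_sizes, layer_sizes[1:])
    )


small = count_parameters([784, 32, 10])   # narrow hidden layer
large = count_parameters([784, 128, 10])  # wider hidden layer
print(small, large)  # 25450 101770
```

Widening the hidden layer from 32 to 128 units roughly quadruples the parameter count. More parameters allow a network to represent more complex functions, but they also cost more memory and compute.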

🧪 Testing Your Implementation

Module-Level Tests

# Run comprehensive tests
python -m pytest tests/test_layers.py -v

# Quick test
python -c "from layers_dev import Dense, ReLU; print('✅ Layers working!')"

Package-Level Tests

# Export to package
python ../../bin/tito.py sync

# Test integration
python ../../bin/tito.py test --module layers

🎯 Key Concepts

Layers as Functions

Input: Tensor with some shape
Transformation: Mathematical operation
Output: Tensor with possibly different shape

Linear vs Nonlinear

Dense layers: Linear transformations
Activation functions: Nonlinear transformations
Composition: Linear + Nonlinear = Complex functions
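
A small worked example shows why the nonlinear step matters: two dense layers stacked without an activation collapse into a single dense layer. The weights below are arbitrary values chosen for illustration.

```python
import numpy as np

W1 = np.array([[1.0, -1.0], [0.5, 2.0]])
b1 = np.array([0.0, 1.0])
W2 = np.array([[2.0], [-1.0]])
b2 = np.array([0.5])
x = np.array([[1.0, -2.0]])

# Two linear layers applied back to back
stacked = (x @ W1 + b1) @ W2 + b2

# The same mapping expressed as one linear layer
W_combined = W1 @ W2
b_combined = b1 @ W2 + b2
collapsed = x @ W_combined + b_combined

# Inserting ReLU between the layers breaks the equivalence
with_relu = np.maximum(0.0, x @ W1 + b1) @ W2 + b2

print(stacked, collapsed, with_relu)  # [[4.5]] [[4.5]] [[0.5]]
```

Because (xW1 + b1)W2 + b2 = x(W1W2) + (b1W2 + b2), any depth of purely linear layers is equivalent to one layer. The activation function is what lets additional layers add expressive power.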

Neural Networks = Function Composition

Input → Dense → ReLU → Dense → Sigmoid → Output

Why This Matters

Modularity: Build complex networks from simple parts
Reusability: Same layers work for different problems
Understanding: Know how each part contributes to the whole

🔍 Common Issues

Import Errors

# Run from modules/layers so that '../../' resolves to the repository root
import sys
sys.path.append('../../')
from modules.tensor.tensor_dev import Tensor

Shape Mismatches

# Check input/output sizes match
layer1 = Dense(input_size=3, output_size=4)
layer2 = Dense(input_size=4, output_size=2)  # 4 matches output of layer1
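
A shape mismatch can be caught before any data flows through the network. The helper below is a hypothetical sketch (it is not part of TinyTorch) that checks a list of (input_size, output_size) pairs in forward order.

```python
def check_layer_sizes(sizes):
    """Raise ValueError if a layer's output size differs from the next layer's input size."""
    for i, ((_, out_size), (in_size, _)) in enumerate(zip(sizes, sizes[1:])):
        if out_size != in_size:
            raise ValueError(
                f"layer {i} outputs {out_size} features but layer {i + 1} expects {in_size}"
            )


check_layer_sizes([(3, 4), (4, 2)])  # sizes line up, nothing is raised

message = None
try:
    check_layer_sizes([(3, 4), (5, 2)])  # 4 != 5
except ValueError as err:
    message = str(err)
print(message)  # layer 0 outputs 4 features but layer 1 expects 5
```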

Gradient Issues (Later)

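# Use proper weight initialization (Xavier/Glorot uniform)
import math
import numpy as np
input_size, output_size = 3, 4  # example sizes
limit = math.sqrt(6.0 / (input_size + output_size))
weights = np.random.uniform(-limit, limit, (input_size, output_size))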
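
The bound in this snippet is the Xavier/Glorot uniform limit. A uniform distribution on (-a, a) has variance a^2 / 3, so this choice of limit gives the weights a variance of 2 / (input_size + output_size). The check below is a sketch with arbitrary example sizes and a fixed seed, both chosen only for illustration.

```python
import math

import numpy as np

input_size, output_size = 256, 128  # arbitrary sizes for the check
limit = math.sqrt(6.0 / (input_size + output_size))

rng = np.random.default_rng(0)
weights = rng.uniform(-limit, limit, (input_size, output_size))

expected_var = 2.0 / (input_size + output_size)  # limit**2 / 3
sample_var = weights.var()

print(f"expected variance: {expected_var:.6f}")
print(f"sample variance:   {sample_var:.6f}")
```

Tying the weight scale to the layer sizes keeps activations from growing or shrinking sharply as they pass through successive layers, which becomes important once gradients are introduced in later modules.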

🎉 Success Criteria

You've successfully completed this module when:

✅ All tests pass (pytest tests/test_layers.py)
✅ You can build a 2-layer neural network
✅ You understand how layers transform tensors
✅ You see the connection between layers and neural networks
✅ Package export works (python ../../bin/tito.py test --module layers)

🚀 What's Next

After completing this module, you're ready for:

Module 3: Networks - Compose layers into common architectures
Module 4: Training - Learn how networks improve through experience
Module 5: Applications - Use networks for real problems

🤝 Getting Help

Check the tests for examples of expected behavior
Look at the solutions/ directory if you're stuck
Review the pedagogical principles in docs/pedagogy/
Remember: Build → Use → Understand!

Great job building the foundation of neural networks! 🎉

This module implements the core insight: a neural network is just a composition of simple building blocks.