mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-03 02:46:27 -05:00

Files

Vijay Janapa Reddi 60711bf322 Implements neural network architectures module

Initializes the networks module, enabling the composition of layers into complete neural network architectures.

It introduces sequential networks, MLP creation, and network visualization tools to facilitate architecture understanding and analysis.

Adds practical classification and regression network implementations and network behavior analysis capabilities.

2025-07-11 15:30:21 -04:00

tests

feat: add Networks module with forward-pass and visualizations

2025-07-10 23:16:12 -04:00

networks_dev.ipynb

feat: add Networks module with forward-pass and visualizations

2025-07-10 23:16:12 -04:00

networks_dev.py

Implements neural network architectures module

2025-07-11 15:30:21 -04:00

README.md

docs: Clarify MLP is a use case, not a fundamental module; remove empty mlp/ dir

2025-07-10 23:29:47 -04:00

README.md

🧠 Module 3: Networks - Neural Network Architectures

Compose layers into complete neural network architectures with powerful visualizations

🎯 Learning Objectives

After completing this module, you will:

Understand networks as function composition: f(x) = layer_n(...layer_2(layer_1(x)))
Build common architectures (MLP, CNN) from layers
Visualize network structure and data flow
See how architecture affects capability
Master forward pass inference (no training yet!)

Note: MLP (Multi-Layer Perceptron) is not a fundamental building block, but a use case of composing Dense layers and activations in sequence. In TinyTorch, you will learn to build MLPs by composing primitives, not as a separate module. This approach helps you see that all architectures (MLP, CNN, etc.) are just patterns of composition, not new primitives.

🧠 Build → Use → Understand

This module follows the TinyTorch pedagogical framework:

Build: Compose layers into complete networks
Use: Create different architectures and run inference
Understand: How architecture design affects network behavior

📚 What You'll Build

Sequential Network

# Basic network composition
network = Sequential([
    Dense(784, 128),
    ReLU(),
    Dense(128, 64),
    ReLU(),
    Dense(64, 10),
    Sigmoid()
])

# Forward pass
x = Tensor([[1.0, 2.0, 3.0, ...]])  # Input data
output = network(x)  # Network prediction

MLP (Multi-Layer Perceptron)

# Create MLP for classification
mlp = create_mlp(
    input_size=784,      # 28x28 image
    hidden_sizes=[128, 64],  # Hidden layers
    output_size=10,      # 10 classes
    activation=ReLU,
    output_activation=Sigmoid
)

Specialized Networks

# Classification network
classifier = create_classification_network(
    input_size=100, num_classes=2
)

# Regression network  
regressor = create_regression_network(
    input_size=13, output_size=1
)

🎨 Visualization Features

This module includes powerful visualizations to help you understand:

Network Architecture Visualization

Layer-by-layer structure: See how layers connect
Color-coded layers: Different colors for Dense, ReLU, Sigmoid, etc.
Connection arrows: Visualize data flow between layers
Layer details: Input/output sizes and parameters

Data Flow Visualization

Shape transformations: See how tensor shapes change through the network
Activation patterns: Visualize intermediate layer outputs
Statistics tracking: Mean, std, and distribution of activations
Layer analysis: Understand what each layer learns

Network Comparison

Side-by-side analysis: Compare different architectures
Performance metrics: Output distributions and statistics
Architectural insights: Layer type distributions and complexity

Behavior Analysis

Input-output relationships: How inputs map to outputs
Activation patterns: Layer-by-layer activation analysis
Network depth: Understanding the role of depth vs width
Practical insights: Real-world application considerations

🚀 Getting Started

Prerequisites

Complete Module 1: Tensor ✅
Complete Module 2: Layers ✅
Understand basic function composition
Familiar with matplotlib for visualizations

Quick Start

# Navigate to the networks module
cd modules/networks

# Work in the development notebook
jupyter notebook networks_dev.ipynb

# Or work in the Python file
code networks_dev.py

📖 Module Structure

modules/networks/
├── networks_dev.py           # Main development file (work here!)
├── networks_dev.ipynb        # Jupyter notebook version
├── tests/
│   └── test_networks.py      # Comprehensive tests
├── README.md                # This file
└── solutions/               # Reference implementations (if stuck)

🎓 Learning Path

Step 1: Sequential Network (Function Composition)

Understand f(x) = layer_n(...layer_1(x))
Implement basic network composition
Test with simple examples

Step 2: Network Visualization

Visualize network architectures
Understand data flow through networks
Compare different network designs

Step 3: Common Architectures

Build MLPs for different tasks
Create classification networks
Design regression networks

Step 4: Behavior Analysis

Analyze network behavior with different inputs
Understand architectural trade-offs
See how design affects capability

Step 5: Practical Applications

Build networks for real problems
Understand classification vs regression
See how architecture matches task

🧪 Testing Your Implementation

Module-Level Tests

# Run comprehensive tests
python -m pytest tests/test_networks.py -v

# Quick test
python -c "from networks_dev import Sequential; print('✅ Networks working!')"

Package-Level Tests

# Export to package
python ../../bin/tito sync

# Test integration
python ../../bin/tito test --module networks

🎯 Key Concepts

Function Composition

Networks as f(x) = g(h(x))
Each layer is a function
Composition creates complex behavior

Architecture Design

Depth: Number of layers
Width: Number of neurons per layer
Activation: Nonlinearity choices
Output: Task-specific final layer

Visualization Benefits

Debugging: See where things go wrong
Understanding: Visualize complex transformations
Design: Compare different architectures
Intuition: Build mental models of networks

Practical Considerations

Input size: Must match your data
Output size: Must match your task
Hidden layers: Balance complexity vs overfitting
Activation functions: Choose based on task

🔍 Common Issues

Import Errors

# Make sure you're in the right directory
import sys
sys.path.append('../../')
from modules.layers.layers_dev import Dense
from modules.activations.activations_dev import ReLU, Sigmoid

Shape Mismatches

# Check layer sizes match
layer1 = Dense(3, 4)    # 3 inputs, 4 outputs
layer2 = Dense(4, 2)    # 4 inputs (matches layer1 output), 2 outputs

Visualization Issues

# Make sure matplotlib is installed
pip install matplotlib seaborn

# Check if plots are disabled during testing
if _should_show_plots():
    # Your visualization code
    pass

🎉 Success Criteria

You've successfully completed this module when:

✅ All tests pass (pytest tests/test_networks.py)
✅ You can build and visualize different network architectures
✅ You understand how architecture affects network behavior
✅ You can create networks for classification and regression tasks
✅ Package export works (tito test --module networks)

🚀 What's Next

After completing this module, you're ready for:

Module 4: Training - Learn how networks learn from data
Module 5: Data - Work with real datasets
Module 6: Applications - Solve real-world problems

🤝 Getting Help

Check the tests for examples of expected behavior
Look at the solutions/ directory if you're stuck
Review the pedagogical principles in docs/pedagogy/
Remember: Build → Use → Understand!

🎨 Visualization Examples

Network Architecture

Input → Dense(784,128) → ReLU → Dense(128,64) → ReLU → Dense(64,10) → Sigmoid → Output

Data Flow

(1,784) → (1,128) → (1,128) → (1,64) → (1,64) → (1,10) → (1,10)

Layer Analysis

Dense layers: Linear transformations
ReLU: Introduces nonlinearity
Sigmoid: Outputs probabilities

Build powerful neural networks with beautiful visualizations! 🚀