🧠 Module 3: Networks - Neural Network Architectures

📊 Module Info

  • Difficulty: Advanced
  • Time Estimate: 5-7 hours
  • Prerequisites: Tensor, Activations, Layers modules
  • Next Steps: Training, CNN modules

Compose layers into complete neural network architectures with powerful visualizations

🎯 Learning Objectives

After completing this module, you will:

  • Understand networks as function composition: f(x) = layer_n(...layer_2(layer_1(x)))
  • Build common architectures (MLP, CNN) from layers
  • Visualize network structure and data flow
  • See how architecture affects capability
  • Master forward pass inference (no training yet!)

Note: MLP (Multi-Layer Perceptron) is not a fundamental building block, but a use case of composing Dense layers and activations in sequence. In TinyTorch, you will learn to build MLPs by composing primitives, not as a separate module. This approach helps you see that all architectures (MLP, CNN, etc.) are just patterns of composition, not new primitives.
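To make that concrete, here is a tiny sketch of the idea. The `Dense` and `relu` below are hypothetical NumPy stand-ins, not the real TinyTorch classes; the point is only that an "MLP" is Dense layers and activations applied in sequence.

```python
import numpy as np

# Hypothetical stand-ins for TinyTorch's primitives.
class Dense:
    def __init__(self, in_size, out_size):
        self.W = np.random.randn(in_size, out_size) * 0.01  # weights
        self.b = np.zeros(out_size)                         # biases
    def __call__(self, x):
        return x @ self.W + self.b  # linear transformation

def relu(x):
    return np.maximum(0.0, x)  # elementwise nonlinearity

# Compose primitives: 784 -> 128 -> 10, with ReLU in between.
d1, d2 = Dense(784, 128), Dense(128, 10)
x = np.random.randn(1, 784)
out = d2(relu(d1(x)))
print(out.shape)  # (1, 10)
```

No new primitive was needed: the "architecture" is just the order of composition.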

🧠 Build → Use → Understand

This module follows the TinyTorch pedagogical framework:

  1. Build: Compose layers into complete networks
  2. Use: Create different architectures and run inference
  3. Understand: How architecture design affects network behavior

📚 What You'll Build

Sequential Network

# Basic network composition
network = Sequential([
    Dense(784, 128),
    ReLU(),
    Dense(128, 64),
    ReLU(),
    Dense(64, 10),
    Sigmoid()
])

# Forward pass
x = Tensor([[1.0, 2.0, 3.0, ...]])  # Input data
output = network(x)  # Network prediction
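
Under the hood, a `Sequential` like the one above can be as small as a loop. This is a sketch of the assumed structure, not necessarily TinyTorch's exact implementation:

```python
# Minimal Sequential sketch: the forward pass is repeated function
# application, f(x) = layer_n(...layer_2(layer_1(x))).
class Sequential:
    def __init__(self, layers):
        self.layers = layers
    def __call__(self, x):
        for layer in self.layers:
            x = layer(x)  # feed each layer's output into the next
        return x

# Works with any callables, e.g. plain functions:
net = Sequential([lambda x: x + 1, lambda x: x * 2])
print(net(3))  # 8, since (3 + 1) * 2
```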

MLP (Multi-Layer Perceptron)

# Create MLP for classification
mlp = create_mlp(
    input_size=784,      # 28x28 image
    hidden_sizes=[128, 64],  # Hidden layers
    output_size=10,      # 10 classes
    activation=ReLU,
    output_activation=Sigmoid
)
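
The core of a helper like `create_mlp` is pairing consecutive sizes into the `(in, out)` arguments of each Dense layer. A minimal sketch of just that step (`mlp_layer_sizes` is our illustrative name, not part of the TinyTorch API):

```python
def mlp_layer_sizes(input_size, hidden_sizes, output_size):
    # Chain all the sizes, then pair neighbours:
    # each (in, out) pair becomes one Dense layer.
    sizes = [input_size] + list(hidden_sizes) + [output_size]
    return list(zip(sizes[:-1], sizes[1:]))

print(mlp_layer_sizes(784, [128, 64], 10))
# [(784, 128), (128, 64), (64, 10)]
```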

Specialized Networks

# Classification network
classifier = create_classification_network(
    input_size=100, num_classes=2
)

# Regression network  
regressor = create_regression_network(
    input_size=13, output_size=1
)

🎨 Visualization Features

This module includes powerful visualizations to help you understand your networks:

Network Architecture Visualization

  • Layer-by-layer structure: See how layers connect
  • Color-coded layers: Different colors for Dense, ReLU, Sigmoid, etc.
  • Connection arrows: Visualize data flow between layers
  • Layer details: Input/output sizes and parameters

Data Flow Visualization

  • Shape transformations: See how tensor shapes change through the network
  • Activation patterns: Visualize intermediate layer outputs
  • Statistics tracking: Mean, std, and distribution of activations
  • Layer analysis: Understand what each layer learns
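
One way to collect such statistics is to run the layers one at a time and record shape, mean, and std after each step. This is a sketch with made-up layer names, not the module's actual visualization API:

```python
import numpy as np

def trace_activations(layers, x):
    # layers: list of (name, callable) pairs; returns the final output
    # plus per-layer (name, shape, mean, std) statistics.
    stats = []
    for name, fn in layers:
        x = fn(x)
        stats.append((name, x.shape, float(x.mean()), float(x.std())))
    return x, stats

layers = [
    ("scale", lambda x: 2.0 * x),
    ("relu",  lambda x: np.maximum(0.0, x)),
]
out, stats = trace_activations(layers, np.array([[-1.0, 0.0, 2.0]]))
for name, shape, mean, std in stats:
    print(f"{name}: shape={shape} mean={mean:.3f} std={std:.3f}")
```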

Network Comparison

  • Side-by-side analysis: Compare different architectures
  • Performance metrics: Output distributions and statistics
  • Architectural insights: Layer type distributions and complexity

Behavior Analysis

  • Input-output relationships: How inputs map to outputs
  • Activation patterns: Layer-by-layer activation analysis
  • Network depth: Understanding the role of depth vs width
  • Practical insights: Real-world application considerations

🚀 Getting Started

Prerequisites

  • Complete Module 1: Tensor
  • Complete Module 2: Layers
  • Complete the Activations module (ReLU, Sigmoid)
  • Understand basic function composition
  • Be familiar with matplotlib for visualizations

Quick Start

# Navigate to the networks module
cd modules/networks

# Work in the development notebook
jupyter notebook networks_dev.ipynb

# Or work in the Python file
code networks_dev.py

📖 Module Structure

modules/networks/
├── networks_dev.py           # Main development file (work here!)
├── networks_dev.ipynb        # Jupyter notebook version
├── tests/
│   └── test_networks.py      # Comprehensive tests
├── README.md                # This file
└── solutions/               # Reference implementations (if stuck)

🎓 Learning Path

Step 1: Sequential Network (Function Composition)

  • Understand f(x) = layer_n(...layer_1(x))
  • Implement basic network composition
  • Test with simple examples

Step 2: Network Visualization

  • Visualize network architectures
  • Understand data flow through networks
  • Compare different network designs

Step 3: Common Architectures

  • Build MLPs for different tasks
  • Create classification networks
  • Design regression networks

Step 4: Behavior Analysis

  • Analyze network behavior with different inputs
  • Understand architectural trade-offs
  • See how design affects capability

Step 5: Practical Applications

  • Build networks for real problems
  • Understand classification vs regression
  • See how architecture matches task

🧪 Testing Your Implementation

Module-Level Tests

# Run comprehensive tests
python -m pytest tests/test_networks.py -v

# Quick test
python -c "from networks_dev import Sequential; print('✅ Networks working!')"

Package-Level Tests

# Export to package
python ../../bin/tito sync

# Test integration
python ../../bin/tito test --module networks

🎯 Key Concepts

Function Composition

  • Networks as f(x) = g(h(x))
  • Each layer is a function
  • Composition creates complex behavior
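
In miniature, with plain functions standing in for layers:

```python
def h(x):          # first "layer": scale
    return 3 * x

def g(x):          # second "layer": shift
    return x + 1

def f(x):          # the "network" is just the composition g(h(x))
    return g(h(x))

print(f(2))  # 7, because h(2) = 6 and g(6) = 7
```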

Architecture Design

  • Depth: Number of layers
  • Width: Number of neurons per layer
  • Activation: Nonlinearity choices
  • Output: Task-specific final layer

Visualization Benefits

  • Debugging: See where things go wrong
  • Understanding: Visualize complex transformations
  • Design: Compare different architectures
  • Intuition: Build mental models of networks

Practical Considerations

  • Input size: Must match your data
  • Output size: Must match your task
  • Hidden layers: Balance complexity vs overfitting
  • Activation functions: Choose based on task

🔍 Common Issues

Import Errors

# Make sure you're in the right directory
import sys
sys.path.append('../../')
from modules.layers.layers_dev import Dense
from modules.activations.activations_dev import ReLU, Sigmoid

Shape Mismatches

# Check layer sizes match
layer1 = Dense(3, 4)    # 3 inputs, 4 outputs
layer2 = Dense(4, 2)    # 4 inputs (matches layer1 output), 2 outputs
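
A quick sanity check you can adapt (the helper name `check_dense_chain` is ours, not TinyTorch's): walk the `(in, out)` pairs and verify each layer's input size equals the previous layer's output size.

```python
def check_dense_chain(sizes):
    # sizes: list of (in_size, out_size) pairs for consecutive Dense layers.
    for (_, out_prev), (in_next, _) in zip(sizes, sizes[1:]):
        if out_prev != in_next:
            raise ValueError(f"shape mismatch: {out_prev} -> {in_next}")
    return True

print(check_dense_chain([(3, 4), (4, 2)]))  # True
```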

Visualization Issues

# Make sure matplotlib is installed
pip install matplotlib seaborn

# Check if plots are disabled during testing
if _should_show_plots():
    # Your visualization code
    pass

🎉 Success Criteria

You've successfully completed this module when:

  • All tests pass (pytest tests/test_networks.py)
  • You can build and visualize different network architectures
  • You understand how architecture affects network behavior
  • You can create networks for classification and regression tasks
  • Package export works (tito test --module networks)

🚀 What's Next

After completing this module, you're ready for:

  • Module 4: Training - Learn how networks learn from data
  • Module 5: Data - Work with real datasets
  • Module 6: Applications - Solve real-world problems

🤝 Getting Help

  • Check the tests for examples of expected behavior
  • Look at the solutions/ directory if you're stuck
  • Review the pedagogical principles in docs/pedagogy/
  • Remember: Build → Use → Understand!

🎨 Visualization Examples

Network Architecture

Input → Dense(784,128) → ReLU → Dense(128,64) → ReLU → Dense(64,10) → Sigmoid → Output

Data Flow

(1,784) → (1,128) → (1,128) → (1,64) → (1,64) → (1,10) → (1,10)

Layer Analysis

  • Dense layers: Linear transformations
  • ReLU: Introduces nonlinearity
  • Sigmoid: Outputs probabilities

Build powerful neural networks with beautiful visualizations! 🚀