mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-03 00:02:02 -05:00

Files

Vijay Janapa Reddi e6920c9779 Implements 2D Convolutional Neural Network module

Introduces a comprehensive module for 2D Convolutional Neural Networks.

This module provides a foundational understanding of CNNs through:
- Implementation of a naive Conv2D layer with sliding window convolution
- Visualization of kernel operations and feature map construction
- Composition of Conv2D layers with other layers to build a simple ConvNet

This structure provides a step-by-step guide to building and understanding CNNs, with clear examples and tests.

2025-07-11 15:21:58 -04:00

tests

feat: Scaffold CNN module for students

2025-07-10 23:34:35 -04:00

cnn_dev.ipynb

Improves clarity in CNN notebook

2025-07-11 15:01:16 -04:00

cnn_dev.py

Implements 2D Convolutional Neural Network module

2025-07-11 15:21:58 -04:00

README.md

feat: Scaffold CNN module for students

2025-07-10 23:34:35 -04:00

README.md

🧠 Module X: CNN - Convolutional Neural Networks

Implement the core building block of modern computer vision: the convolutional layer.

🎯 Learning Objectives

Understand the convolution operation (sliding window, local connectivity, weight sharing)
Implement Conv2D with explicit for-loops (single channel, single filter, no stride/pad)
Visualize how convolution builds feature maps
Compose Conv2D with other layers to build a simple ConvNet
(Stretch) Explore stride, padding, pooling, and multi-channel input

🧠 Build → Use → Understand

Build: Implement Conv2D from scratch (for-loop)
Use: Compose Conv2D with ReLU, Flatten, Dense to build a ConvNet
Understand: Visualize and analyze how convolution works

📚 What You'll Build

Conv2D (for-loop): The core operation, implemented by you
Conv2D Layer: Wrap your function in a layer class
Simple ConvNet: Compose Conv2D → ReLU → Flatten → Dense
Visualization: See how the filter slides and builds the output

🛠️ Provided Functionality

Stride and Padding: Provided as utilities or stretch goals
Multi-channel/Filter Support: Provided or as stretch
Pooling (Max/Avg): Optional, provided or as stretch
Flatten Layer: Provided
Visualization: Provided for learning
Tests: Provided for feedback

🤔 Why Focus on the For-Loop?

Implementing the convolution for-loop is the best way to understand what makes CNNs powerful. You’ll see exactly how the filter slides, how local patterns are captured, and why this operation is so efficient for images. Other features (stride, padding, pooling) are important, but the core insight comes from building the basic operation yourself.

🚀 Getting Started

cd modules/cnn
jupyter notebook cnn_dev.ipynb  # or edit cnn_dev.py

📖 Module Structure

modules/cnn/
├── cnn_dev.py           # Main development file (work here!)
├── cnn_dev.ipynb        # Jupyter notebook version
├── tests/
│   └── test_cnn.py      # Tests for your implementation
├── README.md            # This file

🧪 Testing Your Implementation

# Run tests
python -m pytest tests/test_cnn.py -v

🌟 Stretch Goals

Add stride and padding support
Support multi-channel input/output
Implement pooling layers
Visualize learned filters and feature maps

💡 Key Insight

Convolution is a new, fundamental building block. By implementing it yourself, you’ll understand the magic behind modern vision models!

README.md Unescape Escape

🧠 Module X: CNN - Convolutional Neural Networks

🎯 Learning Objectives

🧠 Build → Use → Understand

📚 What You'll Build

🛠️ Provided Functionality

🤔 Why Focus on the For-Loop?

🚀 Getting Started

📖 Module Structure

🧪 Testing Your Implementation

🌟 Stretch Goals

💡 Key Insight

README.md