TinyTorch
Build ML Systems From First Principles
🚧 Work in Progress - We're actively developing TinyTorch for Spring 2025! All core modules are complete and tested. Join us in building the future of ML systems education.
📖 Table of Contents
- Why TinyTorch?
- What You'll Build - Including the CIFAR-10 North Star Goal
- Quick Start - Get running in 5 minutes
- Learning Journey - 20 progressive modules
- Learning Progression & Checkpoints - 16 capability checkpoints
- Key Features - Essential-only design
- Milestone Examples - Real achievements
- Documentation & Resources - For students, instructors, developers
- Ready to Start Building? - Your path forward
Why TinyTorch?
"Most ML education teaches you to use frameworks. TinyTorch teaches you to build them."
In an era where AI is reshaping every industry, the difference between ML users and ML engineers determines who drives innovation versus who merely consumes it. TinyTorch bridges this critical gap by teaching you to build every component of modern AI systems from scratch—from tensors to transformers.
A Harvard University course that transforms you from framework user to systems engineer, giving you the deep understanding needed to optimize, debug, and innovate at the foundation of AI.
What You'll Build
A complete ML framework capable of:
🎯 North Star Achievement: Train CNNs on CIFAR-10 to 75%+ accuracy
- Real computer vision with 50,000 training images
- Built entirely from scratch using only NumPy
- Competitive performance with modern frameworks
Additional Capabilities:
- Building GPT-style language models with attention mechanisms
- Modern optimizers (Adam, SGD) with learning rate scheduling
- Performance profiling, optimization, and competitive benchmarking
- Complete ML systems pipeline from tensors to deployment
No dependencies on PyTorch or TensorFlow - everything is YOUR code!
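Concretely, once the relevant modules are complete, using your own framework looks like ordinary Python. Here is a sketch; the import path and method names (`tinytorch.core.tensor`, `backward`, `grad`) are illustrative assumptions based on the repository structure below, not a guaranteed API:
# Illustrative only: assumes the Tensor you build in Module 01,
# with the autograd support added in Module 05.
from tinytorch.core.tensor import Tensor
a = Tensor([[1.0, 2.0], [3.0, 4.0]])
b = Tensor([[5.0, 6.0], [7.0, 8.0]])
c = (a @ b).sum()   # matmul + reduction, both implemented by you
c.backward()        # gradients flow through YOUR autograd engine
print(a.grad)       # computed from scratch, no PyTorch involved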
Repository Structure
TinyTorch/
├── modules/ # 🏗️ YOUR workspace - implement ML systems here
│ ├── 01_tensor/ # Start: Build tensor operations from scratch
│ ├── 02_activations/ # Add: Neural network intelligence (ReLU, Softmax)
│ ├── 03_layers/ # Build: Network components (Linear, Module system)
│ └── ... # Progress through 20 learning modules
│
├── tinytorch/ # 📦 Generated package (auto-built from your work)
│ ├── core/ # Your implementations exported for use
│ ├── nn/ # Neural network components you built
│ └── optim/ # Optimizers you implemented
│
├── tests/ # 🧪 Comprehensive validation system
│ ├── checkpoints/ # 16 capability tests tracking your progress
│ └── integration/ # Full system validation tests
│
├── book/ # 📚 Complete course documentation (Jupyter Book)
│ ├── chapters/ # Learning guides for each module
│ └── resources/ # Additional learning materials
│
└── examples/ # 🎯 Milestone demonstrations (unlock as you progress)
├── mnist_training.py # Train neural networks on real data
└── cifar10_cnn.py # Achieve 75%+ accuracy on CIFAR-10
🚨 CRITICAL: Work in modules/, Import from tinytorch/
- ✅ Edit code: Always in `modules/XX_name/name_dev.py` files
- ✅ Import & use: Your built components from `tinytorch.core.component`
- ❌ Never edit: Files in `tinytorch/` directly (auto-generated from modules)
- 🔄 Sync changes: Use `tito module complete XX_name` to update the package
Why this structure? Learn by building (modules) → Use what you built (tinytorch) → Validate mastery (tests)
Quick Start
# Clone and setup
git clone https://github.com/mlsysbook/TinyTorch.git
cd TinyTorch
python -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
pip install -e .
# Start learning
cd modules/01_tensor
jupyter lab tensor_dev.py
# Track progress
tito checkpoint status
Learning Journey
20 Progressive Modules
Part I: Neural Network Foundations (Modules 1-8)
Build and train neural networks from scratch
| Module | Topic | What You Build | ML Systems Learning |
|---|---|---|---|
| 01 | Tensor | N-dimensional arrays + operations | Memory layout, cache efficiency, broadcasting semantics |
| 02 | Activations | ReLU + Softmax (essential functions) | Numerical stability, gradient flow, function properties |
| 03 | Layers | Linear layers + Module abstraction | Parameter management, weight initialization, forward/backward |
| 04 | Losses | MSE + CrossEntropy (essential losses) | Numerical precision, loss landscapes, training objectives |
| 05 | Autograd | Automatic differentiation engine | Computational graphs, memory management, gradient flow |
| 06 | Optimizers | SGD + Adam (essential optimizers) | Memory efficiency (Adam uses 3x memory), convergence |
| 07 | Training | Complete training loops + evaluation | Training dynamics, checkpoints, monitoring systems |
| 08 | Spatial | Conv2d + MaxPool2d + CNN operations | Parameter scaling, spatial locality, convolution efficiency |
Milestone Achievement: Train XOR solver and MNIST classifier after Module 8
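To preview where Part I lands, here is a compressed, standalone NumPy sketch of that whole pipeline: a linear layer, MSE loss, hand-derived gradients, and an SGD step. In TinyTorch you implement each of these as a reusable component across Modules 03-07; nothing below is the course's actual API:
import numpy as np
# Standalone sketch of the Part I pipeline; in TinyTorch each step
# becomes a component you implement (Linear, MSE, SGD, training loop).
rng = np.random.default_rng(0)
X = rng.normal(size=(64, 3))                  # toy batch of 64 samples
y = X @ np.array([[2.0], [-1.0], [0.5]]) + 0.1 * rng.normal(size=(64, 1))
W, b, lr = rng.normal(size=(3, 1)) * 0.1, np.zeros((1, 1)), 0.1
for step in range(200):
    pred = X @ W + b                          # Module 03: Linear forward
    loss = np.mean((pred - y) ** 2)           # Module 04: MSE loss
    grad_pred = 2 * (pred - y) / len(X)       # Module 05 automates this step
    W -= lr * (X.T @ grad_pred)               # Module 06: SGD update
    b -= lr * grad_pred.sum(axis=0, keepdims=True)
print(f"final loss: {loss:.4f}")              # approaches the noise floor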
Part II: Computer Vision (Modules 9-10)
Build CNNs that classify real images
| Module | Topic | What You Build | ML Systems Learning |
|---|---|---|---|
| 09 | DataLoader | Efficient data pipelines + CIFAR-10 | Batch processing (see the sketch below), memory-mapped I/O, data pipeline bottlenecks |
Milestone Achievement: CIFAR-10 CNN with 75%+ accuracy
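The batch processing called out in the DataLoader row above reduces to shuffling indices and slicing arrays. A minimal sketch (illustrative only; your Module 09 DataLoader wraps this pattern in a class with real CIFAR-10 loading):
import numpy as np
def iterate_minibatches(X, y, batch_size=32, seed=0):
    # Toy version of Module 09's DataLoader: shuffle once, then slice.
    idx = np.random.default_rng(seed).permutation(len(X))
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]
        yield X[batch], y[batch]
# Stand-in shapes for CIFAR-10 (the real set is 50,000 x 32 x 32 x 3)
X = np.zeros((100, 32, 32, 3), dtype=np.float32)
y = np.zeros(100, dtype=np.int64)
for xb, yb in iterate_minibatches(X, y):
    pass  # each xb is one batch fed to your CNN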
Part III: Language Models (Modules 11-14)
Build transformers that generate text
| Module | Topic | What You Build | ML Systems Learning |
|---|---|---|---|
| 11 | Tokenization | Text processing + vocabulary | Vocabulary scaling (memory vs sequence length), tokenization bottlenecks |
| 12 | Embeddings | Token embeddings + positional encoding | Embedding tables (vocab × dim parameters), lookup performance |
| 13 | Attention | Multi-head attention mechanisms | O(N²) scaling (see the sketch below), memory bottlenecks, attention optimization |
| 14 | Transformers | Complete transformer blocks | Layer scaling, memory requirements, architectural trade-offs |
Milestone Achievement: TinyGPT language generation
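The O(N²) scaling flagged in the attention row above shows up directly in code: the score matrix has one entry per pair of sequence positions. A single-head NumPy sketch (illustrative; Module 13 has you build the multi-head version):
import numpy as np
def attention(Q, K, V):
    # Scaled dot-product attention; Q, K, V have shape (seq_len, d).
    scores = Q @ K.T / np.sqrt(Q.shape[-1])   # (N, N): the O(N^2) cost
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)        # row-wise softmax
    return w @ V                              # weighted sum of values
N, d = 128, 64
rng = np.random.default_rng(0)
out = attention(rng.normal(size=(N, d)), rng.normal(size=(N, d)), rng.normal(size=(N, d)))
print(out.shape)  # (128, 64); doubling N quadruples the score matrix
Doubling the sequence length quadruples both the memory and the compute for the score matrix, which is why the KV caching you build in Module 19 matters for generation efficiency.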
Part IV: System Optimization (Modules 15-20)
Profile, optimize, and benchmark ML systems
| Module | Topic | What You Build | ML Systems Learning |
|---|---|---|---|
| 15 | Profiling | Performance analysis + bottleneck detection | Memory profiling, FLOP counting, Amdahl's Law, performance measurement |
| 16 | Acceleration | Hardware optimization + cache-friendly algorithms | Cache hierarchies, memory access patterns, vectorization vs loops |
| 17 | Quantization | Model compression + precision reduction | Precision trade-offs (FP32→INT8, sketched below), memory reduction, accuracy preservation |
| 18 | Compression | Pruning + knowledge distillation | Sparsity patterns, parameter reduction, compression ratios |
| 19 | Caching | Memory optimization + KV caching | Memory vs compute trade-offs, cache management, generation efficiency |
| 20 | Benchmarking | TinyMLPerf competition framework | Competitive optimization, relative performance metrics, innovation scoring |
Milestone Achievement: TinyMLPerf optimization competition
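As a taste of the FP32→INT8 idea from the quantization row above, affine quantization comes down to one scale factor plus rounding. A minimal symmetric, per-tensor sketch (Module 17 treats calibration and accuracy trade-offs properly):
import numpy as np
def quantize_int8(x):
    # Symmetric per-tensor quantization: FP32 -> INT8 plus one scale.
    scale = np.abs(x).max() / 127.0           # map the largest value to 127
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale
w = np.random.default_rng(0).normal(size=(256, 256)).astype(np.float32)
q, scale = quantize_int8(w)
err = np.abs(q.astype(np.float32) * scale - w).max()
print(f"{w.nbytes} -> {q.nbytes} bytes (4x smaller), max error {err:.4f}")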
Learning Philosophy
Most courses teach you to USE frameworks. TinyTorch teaches you to UNDERSTAND them.
# Traditional Course:
import torch
loss.backward()  # magic happens somewhere inside
# TinyTorch:
# You implement every component
# You measure memory usage
# You optimize performance
# You understand the systems
Why Build Your Own Framework?
- Deep Understanding - Know exactly what `loss.backward()` does
- Systems Thinking - Understand memory, compute, and scaling
- Debugging Skills - Fix problems at any level of the stack
- Production Ready - Learn patterns used in real ML systems
Learning Progression & Checkpoints
16-Checkpoint Capability System
Track your progress through capability-based checkpoints that validate your ML systems knowledge:
# Check your current progress
tito checkpoint status
# See your capability development timeline
tito checkpoint timeline
Checkpoint Progression:
- 00-02: Foundation (Environment, Tensors, Activations)
- 03-07: Core Networks (Layers, Losses, Autograd, Optimizers, Training)
- 08-10: Computer Vision (Spatial ops, DataLoaders, Real datasets)
- 11-14: Language Models (Tokenization, Embeddings, Attention, Transformers)
- 15: Capstone (Complete end-to-end ML systems)
Each checkpoint asks, "Can I build this capability from scratch?" and validates the answer hands-on.
Module Completion Workflow
# Complete a module (automatic export + testing)
tito module complete 01_tensor
# This automatically:
# 1. Exports your implementation to the tinytorch package
# 2. Runs the corresponding capability checkpoint test
# 3. Shows your achievement and suggests next steps
Key Features
Essential-Only Design
- Focus on What Matters: ReLU + Softmax (not 20 activation functions)
- Production Relevance: Adam + SGD (the optimizers you actually use)
- Core ML Systems: Memory profiling, performance analysis, scaling insights
- Real Applications: CIFAR-10 CNNs, not toy examples
For Students
- Interactive Demos: Rich CLI visualizations for every concept
- Checkpoint System: Track your learning progress through 16 capabilities
- Immediate Testing: Validate your implementations instantly
- Systems Focus: Learn ML engineering, not just algorithms
For Instructors
- NBGrader Integration: Automated grading workflow
- Progress Tracking: Monitor student achievements
- Jupyter Book: Professional course website
- Complete Solutions: Reference implementations included
Milestone Examples
As you complete modules, exciting examples unlock to show your framework in action:
After Module 04: First Neural Network
cd examples/perceptron_1957
python rosenblatt_perceptron.py
# Build the first trainable neural network (1957)
After Module 06: Multi-Layer Networks
cd examples/xor_1969
python minsky_xor_problem.py
# Solve the XOR problem with multi-layer networks (1969)
After Module 08: Real Computer Vision
cd examples/mnist_mlp_1986
python train_mlp.py
# Achieve 95%+ accuracy on MNIST (1986)
After Module 10: Modern CNNs
cd examples/cifar_cnn_modern
python train_cnn.py
# Achieve 75%+ accuracy on CIFAR-10
After Module 14: Language Models
cd examples/gpt_2018
python train_gpt.py
# Generate text with your transformer implementation
After Module 20: TinyMLPerf Competition
# Use TinyMLPerf to benchmark your optimizations
tito benchmark run --event mlp_sprint
tito benchmark run --event cnn_marathon
tito benchmark run --event transformer_decathlon
# Compete in ML systems optimization benchmarks
After Module 20: Complete Optimization Suite
# Use TinyMLPerf to benchmark and optimize your complete framework
tito benchmark run --comprehensive
python examples/optimization_showcase.py
# Professional ML systems optimization
These aren't toy demos - they're real ML applications achieving solid results with YOUR framework built from scratch and optimized for performance!
Testing & Validation
All demos and modules are thoroughly tested:
# Check your learning progress
tito checkpoint status
# Test specific capabilities
tito checkpoint test 01 # Foundation checkpoint
tito checkpoint test 05 # Autograd checkpoint
# Complete and test modules
tito module complete 01_tensor # Exports and tests
# Run comprehensive validation
python tests/run_all_modules.py
- 20 modules passing all tests with 100% health status
- 16 capability checkpoints tracking learning progress
- Complete optimization pipeline from profiling to benchmarking
- TinyMLPerf competition framework for performance excellence
- KISS principle design for clear, maintainable code
- Streamlined development: 7-agent workflow for efficient coordination
- Essential-only features: Focus on what's used in production ML systems
📚 Documentation & Resources
🎓 For Students
- Interactive Course Website - Complete learning platform
- Getting Started Guide - Installation and first steps
- CIFAR-10 Training Guide - Achieving the north star goal
- Module READMEs - Individual module documentation
👨‍🏫 For Instructors
- Instructor Guide - Complete teaching resources
- NBGrader Workflow - Automated grading setup
- System Architecture - Technical overview
🛠️ For Developers
- Agent Coordination - Development workflow
- Module Development - Creating new modules
- Testing Standards - Quality assurance
TinyMLPerf Competition & Leaderboard
Compete and Compare Your Optimizations
TinyMLPerf is our performance benchmarking competition where you optimize your TinyTorch implementations and compete on the leaderboard:
# Run benchmarks locally
tito benchmark run --event mlp_sprint # Quick MLP benchmark
tito benchmark run --event cnn_marathon # CNN optimization challenge
tito benchmark run --event transformer_decathlon # Ultimate transformer test
# Submit to leaderboard (coming soon)
tito benchmark submit --event cnn_marathon
Leaderboard Categories:
- Speed: Fastest inference time
- Memory: Lowest memory footprint
- Efficiency: Best accuracy/resource ratio
- Innovation: Novel optimization techniques
📊 View Leaderboard: TinyMLPerf Competition | Future: tinytorch.org/leaderboard
Contributing
We welcome contributions! See CONTRIBUTING.md for guidelines.
License
MIT License - see LICENSE for details.
Related Projects
We acknowledge several excellent educational ML framework projects with similar names:
- tinygrad - George Hotz's minimalist deep learning framework
- micrograd - Andrej Karpathy's tiny autograd engine
- MiniTorch - Cornell's educational framework
- Other TinyTorch implementations - Various educational implementations on GitHub
Our TinyTorch focuses specifically on ML systems engineering with a complete curriculum, NBGrader integration, and production deployment—designed as a comprehensive university course rather than a standalone library.
Acknowledgments
Created by Prof. Vijay Janapa Reddi at Harvard University.
Special thanks to students and contributors who helped refine this educational framework.
🚀 Ready to Start Building?
TinyTorch transforms you from ML framework user to ML systems engineer.
What Makes TinyTorch Different?
- ✅ Essential-only features - Focus on what's actually used in production
- ✅ Complete implementation - Build every component from scratch
- ✅ Real achievements - Train CNNs on CIFAR-10 to 75%+ accuracy
- ✅ Systems thinking - Understand memory, performance, and scaling
- ✅ Production relevance - Learn patterns from PyTorch and TensorFlow
- ✅ Immediate validation - 16 capability checkpoints track progress
Your Learning Journey
- Week 1-2: Foundation (Tensors, Activations, Layers)
- Week 3-4: Training Pipeline (Losses, Autograd, Optimizers, Training)
- Week 5-6: Computer Vision (Spatial ops, DataLoaders, CIFAR-10)
- Week 7-8: Language Models (Tokenization, Attention, Transformers)
- Week 9-10: Optimization (Profiling, Acceleration, Benchmarking)
Getting Started
git clone https://github.com/mlsysbook/TinyTorch.git
cd TinyTorch && source setup.sh
cd modules/01_tensor && jupyter lab tensor_dev.py
Start Small. Go Deep. Build ML Systems.