mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-03-12 01:13:34 -05:00

Go to file

Vijay Janapa Reddi 56f374efa3 FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

🎯 NORTH STAR VISION DOCUMENTED:
'Don't Just Import It, Build It' - Training AI Engineers, not just ML users

AI Engineering emerges as a foundational discipline like Computer Engineering,
bridging algorithms and systems to build the AI infrastructure of the future.

🧪 ROBUST TESTING FRAMEWORK ESTABLISHED:
- Created tests/regression/ for sandbox integrity tests
- Implemented test-driven bug prevention workflow
- Clear separation: student tests (pedagogical) vs system tests (robustness)
- Every bug becomes a test to prevent recurrence

✅ KEY IMPLEMENTATIONS:
- NORTH_STAR.md: Vision for AI Engineering discipline
- Testing best practices: Focus on robust student sandbox
- Git workflow standards: Professional development practices
- Regression test suite: Prevent infrastructure issues
- Conv->Linear dimension tests (found CNN bug)
- Transformer reshaping tests (found GPT bug)

🏗️ SANDBOX INTEGRITY:
Students need a solid, predictable environment where they focus on ML concepts,
not debugging framework issues. The framework must be invisible.

📚 EDUCATIONAL PHILOSOPHY:
TinyTorch isn't just teaching a framework - it's founding the AI Engineering
discipline by training engineers who understand how to BUILD ML systems.

This establishes the foundation for training the first generation of true
AI Engineers who will define this emerging discipline.

2025-09-25 11:16:28 -04:00

.claude

FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

2025-09-25 11:16:28 -04:00

.cursor/rules

docs: Add comprehensive integration testing rules

2025-07-18 00:33:26 -04:00

.github

Add documentation standards and development setup

2025-09-15 13:30:10 -04:00

.tito

Add gamified capability showcase system with module completion integration

2025-09-19 18:17:02 -04:00

assignments

Add example NBGrader assignments for 01_setup module

2025-09-16 08:42:11 -04:00

bin

Update CLI references and virtual environment activation

2025-07-13 15:52:09 -04:00

book

Clean up repository: remove temp files, organize modules, prepare for PyPI publication

2025-09-24 10:13:37 -04:00

docs

FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

2025-09-25 11:16:28 -04:00

instructor

MILESTONE: Complete Phase 2 CNN training pipeline

2025-09-23 18:33:56 -04:00

modules

FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

2025-09-25 11:16:28 -04:00

modules_new/00_hello

MILESTONE: Complete Phase 2 CNN training pipeline

2025-09-23 18:33:56 -04:00

tests

FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

2025-09-25 11:16:28 -04:00

tinymlperf_results

FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

2025-09-25 11:16:28 -04:00

tinytorch

FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

2025-09-25 11:16:28 -04:00

tinytorch_placeholder

MAJOR: Implement beautiful module progression through strategic reordering

2025-09-24 15:56:47 -04:00

tito

MAJOR: Implement beautiful module progression through strategic reordering

2025-09-24 15:56:47 -04:00

.editorconfig

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

.envrc

Add TinyTorch examples gallery and fix module integration issues

2025-09-21 10:00:11 -04:00

.gitattributes

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

.gitignore

Update gitignore for NBGrader and virtual environment

2025-09-16 02:34:10 -04:00

CLAUDE.md

Restructure .claude directory with comprehensive guidelines

2025-09-21 20:13:05 -04:00

CONTRIBUTING.md

Add LICENSE and CONTRIBUTING.md files

2025-09-21 16:06:24 -04:00

LICENSE

Add LICENSE and CONTRIBUTING.md files

2025-09-21 16:06:24 -04:00

NORTH_STAR.md

FOUNDATION: Establish AI Engineering as a discipline through TinyTorch

2025-09-25 11:16:28 -04:00

pyproject_placeholder.toml

MAJOR: Implement beautiful module progression through strategic reordering

2025-09-24 15:56:47 -04:00

pyproject.toml

FEAT: Complete optimization modules 15-20 with ML Systems focus

2025-09-24 22:34:20 -04:00

README.md

Clean up repository: remove temp files, organize modules, prepare for PyPI publication

2025-09-24 10:13:37 -04:00

requirements.txt

Refactors to use .venv for virtual environment

2025-07-09 17:40:08 -04:00

settings.ini

Simplify export workflow: remove module_paths.txt, use dynamic discovery

2025-07-12 17:19:22 -04:00

README.md

TinyTorch 🔥

Build ML Systems From First Principles

A Harvard University course that teaches ML systems engineering by building a complete deep learning framework from scratch. From tensors to transformers, understand every line of code powering modern AI.

🎯 What You'll Build

A complete ML framework capable of:

Training neural networks on CIFAR-10 to 55%+ accuracy (reliably achievable!)
Building GPT-style language models
Implementing modern optimizers (Adam, learning rate scheduling)
Production deployment with monitoring and MLOps

All built from scratch using only NumPy - no PyTorch, no TensorFlow!

🚀 Quick Start

# Clone and setup
git clone https://github.com/mlsysbook/TinyTorch.git
cd TinyTorch
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
pip install -e .

# Start learning
cd modules/source/01_setup
jupyter lab setup_dev.py

# Track progress
tito checkpoint status

📚 Streamlined Learning Journey - No Forward Dependencies!

12 Progressive Modules - Build Complete ML Systems Step by Step!

Part I: Neural Network Foundations (Modules 1-7)

"I can train neural networks from scratch!"

Module	Topic	What You Build	Key Innovation
01	Setup	Development environment	CLI tools, testing framework
02	Tensor	N-dimensional arrays + Basic Autograd	Gradients from the start!
03	Activations	ReLU + Softmax ONLY	Focus on what matters most
04	Layers	Linear + Module + Flatten	Complete building blocks
05	Loss	MSE + CrossEntropy	Define learning objectives
06	Optimizers	SGD + Adam	How we learn
07	Training	Complete training loops	Put it all together

✅ Capstone: XOR + MNIST - Train real neural networks after just 7 modules!

Part II: Computer Vision (Modules 8-9)

"I can build CNNs that classify real images!"

Module	Topic	What You Build
08	CNN Ops	Conv2d + MaxPool2d
09	DataLoader	Efficient data pipelines

✅ Capstone: CIFAR-10 CNN - 55%+ accuracy on real images

Part III: Language Models (Modules 10-12)

"I can build transformers that generate text!"

Module	Topic	What You Build
10	Embeddings	Token embeddings, positional encoding
11	Attention	Multi-head attention
12	Transformers	Transformer blocks

✅ Capstone: TinyGPT - Generate text with transformers

🎓 Learning Philosophy

Most courses teach you to USE frameworks. TinyTorch teaches you to UNDERSTAND them.

# Traditional Course:
import torch
model.fit(X, y)  # Magic happens

# TinyTorch:
# You implement every component
# You measure memory usage
# You optimize performance
# You understand the systems

Why Build Your Own Framework?

✅ Deep Understanding - Know exactly what loss.backward() does
✅ Systems Thinking - Understand memory, compute, and scaling
✅ Debugging Skills - Fix problems at any level of the stack
✅ Production Ready - Learn patterns used in real ML systems

🛠️ Key Features

For Students

Interactive Demos: Rich CLI visualizations for every concept
Checkpoint System: Track your learning progress
Immediate Testing: Validate your implementations instantly
Real Datasets: Train on CIFAR-10, not toy examples

For Instructors

NBGrader Integration: Automated grading workflow
Progress Tracking: Monitor student achievements
Jupyter Book: Professional course website
Complete Solutions: Reference implementations included

🔥 Examples You Can Run

As you complete modules, exciting examples unlock to show your framework in action:

After Module 07 → `examples/xornet/` + `examples/mnist/` 🔥

cd examples/xornet
python train_xor.py
# 🎯 100% accuracy on XOR problem!

cd examples/mnist
python train_mlp.py
# 🏆 95%+ accuracy on handwritten digits!

After Module 09 → `examples/cifar10/` 🎯

cd examples/cifar10
python train_cnn.py
# 🏆 55%+ accuracy on real images!

After Module 12 → `examples/tinygpt/` 🚀

cd examples/tinygpt
python train_gpt.py
# 🔥 Generate text with transformers!

These aren't toy demos - they're real ML applications achieving solid results with YOUR framework built from scratch following KISS principles!

🧪 Testing & Validation

All demos and modules are thoroughly tested:

# Run comprehensive test suite (recommended)
tito test --comprehensive

# Run checkpoint tests
tito checkpoint test 01

# Test specific modules
tito test --module tensor

# Run all module tests
python tests/run_all_modules.py

✅ 16 modules passing all tests with 100% health status
✅ 16 capability checkpoints tracking learning progress
✅ Comprehensive testing framework with module and integration tests
✅ KISS principle design for clear, maintainable code

📖 Documentation

Course Website - Complete interactive course
Instructor Guide - Teaching resources
Student Quickstart - Getting started guide
CIFAR-10 Training Guide - Detailed training walkthrough

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

📄 License

MIT License - see LICENSE for details.

🙏 Acknowledgments

Created by Prof. Vijay Janapa Reddi at Harvard University.

Special thanks to students and contributors who helped refine this educational framework.

Start Small. Go Deep. Build ML Systems.

Languages

Python 84.5%

Jupyter Notebook 7.4%

HTML 2.8%

TeX 2.2%

JavaScript 1.3%

Other 1.8%

README.md

TinyTorch 🔥

🎯 What You'll Build

🚀 Quick Start

📚 Streamlined Learning Journey - No Forward Dependencies!

12 Progressive Modules - Build Complete ML Systems Step by Step!

Part I: Neural Network Foundations (Modules 1-7)

Part II: Computer Vision (Modules 8-9)

Part III: Language Models (Modules 10-12)

🎓 Learning Philosophy

Why Build Your Own Framework?

🛠️ Key Features

For Students

For Instructors

🔥 Examples You Can Run

After Module 07 → examples/xornet/ + examples/mnist/ 🔥

After Module 09 → examples/cifar10/ 🎯

After Module 12 → examples/tinygpt/ 🚀

🧪 Testing & Validation

📖 Documentation

🤝 Contributing

📄 License

🙏 Acknowledgments

After Module 07 → `examples/xornet/` + `examples/mnist/` 🔥

After Module 09 → `examples/cifar10/` 🎯

After Module 12 → `examples/tinygpt/` 🚀