mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-03-12 11:03:34 -05:00

Go to file

Vijay Janapa Reddi 7b0404345e Clean up CIFAR-10 examples and achieve 57.2% accuracy

Major cleanup and optimization of CIFAR-10 classification examples:

📁 Directory cleanup:
- Removed 25+ experimental/debug files
- Streamlined to 3 clean, well-documented examples
- Clear file organization and purpose

🎯 Main achievements:
- train_cifar10_mlp.py: 57.2% test accuracy (exceeds course benchmarks!)
- train_simple_baseline.py: ~40% baseline for comparison
- train_lenet5.py: Historical LeNet-5 adaptation

📊 Performance improvements:
- Fixed autograd bias gradient aggregation bug
- Optimized weight initialization (He × 0.5)
- Enhanced data augmentation (flip, brightness, translation)
- Better normalization ([-2, 2] range)
- Learning rate scheduling and decay

📚 Documentation:
- Comprehensive README with performance analysis
- Literature comparison showing TinyTorch excellence
- Clear optimization technique explanations
- Educational value and next steps

🏆 Key results:
- 57.2% accuracy exceeds CS231n/CS229 benchmarks (50-55%)
- Approaches research MLP SOTA (60-65%)
- Proves TinyTorch builds working ML systems
- Students can be proud of their autograd implementation!

Technical fixes:
- Autograd add operation now handles broadcasting correctly
- Bias gradients aggregated over batch dimension
- Loss functions return Variables with gradient tracking
- Comprehensive test suite for gradient shapes

2025-09-21 15:38:31 -04:00

.claude

Standardize all module introductions and fix agent structure

2025-09-18 14:16:58 -04:00

.cursor/rules

docs: Add comprehensive integration testing rules

2025-07-18 00:33:26 -04:00

.github

Add documentation standards and development setup

2025-09-15 13:30:10 -04:00

.tito

Add gamified capability showcase system with module completion integration

2025-09-19 18:17:02 -04:00

assignments

Add example NBGrader assignments for 01_setup module

2025-09-16 08:42:11 -04:00

bin

Update CLI references and virtual environment activation

2025-07-13 15:52:09 -04:00

book

Remove redundant modules and streamline to 16-module structure

2025-09-18 16:41:43 -04:00

capabilities

Add TinyTorch examples gallery and fix module integration issues

2025-09-21 10:00:11 -04:00

demos

Add educational descriptions and interpretation guides to all demos

2025-09-18 19:54:34 -04:00

docs

Add TinyTorch examples gallery and fix module integration issues

2025-09-21 10:00:11 -04:00

instructor

Reorganize repository structure with instructor resources

2025-07-13 09:15:49 -04:00

logo

Add bright version of TinyTorch logo for theme support

2025-09-19 18:17:08 -04:00

milestones

Complete auto-generated warning system and establish core file protection

2025-09-21 11:43:35 -04:00

modules/source

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

scripts

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

tests

Add TinyTorch examples gallery and fix module integration issues

2025-09-21 10:00:11 -04:00

tinyGPT

Implement interactive ML Systems questions and standardize module structure

2025-09-17 14:42:24 -04:00

tinytorch

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

tito

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

.claude-startup-check.md

Add Package Manager agent to ensure module integration

2025-09-16 00:45:05 -04:00

.editorconfig

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

.envrc

Add TinyTorch examples gallery and fix module integration issues

2025-09-21 10:00:11 -04:00

.gitattributes

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

.gitignore

Update gitignore for NBGrader and virtual environment

2025-09-16 02:34:10 -04:00

CLAUDE.md

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

gradebook.db

Simplify export workflow: remove module_paths.txt, use dynamic discovery

2025-07-12 17:19:22 -04:00

nbgrader_config.py

Clean up NBGrader configuration for TinyTorch

2025-09-16 02:36:57 -04:00

pyproject.toml

Fix pytest configuration timeout issue

2025-07-14 19:35:11 -04:00

README.md

Clean up README for better GitHub presentation

2025-09-18 20:24:59 -04:00

requirements.txt

…

settings.ini

Simplify export workflow: remove module_paths.txt, use dynamic discovery

2025-07-12 17:19:22 -04:00

setup-dev.sh

Add documentation standards and development setup

2025-09-15 13:30:10 -04:00

test_all_demos.py

Add comprehensive demo testing and validation scripts

2025-09-18 20:12:49 -04:00

test_bias_fix.py

feat: Implement comprehensive student protection system for TinyTorch

2025-09-21 12:22:18 -04:00

test_milestone_1.py

Add TinyTorch examples gallery and fix module integration issues

2025-09-21 10:00:11 -04:00

validate_demos.py

Add comprehensive demo testing and validation scripts

2025-09-18 20:12:49 -04:00

README.md

TinyTorch 🔥

Build ML Systems From First Principles

A Harvard University course that teaches ML systems engineering by building a complete deep learning framework from scratch. From tensors to transformers, understand every line of code powering modern AI.

🎯 What You'll Build

A complete ML framework capable of:

Training CNNs on CIFAR-10 to 75%+ accuracy
Building GPT-style language models
Implementing modern optimizers (Adam, learning rate scheduling)
Production deployment with monitoring and MLOps

All built from scratch using only NumPy - no PyTorch, no TensorFlow!

🚀 Quick Start

# Clone and setup
git clone https://github.com/mlsysbook/TinyTorch.git
cd TinyTorch
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
pip install -e .

# Start learning
cd modules/source/01_setup
jupyter lab setup_dev.py

# Track progress
tito checkpoint status

📚 Course Structure

16 Progressive Modules

Module	Topic	What You Build
Foundations
01	Setup	Development environment
02	Tensors	N-dimensional arrays
03	Activations	ReLU, Sigmoid, Softmax
04	Layers	Dense layers
05	Networks	Sequential models
Deep Learning
06	Spatial	CNNs for vision
07	Attention	Transformers
08	DataLoader	Efficient data pipelines
09	Autograd	Automatic differentiation
10	Optimizers	SGD, Adam
Production
11	Training	Complete training loops
12	Compression	Model optimization
13	Kernels	Performance optimization
14	Benchmarking	Profiling tools
15	MLOps	Production deployment
Language Models
16	TinyGPT	Complete GPT implementation

🎓 Learning Philosophy

Most courses teach you to USE frameworks. TinyTorch teaches you to UNDERSTAND them.

# Traditional Course:
import torch
model.fit(X, y)  # Magic happens

# TinyTorch:
# You implement every component
# You measure memory usage
# You optimize performance
# You understand the systems

Why Build Your Own Framework?

✅ Deep Understanding - Know exactly what loss.backward() does
✅ Systems Thinking - Understand memory, compute, and scaling
✅ Debugging Skills - Fix problems at any level of the stack
✅ Production Ready - Learn patterns used in real ML systems

🛠️ Key Features

For Students

Interactive Demos: Rich CLI visualizations for every concept
Checkpoint System: Track your learning progress
Immediate Testing: Validate your implementations instantly
Real Datasets: Train on CIFAR-10, not toy examples

For Instructors

NBGrader Integration: Automated grading workflow
Progress Tracking: Monitor student achievements
Jupyter Book: Professional course website
Complete Solutions: Reference implementations included

📊 Example: Train a CNN on CIFAR-10

from tinytorch.core.networks import Sequential
from tinytorch.core.spatial import Conv2D
from tinytorch.core.activations import ReLU
from tinytorch.core.dataloader import CIFAR10Dataset
from tinytorch.core.training import Trainer
from tinytorch.core.optimizers import Adam

# Load real data
dataset = CIFAR10Dataset(download=True)
train_loader = DataLoader(dataset.train_data, batch_size=32)

# Build CNN
model = Sequential([
    Conv2D(3, 32, kernel_size=3),
    ReLU(),
    Conv2D(32, 64, kernel_size=3),
    ReLU(),
    Dense(64*28*28, 10)
])

# Train
trainer = Trainer(model, loss=CrossEntropyLoss(), optimizer=Adam())
trainer.fit(train_loader, epochs=30)
# Achieves 75%+ accuracy!

🧪 Testing & Validation

All demos and modules are thoroughly tested:

# Test all demos
python test_all_demos.py

# Validate implementations
python validate_demos.py

# Run checkpoint tests
tito checkpoint test 01

✅ 100% test coverage across 8 interactive demos
✅ 48 validation checks ensuring correctness
✅ 16 capability checkpoints tracking progress

📖 Documentation

Course Website - Complete interactive course
Instructor Guide - Teaching resources
API Reference - Framework documentation

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

📄 License

MIT License - see LICENSE for details.

🙏 Acknowledgments

Created by Prof. Vijay Janapa Reddi at Harvard University.

Special thanks to students and contributors who helped refine this educational framework.

Start Small. Go Deep. Build ML Systems.

Languages

Python 84.5%

Jupyter Notebook 7.4%

HTML 2.8%

TeX 2.2%

JavaScript 1.3%

Other 1.8%