mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-04-28 14:33:18 -05:00

Go to file

Vijay Janapa Reddi 1ac530d8ff Adds module-specific sync functionality

Extends the sync command to allow users to synchronize
specific modules instead of the entire project. This
improves efficiency by reducing the scope of the nbdev
export process. Adds argument parsing for module selection.

2025-07-10 12:47:17 -04:00

bin

Adds module-specific sync functionality

2025-07-10 12:47:17 -04:00

modules

Renames development notebook filenames

2025-07-10 11:35:11 -04:00

tests

Been refactoring the structure, got setup working

2025-07-10 11:13:45 -04:00

tinytorch

Been refactoring the structure, got setup working

2025-07-10 11:13:45 -04:00

.gitattributes

Initial commit

2025-07-08 22:47:00 -04:00

.gitignore

Been refactoring the structure, got setup working

2025-07-10 11:13:45 -04:00

logo_small.jpg

Adds initial TinyTorch CLI and core structure

2025-07-09 00:23:19 -04:00

logo.png

Adds initial TinyTorch CLI and core structure

2025-07-09 00:23:19 -04:00

Makefile

Cleaning up the repo

2025-07-10 11:23:48 -04:00

PROJECT_GUIDE.md

Renames development notebook filenames

2025-07-10 11:35:11 -04:00

pyproject.toml

Been refactoring the structure, got setup working

2025-07-10 11:13:45 -04:00

QUICKSTART.md

Renames development notebook filenames

2025-07-10 11:35:11 -04:00

README.md

Renames development notebook filenames

2025-07-10 11:35:11 -04:00

requirements.txt

Refactors to use .venv for virtual environment

2025-07-09 17:40:08 -04:00

settings.ini

Cleaning up the repo

2025-07-10 11:23:48 -04:00

VISION.md

Adds TinyTorch VISION.md

2025-07-10 12:21:53 -04:00

README.md

🔥 TinyTorch: Build ML Systems from Scratch

A hands-on systems course where you implement every component of a modern ML system

TinyTorch is a comprehensive machine learning systems course where you'll build everything from tensors to production monitoring systems. Using a module-first approach, students work in self-contained modules while building a complete ML framework.

🎯 What You'll Build

By the end of this course, you will have implemented:

✅ Core tensor operations with automatic differentiation
✅ Neural network layers (Linear, CNN, RNN, Transformer)
✅ Training algorithms (SGD, Adam, distributed training)
✅ Data pipelines with efficient loading and preprocessing
✅ Model compression (pruning, quantization, distillation)
✅ Performance optimization (profiling, kernel fusion)
✅ Production systems (deployment, monitoring, MLOps)

🚀 Quick Start

1. Setup Environment

# Clone the repository
git clone https://github.com/tinytorch/TinyTorch.git
cd TinyTorch

# Setup development environment
python -m venv .venv
source .venv/bin/activate  # or `.venv\Scripts\activate` on Windows
pip install -r requirements.txt

2. Start with Setup Module

# Navigate to the setup module
cd modules/setup/

# Read the module overview
cat README.md

# Open the development notebook
jupyter lab setup_dev.ipynb

3. Development Workflow

Work in module notebooks (modules/[module]/[module]_dev.ipynb)
Mark code for export with #| export directives
Export to package with python bin/tito.py sync
Test your code with python bin/tito.py test --module [module]
Move to next module when tests pass

📚 Course Structure

TinyTorch follows a progressive module structure. Each module builds on the previous ones:

Module	Location	Topic	Exports To
setup	`modules/setup/`	Environment & Hello World	`tinytorch.core.utils`
tensor	`modules/tensor/`	Core Tensor Implementation	`tinytorch.core.tensor`
autograd	`modules/autograd/`	Automatic Differentiation	`tinytorch.core.autograd`
mlp	`modules/mlp/`	Neural Network Layers	`tinytorch.core.modules`
cnn	`modules/cnn/`	Convolutional Networks	`tinytorch.models.cnn`
training	`modules/training/`	Training Loops	`tinytorch.training`
data	`modules/data/`	Data Loading Pipeline	`tinytorch.data`
kernels	`modules/kernels/`	Custom CUDA Kernels	`tinytorch.kernels`
compression	`modules/compression/`	Model Compression	`tinytorch.compression`
profiling	`modules/profiling/`	Performance Profiling	`tinytorch.profiling`
benchmarking	`modules/benchmarking/`	Performance Benchmarks	`tinytorch.benchmarking`
config	`modules/config/`	Configuration Management	`tinytorch.config`
mlops	`modules/mlops/`	Production Monitoring	`tinytorch.mlops`

🔧 Key Commands

Command	Purpose	Example
`python bin/tito.py sync`	Export notebook code to package	Export all modules
`python bin/tito.py test --module [name]`	Test specific module	Test tensor module
`python bin/tito.py test --all`	Run all tests	Test everything
`python bin/tito.py info`	Check implementation status	Show progress
`jupyter lab [module]_dev.ipynb`	Start module development	Open tensor notebook

📦 Package Structure

The final TinyTorch package structure (auto-generated from modules):

tinytorch/                  # Auto-generated from modules/
├── __init__.py            # Main package
├── core/                  # Core ML components
│   ├── tensor.py         # From modules/tensor/
│   ├── autograd.py       # From modules/autograd/
│   ├── modules.py        # From modules/mlp/
│   └── utils.py          # From modules/setup/
├── data/                 # From modules/data/
├── training/             # From modules/training/
├── models/               # Model architectures
│   └── cnn.py           # From modules/cnn/
├── kernels/              # From modules/kernels/
├── compression/          # From modules/compression/
├── profiling/            # From modules/profiling/
├── benchmarking/         # From modules/benchmarking/
├── config/               # From modules/config/
└── mlops/                # From modules/mlops/

🎓 Learning Approach

Module-First Development

TinyTorch uses a module-first approach where each module is self-contained:

✅ Self-contained: Each module has its own notebook, tests, and documentation
✅ Progressive: Modules build on each other in a logical sequence
✅ Interactive: Work in Jupyter notebooks with immediate feedback
✅ Tested: Comprehensive tests verify your implementation
✅ Integrated: nbdev automatically exports to the main package

Development Workflow per Module

# 1. Navigate to module
cd modules/[module-name]/

# 2. Read the overview
cat README.md

# 3. Open development notebook
jupyter lab [module-name]_dev.ipynb

# 4. Implement functions with #| export
# 5. Test interactively in notebook

# 6. Export to package
python bin/tito.py sync

# 7. Run automated tests
python bin/tito.py test --module [module-name]

# 8. Move to next module when tests pass

Progressive Complexity

Foundation (setup, tensor): Basic building blocks
Core ML (autograd, mlp): Neural network fundamentals
Advanced Architectures (cnn): Specialized network types
Training Systems (training, data): Complete learning pipelines
Optimization (kernels, compression, profiling): Performance
Production (benchmarking, config, mlops): Real-world deployment

🛠️ Module Structure

Each module follows a consistent structure:

modules/[module-name]/
├── README.md              # 📖 Module overview and instructions
├── [module-name]_dev.ipynb    # 📓 Main development notebook
├── test_[module].py       # 🧪 Automated tests
└── check_[module].py      # ✅ Manual verification (optional)

Module Development Process

Read README.md - Understand learning objectives and requirements
Open notebook - Work through guided implementation
Mark exports - Use #| export for package code
Test locally - Verify functionality in notebook
Export code - Run tito sync to update package
Run tests - Ensure implementation meets requirements
Iterate - Fix issues and repeat until tests pass

📋 Requirements

Python 3.8+
Jupyter Lab/Notebook
nbdev (for notebook development)
pytest (for testing)
NumPy, Matplotlib (for ML operations)

See requirements.txt for complete dependency list.

🎯 Goals & Philosophy

Educational Goals

Deep Understanding: Implement every component from first principles
Systems Thinking: Understand how components interact
Performance Awareness: Learn to optimize real systems
Production Skills: Build systems that work in practice

Design Philosophy

Module-First: Self-contained learning units
Notebook-Driven: Interactive development with immediate feedback
Test-Driven: Comprehensive testing for reliability
Incremental: Build understanding step by step
Real-World: Techniques used in production systems

🚀 Getting Started

For New Students

Setup Environment: pip install -r requirements.txt
Start with Setup: cd modules/setup/ && cat README.md
Follow the sequence: Complete modules in order
Test frequently: Use tito test to verify progress
Build incrementally: Each module prepares for the next

For Instructors

Add modules: Copy existing module structure
Update tests: Add to modules/[name]/test_[name].py
Document well: Clear READMEs and notebook explanations
Test integration: Ensure modules work together

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

🙏 Acknowledgments

Inspired by PyTorch, fastai, and Karpathy's micrograd
Built with nbdev for seamless notebook development
Course structure inspired by modern ML systems courses

Languages

Python 84.5%

Jupyter Notebook 7.4%

HTML 2.8%

TeX 2.2%

JavaScript 1.3%

Other 1.8%