TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-03-12 06:13:35 -05:00

Files

Vijay Janapa Reddi 3265eabe79 Add Profiler demo to Module 17 Quantization

- Added Section 5.5: Measuring Quantization Savings with Profiler
- Demonstrates FP32 to INT8 memory reduction (4x savings)
- Shows actual memory measurements before/after quantization
- Uses Profiler from Module 15 for measurements
- Educates students on production workflow: measure compress validate deploy

2025-11-06 20:38:44 -05:00

quantization_dev.ipynb

Module 17: Export QuantizationComplete for INT8 quantization

2025-11-06 15:50:48 -05:00

quantization_dev.py

Add Profiler demo to Module 17 Quantization

2025-11-06 20:38:44 -05:00