Mirror of https://github.com/MLSysBook/TinyTorch.git (synced 2026-03-12 06:13:35 -05:00)
- Added Section 5.5: Measuring Quantization Savings with Profiler
- Demonstrates the FP32 to INT8 memory reduction (4x savings)
- Shows actual memory measurements before and after quantization
- Uses the Profiler from Module 15 for measurements
- Teaches students the production workflow: measure, compress, validate, deploy
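
The 4x figure follows from element size alone: FP32 stores each weight in 4 bytes and INT8 in 1 byte. The sketch below illustrates that arithmetic with the Python standard library only. It does not use the Profiler from Module 15, whose API is not shown here. The names `quantize_int8` and `nbytes`, the 256 x 256 layer size, and the symmetric per-tensor scheme are all assumptions made for illustration, not the section's actual code.

```python
from array import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization (assumed scheme): one scale for all values."""
    max_abs = max(abs(w) for w in weights) or 1.0  # fall back to 1.0 for all-zero input
    scale = max_abs / 127.0
    # Round to the nearest integer step and clamp into the int8 range used here.
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def nbytes(buf):
    """Bytes occupied by the raw element buffer (container overhead excluded)."""
    return len(buf) * buf.itemsize


# Hypothetical layer: 256 x 256 weights with values in [-1.0, 0.99].
n = 256 * 256
fp32_weights = array("f", ((i % 200 - 100) / 100.0 for i in range(n)))  # 'f' is a C float, 4 bytes on common platforms
int8_weights, scale = quantize_int8(fp32_weights)

before = nbytes(fp32_weights)
after = nbytes(int8_weights)
print(f"FP32: {before} bytes, INT8: {after} bytes, ratio: {before / after:.1f}x")

# Validate step: dequantize and check the worst-case rounding error.
max_err = max(abs(q * scale - w) for q, w in zip(int8_weights, fp32_weights))
print(f"max dequantization error: {max_err:.6f} (scale = {scale:.6f})")
```

The ratio counts only the weight buffers. A real deployment also stores the scale (and possibly a zero point) per tensor or per channel, so measured savings come out slightly below 4x, which is one reason to measure rather than assume.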