Mirror of https://github.com/MLSysBook/TinyTorch.git (synced 2026-05-05 17:54:48 -05:00)
- Create standalone verify_vectorization_speedup() function (Section 4)
- Measures ACTUAL timing of loop-based vs vectorized operations
- Uses time.perf_counter() for precise measurements
- Includes warmup runs for accurate timing
- Verifies >10× speedup (typical for NumPy/BLAS)
- test_module() calls verification function cleanly
- Returns dict with speedup, times, and verification status
- Includes example usage in __main__ block
- Update section numbering: Systems Analysis now Section 5

Verification shows:
- Loop-based: ~100ms for 100 iterations
- Vectorized: ~1ms for 100 iterations
- Demonstrates SIMD parallelization benefits
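
The verification described above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit. Only the function name, the use of time.perf_counter(), the warmup runs, the >10× threshold, and the dict return come from the message. The parameters (size, iterations, warmup, min_speedup), the dict keys, and the choice of elementwise addition as the benchmarked operation are assumptions made for the example.

```python
import time

import numpy as np


def verify_vectorization_speedup(size=10_000, iterations=100, warmup=3, min_speedup=10.0):
    """Time a Python-loop elementwise add against the NumPy equivalent.

    All parameters and the returned keys are hypothetical; the commit only
    states that a dict with speedup, times, and a verification flag is returned.
    """
    rng = np.random.default_rng(0)
    a = rng.random(size).astype(np.float32)
    b = rng.random(size).astype(np.float32)

    def loop_add():
        # One Python-level iteration per element: no vectorization.
        out = np.empty_like(a)
        for i in range(size):
            out[i] = a[i] + b[i]
        return out

    def vectorized_add():
        # A single NumPy call; the loop runs in compiled code.
        return a + b

    def time_it(fn):
        # Warmup runs are executed but not timed.
        for _ in range(warmup):
            fn()
        start = time.perf_counter()
        for _ in range(iterations):
            fn()
        return time.perf_counter() - start

    # Both variants must agree before their timings are compared.
    np.testing.assert_allclose(loop_add(), vectorized_add())

    loop_time = time_it(loop_add)
    vectorized_time = time_it(vectorized_add)
    speedup = loop_time / vectorized_time if vectorized_time > 0 else float("inf")

    return {
        "loop_time_s": loop_time,
        "vectorized_time_s": vectorized_time,
        "speedup": speedup,
        "verified": speedup >= min_speedup,
    }


if __name__ == "__main__":
    result = verify_vectorization_speedup()
    print(f"loop-based: {result['loop_time_s'] * 1e3:.2f} ms")
    print(f"vectorized: {result['vectorized_time_s'] * 1e3:.2f} ms")
    print(f"speedup:    {result['speedup']:.1f}x (verified={result['verified']})")
```

Absolute timings depend on the machine, array size, and NumPy build, so this sketch is not expected to reproduce the ~100ms and ~1ms figures quoted above.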