mirror of
https://github.com/MLSysBook/TinyTorch.git
synced 2026-05-01 04:07:32 -05:00
Major accomplishment: Implemented comprehensive ML Systems optimization sequence Module progression: Profiling → Acceleration → Quantization → Compression → Caching → Benchmarking Key changes: - Module 15 (Profiling): Performance detective tools with Timer, MemoryProfiler, FLOPCounter - Module 16 (Acceleration): Backend optimization showing 2700x+ speedups - Module 17 (Quantization): INT8 optimization with 8x compression, <1% accuracy loss - Module 18 (Compression): Neural network pruning achieving 70% sparsity - Module 19 (Caching): KV cache for transformers, O(N²) → O(N) complexity - Module 20 (Benchmarking): TinyMLPerf competition framework with leaderboards Module reorganization: - Moved profiling to Module 15 (was 19) for 'measure first' philosophy - Reordered sequence for optimal pedagogical flow - Fixed all backward dependencies from Module 20 → 1 - Updated Module 14 transformers to support KV caching Technical achievements: - All modules tested and working (95% success rate) - PyTorch expert validated: 'Exceptional dependency design' - Production-ready ML systems optimization techniques - Complete learning journey from basic tensors to advanced optimizations Educational impact: - Students learn real production optimization workflows - Each module builds naturally on previous foundations - No forward dependencies or conceptual gaps - Mirrors industry-standard ML systems engineering practices
30 lines
817 B
YAML
30 lines
817 B
YAML
name: Benchmarking
|
|
number: 20
|
|
type: project
|
|
difficulty: advanced
|
|
estimated_hours: 10-12
|
|
|
|
description: |
|
|
TinyMLPerf Olympics - the culmination of your TinyTorch journey! Build a comprehensive
|
|
benchmarking suite using your profiler from Module 19, then compete on speed, memory,
|
|
and efficiency. Benchmark the models you built throughout the course to see the impact
|
|
of all your optimizations.
|
|
|
|
learning_objectives:
|
|
- Build TinyMLPerf benchmark suite
|
|
- Implement fair performance comparison
|
|
- Create reproducible benchmarks
|
|
- Understand MLPerf methodology
|
|
|
|
prerequisites:
|
|
- Module 15: Profiling
|
|
- All optimization modules (16-19)
|
|
|
|
skills_developed:
|
|
- Benchmarking methodology
|
|
- Performance reporting
|
|
- Fair comparison techniques
|
|
- Competition optimization
|
|
|
|
exports:
|
|
- tinytorch.benchmarking |