[GH-ISSUE #734] model optimization notes #1560

Closed
opened 2026-04-11 07:55:33 -05:00 by GiteaMirror · 3 comments
Owner

Originally created by @18jeffreyma on GitHub (Mar 6, 2025).
Original GitHub issue: https://github.com/harvard-edge/cs249r_book/issues/734

otherwise rlly good organization, these are only minor nits imo

Originally created by @18jeffreyma on GitHub (Mar 6, 2025). Original GitHub issue: https://github.com/harvard-edge/cs249r_book/issues/734 - [ ] https://mlsysbook.ai/contents/core/hw_acceleration/hw_acceleration.html#mixed-precision-computing-and-hardware-evolution, really great taxonomy to introduce - [x] https://mlsysbook.ai/contents/core/optimizations/optimizations.html#dynamic-pruning maybe a citation here to a representative paper - [x] https://mlsysbook.ai/contents/core/optimizations/optimizations.html#tbl-pruning bold the left hand labels for readability - [x] https://mlsysbook.ai/contents/core/optimizations/optimizations.html#applications-of-td probably just expand TD to tensor decomposition for easier readability - [ ] https://mlsysbook.ai/contents/core/optimizations/optimizations.html#fig-quantized-energy figure wip - [x] https://mlsysbook.ai/contents/core/optimizations/optimizations.html#early-exit-architectures diagram of an example early exit - [x] https://mlsysbook.ai/contents/core/optimizations/optimizations.html#gate-based-conditional-computation moe diagram insert - [x] https://mlsysbook.ai/contents/core/optimizations/optimizations.html#hardware-execution-inefficiencies maybe a link here to XLA compilation and static shapes documentation - [x] the sparsity section could use a reference to https://arxiv.org/pdf/2211.15841 (essentially showing how sparsity can still be arranged into dense ish and hardware friendly copmutation) otherwise rlly good organization, these are only minor nits imo
GiteaMirror added the area: book label 2026-04-11 07:55:33 -05:00
Author
Owner

@18jeffreyma commented on GitHub (Mar 6, 2025):

what compressed formats in sparsity?

<!-- gh-comment-id:2704226208 --> @18jeffreyma commented on GitHub (Mar 6, 2025): what compressed formats in sparsity?
Author
Owner

@18jeffreyma commented on GitHub (Mar 6, 2025):

take a look at spatial computing (and joel emer)

<!-- gh-comment-id:2704228156 --> @18jeffreyma commented on GitHub (Mar 6, 2025): take a look at spatial computing (and joel emer)
Author
Owner

@18jeffreyma commented on GitHub (Mar 6, 2025):

2:4 spasity blog, representation and show the packing and unpacking

<!-- gh-comment-id:2704231889 --> @18jeffreyma commented on GitHub (Mar 6, 2025): 2:4 spasity blog, representation and show the packing and unpacking
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/cs249r_book#1560