mirror of
https://github.com/harvard-edge/cs249r_book.git
synced 2026-05-07 02:03:55 -05:00
Efficient AI Thoughts Checklist #325
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @18jeffreyma on GitHub (Feb 3, 2025).
Originally assigned to: @profvjreddi, @18jeffreyma on GitHub.
9.2
9.2.1
“Era of Algorithmic Efficiency”: Maybe worth including some examples here on why parallelism and ml systems didnt take off:
Namely, things like decision trees or SVMs were not that easily model parallelized: most works focused on ensemble learning (i.e. learning multiple trees/models in parallel), which one can do with data parallelism and splitting across batches.
Only when deep learning came did we have models that were much easier to shard across model dimension.
“The shift to deep learning: what do you think about bolding some of the key words to make them stand out?
“Modern Era of Algorithmic Efficiency”: I think an additional linkage here is to explain even at large scale datacenters, we need efficiency. Maybe worth briefly touching here on we’re hitting limits of our hardware and need to be more creative in software/efficiency to maximize hardware usage.
Maybe discuss things like general latency or memory requirements briefly (I see the next section goes into depth)
Update Figure 9.2 with some LLMs maybe?
9.2.2
9.2.3
Some bits on data efficiency like https://www.frontiersin.org/journals/computational-neuroscience/articles/10.3389/fncom.2022.760085/full
Cite WakeVision, KWS dataset, FineWeb as a nice data efficient postprocessing example.
9.3.2
“Algorithmic Efficiency Reinforces Compute and Data Efficiency”:
Maybe let’s go into what the ideas behind this section are in person
9.3.3
9.3.4
9.4.2
9.5.1
9.5.2
9.2 should be renamed toHistory over Time
9.3 should be renamed Defining System Efficiency
9.4 stays same
9.5 stays same
9.6: Building a Efficiency-first Mindset (or Efficiency as a First Party Consideration)
9.7 Broader Challenges and Philosophical Questions
9.7.4 Balancing Innovation and Efficiency
Thoughts from our tuesday discussion:
Compute + Data Efficiency: understanding how to build data pipelines and loading such that accelerators and GPUs are always fed and have work readily available to make training as fast as possible.