mirror of
https://github.com/harvard-edge/cs249r_book.git
synced 2026-03-09 07:15:51 -05:00
Enhances the `inference` chapter with several new SVG diagrams, providing visual explanations for complex topics. These figures illustrate: - Tensor, pipeline, and expert parallelism request routing - Horizontal scaling with shard groups - Global load balancing across multiple regions - Edge caching strategies (hit/miss paths) - Spot-aware traffic distribution Updates the `inference.qmd` document to integrate these new diagrams, replacing previous textual and ASCII-art descriptions for improved clarity and presentation. Applies a widespread style standardization to existing SVG diagrams, uniformly setting the main background fill color to `#fff` (pure white) and a consistent corner radius (`rx="4"`) for the primary canvas rectangle to enhance visual consistency throughout the book.
Symbolic link
1 line
10 B
Plaintext
Symbolic link
1 line
10 B
Plaintext
book/tools |