[GH-ISSUE #600] Feedback on Chapter 6: Data Engineering #1507

Closed
opened 2026-04-11 07:52:19 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @zishenwan on GitHub (Jan 12, 2025).
Original GitHub issue: https://github.com/harvard-edge/cs249r_book/issues/600

Originally assigned to: @profvjreddi on GitHub.

Feedback on Chapter 6: Data Engineering

  • Figure 6.3: maybe consider adding "data labeling" and "data governance" blocks in the figure? so all subsequent sections are visually represented in figure
  • Reference: some refs seem in wrong format, e.g., "pineau2020improving?", "dutta2020mlops?"
  • Consider adding some figures or flowcharts for sections and concepts
    • Sec.6.5.1: maybe add a batching vs streaming processing figure, e.g., link
    • Sec.6.5.2: maybe add a ETL vs. ELT figure, e.g., link
    • etc
Originally created by @zishenwan on GitHub (Jan 12, 2025). Original GitHub issue: https://github.com/harvard-edge/cs249r_book/issues/600 Originally assigned to: @profvjreddi on GitHub. Feedback on Chapter 6: Data Engineering - Figure 6.3: maybe consider adding "data labeling" and "data governance" blocks in the figure? so all subsequent sections are visually represented in figure - Reference: some refs seem in wrong format, e.g., "pineau2020improving?", "dutta2020mlops?" - Consider adding some figures or flowcharts for sections and concepts - Sec.6.5.1: maybe add a batching vs streaming processing figure, e.g., [link](https://medium.com/@evertongomede/batch-vs-streaming-data-ingestion-choosing-the-right-approach-for-efficient-data-processing-8fa492299dd4) - Sec.6.5.2: maybe add a ETL vs. ELT figure, e.g., [link](https://blog.skyvia.com/elt-vs-etl/) - etc
GiteaMirror added the area: booktype: improvement labels 2026-04-11 07:52:19 -05:00
Author
Owner

@profvjreddi commented on GitHub (Jan 12, 2025):

Feedback on Chapter 6: Data Engineering

Thank you! 🙏

  • Figure 6.3: maybe consider adding "data labeling" and "data governance" blocks in the figure? so all subsequent sections are visually represented in figure

Good point. Done!

  • Reference: some refs seem in wrong format, e.g., "pineau2020improving?", "dutta2020mlops?"

I fixed these in an earlier push.

  • Consider adding some figures or flowcharts for sections and concepts

    • Sec.6.5.1: maybe add a batching vs streaming processing figure, e.g., link
    • Sec.6.5.2: maybe add a ETL vs. ELT figure, e.g., link
    • etc

Thanks, I was going to make a dedicated pass on the images at a later time but these are good starting points.

<!-- gh-comment-id:2585755039 --> @profvjreddi commented on GitHub (Jan 12, 2025): > Feedback on Chapter 6: Data Engineering > Thank you! 🙏 > * Figure 6.3: maybe consider adding "data labeling" and "data governance" blocks in the figure? so all subsequent sections are visually represented in figure Good point. Done! > * Reference: some refs seem in wrong format, e.g., "pineau2020improving?", "dutta2020mlops?" I fixed these in an earlier push. > * Consider adding some figures or flowcharts for sections and concepts > > * Sec.6.5.1: maybe add a batching vs streaming processing figure, e.g., [link](https://medium.com/@evertongomede/batch-vs-streaming-data-ingestion-choosing-the-right-approach-for-efficient-data-processing-8fa492299dd4) > * Sec.6.5.2: maybe add a ETL vs. ELT figure, e.g., [link](https://blog.skyvia.com/elt-vs-etl/) > * etc Thanks, I was going to make a dedicated pass on the images at a later time but these are good starting points.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/cs249r_book#1507