mirror of
https://github.com/harvard-edge/cs249r_book.git
synced 2026-05-07 02:03:55 -05:00
Discuss latest NVIDIA results on quantization schemes (NVFP4) #430
Originally created by @profvjreddi on GitHub (Aug 26, 2025).
A new NVIDIA blog post introduces NVFP4, a 4-bit format that, according to the post, trains with the precision of 16-bit and the speed and efficiency of 4-bit: https://developer.nvidia.com/blog/nvfp4-trains-with-precision-of-16-bit-and-speed-and-efficiency-of-4-bit
Should we discuss or incorporate any new insights from these results in our work?