You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enable specifying output dtype for fp8 quantized communication (#5154)
Summary:
X-link: meta-pytorch/torchrec#3568
Pull Request resolved: #5154
X-link: https://github.com/facebookresearch/FBGEMM/pull/2154
Adding fp8_output_dtype parameter to the qcomms config allowing fp8 to dequantize in different float formats as opposed to only FP32
Reviewed By: spcyppt
Differential Revision: D86890315
fbshipit-source-id: 1cbfdabd63ad4dc0a1c3d47990aa591a567fc9d0
0 commit comments