@armandsauzay

Summary:
X-link: meta-pytorch/torchrec#3602

X-link: https://github.com/facebookresearch/FBGEMM/pull/2202

Add an output_dtype parameter to the MX4 dequantization stack to support direct
conversion to BF16/FP16, avoiding the expensive FP32 intermediate step.

Differential Revision: D87826479
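
For readers unfamiliar with the change, here is a minimal pure-Python sketch of what an output_dtype argument on an MX4 dequantizer could look like. It is not the FBGEMM or torchrec implementation. The function names, the string dtype tags, the low-nibble-first packing, and the single shared exponent per call are all assumptions made for illustration. The element table and the 2^(e - 127) scale follow the OCP microscaling FP4 (E2M1) and E8M0 formats. The sketch only shows the semantics of choosing the target precision at dequantization time and says nothing about kernel performance.

```python
import struct

# Magnitudes encoded by the low 3 bits of a 4-bit E2M1 code (OCP MX FP4 elements).
E2M1_MAGNITUDES = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)


def round_to_dtype(value: float, output_dtype: str) -> float:
    """Round a finite Python float to the requested precision (hypothetical helper)."""
    if output_dtype == "fp32":
        return struct.unpack("<f", struct.pack("<f", value))[0]
    if output_dtype == "fp16":
        # struct's "e" format is IEEE 754 binary16.
        return struct.unpack("<e", struct.pack("<e", value))[0]
    if output_dtype == "bf16":
        # BF16 is the top 16 bits of FP32; round to nearest, ties to even.
        bits = struct.unpack("<I", struct.pack("<f", value))[0]
        bits += 0x7FFF + ((bits >> 16) & 1)
        bits &= 0xFFFF0000
        return struct.unpack("<f", struct.pack("<I", bits))[0]
    raise ValueError(f"unsupported output_dtype: {output_dtype!r}")


def dequantize_mx4_group(packed: bytes, shared_exponent: int,
                         output_dtype: str = "fp32") -> list:
    """Dequantize one group of packed 4-bit codes that share a biased exponent.

    Assumed layout: two codes per byte, low nibble first; bit 3 of each code
    is the sign. The real kernels may pack and group values differently.
    """
    scale = 2.0 ** (shared_exponent - 127)  # E8M0 scale with bias 127
    values = []
    for byte in packed:
        for code in (byte & 0xF, byte >> 4):
            sign = -1.0 if code & 0x8 else 1.0
            magnitude = E2M1_MAGNITUDES[code & 0x7]
            values.append(round_to_dtype(sign * magnitude * scale, output_dtype))
    return values


# Codes 0x1, 0x2, 0x7, 0xF with scale 2^(128 - 127) = 2.
print(dequantize_mx4_group(bytes([0x21, 0xF7]), 128, output_dtype="bf16"))
# [1.0, 2.0, 12.0, -12.0]
```

Keeping "fp32" as the default in the sketch mirrors how an added parameter can preserve existing behavior for callers that do not pass it; whether the actual change uses that default is not stated in this PR description.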

@meta-cla (bot) added the "cla signed" label on Dec 9, 2025
@meta-codesync (bot, Contributor) commented on Dec 9, 2025

@armandsauzay has exported this pull request. If you are a Meta employee, you can view the originating Diff in D87826479.
