Skip quantization when channels_out / channels_in are not multiple of 16 #3309
Conversation
CI status: 1 new failure as of commit 3e52ad2 with merge base 6259e98 (artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3309).
Force-pushed fb4b5d4 to bceadd1
Summary: The underlying fbgemm conv3d kernel for float8 only supports channels_out and channels_in that are both multiples of 16, so for now we skip quantization for shapes that don't satisfy this requirement; support could be expanded with padding in the future. Test Plan: python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_fp8_conv_skip_quant
Force-pushed bceadd1 to 3e52ad2
CI error is unrelated.
compile: bool,
granularity,
inference_mode: bool,
kernel_preference: KernelPreference,
Sorry, just noticed that this is not removed. The test is skipped in CI so it should not be run for now; will remove in the next PR.
Summary:
The underlying fbgemm conv3d kernel for float8 only supports channels_out and channels_in that are both multiples of 16, so for now we skip quantization for shapes that don't satisfy this requirement; we can expand support with padding if needed in the future.
Test Plan:
(on a B200 machine) python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_fp8_conv_skip_quant
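The shape check described in the summary can be sketched as a small predicate. This is a hedged illustration only: the function name `should_skip_float8_conv_quant` and its call sites are hypothetical, not the actual torchao implementation; the only fact taken from the PR is that the fbgemm float8 conv3d kernel requires both channel counts to be multiples of 16.

```python
def should_skip_float8_conv_quant(channels_out: int, channels_in: int) -> bool:
    """Return True when the fbgemm float8 conv3d kernel cannot handle the shape.

    Hypothetical helper: the kernel requires channels_out and channels_in
    to both be multiples of 16, so any other shape falls back to the
    unquantized weight instead of being quantized.
    """
    return channels_out % 16 != 0 or channels_in % 16 != 0


if __name__ == "__main__":
    # Both channel counts are multiples of 16: quantization proceeds.
    print(should_skip_float8_conv_quant(32, 48))   # False
    # channels_out = 20 is not a multiple of 16: quantization is skipped.
    print(should_skip_float8_conv_quant(20, 48))   # True
```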