
Support fp16 model to weight-only quantization for PyTorch framework #1387

Merged: 3 commits merged into master on Nov 16, 2023

Conversation

PenghuiCheng (Contributor)

Type of Change

feature
No API changed

Description

Support fp16 model to weight-only quantization for PyTorch framework.

Expected Behavior & Potential Risk

The fp16 model is quantized successfully.

How has this PR been tested?

Tested locally.
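
For context only (this is not the Neural Compressor implementation), a minimal sketch of the scenario this PR enables: a model whose weights are stored in fp16 is cast to fp32 and then weight-only quantized. The RTN-style helper below is purely illustrative.

```python
# Illustrative sketch only: round-to-nearest (RTN) weight-only quantization
# of an fp16 model, not the Neural Compressor API.
import torch
import torch.nn as nn

def rtn_quantize_weight(weight: torch.Tensor, bits: int = 8):
    """Per-output-channel symmetric round-to-nearest quantization of a 2-D weight."""
    qmax = 2 ** (bits - 1) - 1
    scale = weight.abs().amax(dim=1, keepdim=True) / qmax
    scale = scale.clamp(min=1e-8)
    qweight = torch.round(weight / scale).clamp(-qmax - 1, qmax)
    return qweight.to(torch.int8), scale

def weight_only_quantize(model: nn.Module, bits: int = 8) -> nn.Module:
    for name, m in model.named_modules():
        if isinstance(m, nn.Linear):
            # fp16 weights are cast to fp32 so the scale computation and
            # rounding run in full precision.
            w = m.weight.data.float()
            qweight, scale = rtn_quantize_weight(w, bits)
            # Fake-quantize: store the dequantized fp32 weight back.
            m.weight.data = qweight.float() * scale
    return model

# Example: an fp16 model is now accepted as input.
model = nn.Sequential(nn.Linear(16, 16), nn.ReLU(), nn.Linear(16, 4)).half()
quantized = weight_only_quantize(model)
```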

@PenghuiCheng PenghuiCheng requested a review from xin3he November 15, 2023 06:57
Signed-off-by: Cheng, Penghui <[email protected]>
xin3he (Contributor) left a comment


I think it's better to move the `model.float()` into the for loop `for name, m in model.named_modules():`.
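
A rough sketch of what this suggestion amounts to, where `quant_weight` is a hypothetical stand-in for the actual per-module weight-only quantization routine: instead of calling `model.float()` once before the loop, each matched module is cast to fp32 inside the `named_modules()` loop, right before it is quantized.

```python
# Illustrative only; quant_weight() is a hypothetical helper standing in for
# the real weight-only quantization routine.
import torch.nn as nn

def apply_woq(model: nn.Module, quant_weight) -> nn.Module:
    # Before: the whole model was converted to fp32 up front.
    # model.float()
    for name, m in model.named_modules():
        if isinstance(m, nn.Linear):
            # After: convert only the module being quantized, inside the loop.
            m.float()
            quant_weight(m)
    return model
```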

Signed-off-by: Cheng, Penghui <[email protected]>
@PenghuiCheng (Contributor, Author)

> I think it's better to move the `model.float()` into the for loop `for name, m in model.named_modules():`.

Yes, changed.

@chensuyue chensuyue merged commit d5cb567 into master Nov 16, 2023
@chensuyue chensuyue deleted the penghuic/support_fp16_for_woq branch November 16, 2023 02:51