
Will module output not be quantized when the model is directly trained after Calibration? #336

Open
tusiqi1 opened this issue Oct 11, 2024 · 5 comments

Comments


tusiqi1 commented Oct 11, 2024

No description provided.


tusiqi1 commented Oct 12, 2024

  1. Model structure and training data: (screenshot)
  2. Calibration code: (screenshot)
  3. Code that trains the quantized model starting from the initial input_scale and output_scale: (screenshot; a minimal sketch of this flow is included below)
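For reference, a minimal sketch of this calibration + QAT flow, assuming the optimum-quanto API (quantize, Calibration, freeze); the model, data, loss and optimizer below are placeholders rather than the actual code from the screenshots:

import torch
from optimum.quanto import Calibration, freeze, qint8, quantize

# Placeholder model and calibration batch (assumptions, not the original code).
model = torch.nn.Sequential(
    torch.nn.Linear(16, 32),
    torch.nn.ReLU(),
    torch.nn.Linear(32, 4),
)
calib_batch = torch.randn(8, 16)

# Step 2: quantize weights and activations, then record input/output scales
# by running a few samples under the Calibration context.
quantize(model, weights=qint8, activations=qint8)
with Calibration():
    model(calib_batch)

# Step 3: quantization-aware training starting from the calibrated scales.
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
for _ in range(10):
    output = model(torch.randn(8, 16))
    loss = output.sum()  # placeholder objective, purely for illustration
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Freeze to materialize the quantized weights once training is done.
freeze(model)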

I found that when I executed step 3, output quantization was never applied. The reason is that when the model first runs under

with Calibration():
    model(input)

the calibrate_output hook for the whole model executes the code in the red box: (screenshot)

Why does the code in the red box disable output quantization?
Or is my use of calibration + QAT incorrect?


dacorvo commented Oct 14, 2024

As you can see in the comment on the line you highlighted, the quantization of the outputs is disabled because the operation immediately following is not compatible with quantized inputs. This means that when the Tensor reaches that operation, it will be immediately dequantized: the streamline optimization policy removes the spurious quantize/dequantize.

If you want to disable this behaviour, just pass streamline=False during calibration.
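For example, a minimal sketch assuming the placeholder setup from the earlier snippet (calib_batch stands in for your calibration input):

with Calibration(streamline=False):
    model(calib_batch)

This keeps the per-module output quantizers enabled even where the next operation would otherwise dequantize the tensor immediately.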


tusiqi1 commented Oct 15, 2024

> As you can see in the comment on the line you highlighted, the quantization of the outputs is disabled because the operation immediately following is not compatible with quantized inputs. This means that when the Tensor reaches that operation, it will be immediately dequantized: the streamline optimization policy removes the spurious quantize/dequantize.
>
> If you want to disable this behaviour, just pass streamline=False during calibration.

(screenshot)
When I set a breakpoint at the location shown above, the entire model is passed to the calibrate_output() function, which causes output quantization to be turned off.

Is this right?


dacorvo commented Oct 15, 2024

Just use with Calibration(streamline=False): to disable this behaviour.


tusiqi1 commented Oct 15, 2024

> Just use with Calibration(streamline=False): to disable this behaviour.

Thank you very much. I'll think more about the purpose of streamline myself.
