[How-To] Quantize my own model in Tensorflow using this approach? #4

mahimairaja · 2023-07-31T14:46:12Z

No description provided.

jundaf2 · 2023-07-31T14:58:50Z

This dynamic quantization simply leverage the fact that
(1) the global max of the matrix P is not necessarily the max value of each line of matrix P
(2)you know the max and min value in the softmax computation of a line on the fly -- it's the inherit property of softmax, i.e. the numerators in every line is between [0,1]
, so you can leverage this fact without passing addtional global quantization information of matrix $P$.

Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[How-To] Quantize my own model in Tensorflow using this approach? #4

[How-To] Quantize my own model in Tensorflow using this approach? #4

mahimairaja commented Jul 31, 2023

jundaf2 commented Jul 31, 2023 •

edited

Loading

[How-To] Quantize my own model in Tensorflow using this approach? #4

[How-To] Quantize my own model in Tensorflow using this approach? #4

Comments

mahimairaja commented Jul 31, 2023

jundaf2 commented Jul 31, 2023 • edited Loading

jundaf2 commented Jul 31, 2023 •

edited

Loading