Question about the model parameters after quantization #9

liwenwei123 · 2020-02-09T16:44:05Z

Dear Yury,
I have run the code successfully as the setps in readme.Thanks for your work! And I have some questions about the model after quantization.
I try to print the parameters after quantization, I chose 'qtype=int4' and qweigh=int8, but the parameters seem to be float but not int? such as :
'conv1.weight', Parameter containing:
tensor([[[[-2.4899e-03, -1.2449e-03, 0.0000e+00, ..., 1.3694e-02,
3.7348e-03, -2.4899e-03],
[ 2.4899e-03, 2.4899e-03, -2.6144e-02, ..., -6.3492e-02,
-2.9879e-02, 1.2449e-03],
[-1.2449e-03, 1.3694e-02, 6.8472e-02, ..., 1.2076e-01,
5.9758e-02, 1.4939e-02],.....
I try to save the model by 'torch.save(self.model.stat_dict(),'resnet18_qm.pkl)', but the size of the model is as same as the original resnet18 pretrained model file. I thought it would be much smaller after quantization.
Is there any steps I missed or haven't understand the meaning if code correctly?
Thanks again and Looking forward to your reply!

Anna

xieydd · 2020-02-20T13:46:41Z

Why the weight is not int8? @liwenwei123

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about the model parameters after quantization #9

Question about the model parameters after quantization #9

liwenwei123 commented Feb 9, 2020

xieydd commented Feb 20, 2020

Question about the model parameters after quantization #9

Question about the model parameters after quantization #9

Comments

liwenwei123 commented Feb 9, 2020

xieydd commented Feb 20, 2020