Added hybrid quantization for seq2seq models #427

AlexKoff88 · 2023-09-08T14:47:53Z

This is a draft of the PR that should be merged after OpenVINO 2023.1 release

HuggingFaceDocBuilderDev · 2023-09-08T14:54:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

echarlaix · 2023-12-15T11:38:29Z

optimum/intel/openvino/quantization.py

+ # Compress weights of decoders for safity
+ self.model.decoder_model = nncf.compress_weights(self.model.decoder_model)
+ if self.model.use_cache:
+ self.model.decoder_with_past_model = nncf.compress_weights(self.model.decoder_with_past_model)


Would there be a possibility to also quantize the activations for the decoder components?

It is possible but it may hurt accuracy.

Added hybrid quantization for seq2seq models

f87f93f

AlexKoff88 added 2 commits November 9, 2023 14:30

Merge remote-tracking branch 'origin/main' into ak/seq2seq_opt

a635bc5

Added a test for Seq2Seq quantization

c5b21b4

AlexKoff88 requested a review from helena-intel November 9, 2023 12:20

AlexKoff88 changed the title ~~[Draft]: Added hybrid quantization for seq2seq models~~ Added hybrid quantization for seq2seq models Nov 9, 2023

AlexKoff88 added 6 commits November 9, 2023 18:00

Fixed a couple of bugs

7815333

Fixed test issue

0cb7f19

Fixed issue

b8d98f4

Fixed test

35a30a0

Fixed test

6df8329

Style

04f8344

echarlaix reviewed Dec 15, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added hybrid quantization for seq2seq models #427

Added hybrid quantization for seq2seq models #427

AlexKoff88 commented Sep 8, 2023

HuggingFaceDocBuilderDev commented Sep 8, 2023

echarlaix Dec 15, 2023

AlexKoff88 Jan 8, 2024

Added hybrid quantization for seq2seq models #427

Are you sure you want to change the base?

Added hybrid quantization for seq2seq models #427

Conversation

AlexKoff88 commented Sep 8, 2023

HuggingFaceDocBuilderDev commented Sep 8, 2023

echarlaix Dec 15, 2023

Choose a reason for hiding this comment

AlexKoff88 Jan 8, 2024

Choose a reason for hiding this comment