
Add JPQD evaluation notebook #231

Draft · wants to merge 1 commit into main from jpqd-notebook
Conversation

helena-intel (Collaborator):

Add a JPQD evaluation notebook. Since JPQD QA takes about 12 hours to train, it doesn't make sense to run training in a notebook: if the browser crashes or the computer goes to sleep, training would stop. So I refer to the existing example for training and use the notebook to evaluate the resulting model.

This makes the notebook similar to the PTQ QA notebook. I thought about removing the duplication, but duplication in examples is not so bad, at least for now: it's nice that examples are standalone.

Since JPQD starts from a plain bert-base-uncased model, I fine-tuned bert-base-uncased following the Transformers run_qa.py example to have a baseline for the performance comparison.

Instead of making this a JPQD-specific notebook, it could make more sense to make it a generic QA INT8 evaluation notebook. On the other hand, it's an example, so people can easily adapt it for similar purposes, and it's nice to promote JPQD.
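For reference, the scoring half of such an evaluation notebook boils down to SQuAD-style exact-match over the model's predicted answers. The snippet below is a minimal, self-contained stand-in for that metric (the helper names are illustrative, not the notebook's actual code):

```python
import re
import string


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and articles, collapse whitespace (SQuAD convention)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(predictions, references) -> float:
    """Percentage of predictions that match their reference after normalization."""
    hits = sum(
        normalize_answer(p) == normalize_answer(r)
        for p, r in zip(predictions, references)
    )
    return 100.0 * hits / len(predictions)


# "The Eiffel Tower" matches "Eiffel Tower" after normalization; "1969" != "in 1970".
print(exact_match(["The Eiffel Tower", "1969"], ["Eiffel Tower", "in 1970"]))  # 50.0
```

In the real notebook this role would be played by an off-the-shelf SQuAD metric; the sketch only shows what is being measured when the FP32 baseline and the JPQD model are compared.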

TODO: the intro text at the top needs to explain a bit more about JPQD.

Colab link: https://colab.research.google.com/github/helena-intel/optimum-intel/blob/jpqd-notebook/notebooks/openvino/question_answering_quantization_jpqd.ipynb (performance is probably bad on Colab because there is no AVX512/VNNI).

@vuiseng9

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@helena-intel force-pushed the jpqd-notebook branch 2 times, most recently from 8f7b129 to 82dc6a6 on March 14, 2023 at 23:18
@AlexKoff88 (Collaborator):

I think it would be more useful to show the performance and accuracy trade-offs for three models:

  • Original Transformer model (fp32)
  • Quantized model (PTQ/QAT)
  • Pruned and quantized (JPQD, distillation is an auxiliary method here)
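A three-way comparison like this is typically a small loop that times each pipeline on the same inputs. The sketch below shows that structure only; the lambda "models" are placeholders for the actual fp32, quantized, and JPQD pipelines, and all names are illustrative:

```python
import time


def benchmark(predict, inputs, warmup=2, runs=10):
    """Average latency of predict per sample, in milliseconds."""
    for _ in range(warmup):          # warm-up iterations are not timed
        for x in inputs:
            predict(x)
    start = time.perf_counter()
    for _ in range(runs):
        for x in inputs:
            predict(x)
    return (time.perf_counter() - start) / (runs * len(inputs)) * 1000.0


# Placeholders; in the notebook these would be the three actual pipelines.
models = {
    "fp32": lambda x: x,
    "int8 (PTQ/QAT)": lambda x: x,
    "jpqd": lambda x: x,
}

inputs = list(range(8))
for name, predict in models.items():
    print(f"{name}: {benchmark(predict, inputs):.3f} ms/sample")
```

Pairing each latency number with the accuracy of the same model would give the trade-off table suggested above.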

@ljaljushkin (Contributor):

@yujiepan-work and @vuiseng9 implemented very nice lightweight tests for JPQD training: 9 epochs take just a few seconds on a single card. I'd reuse them for this notebook.
https://github.com/openvinotoolkit/nncf/blob/develop/tests/torch/sparsity/movement/test_training.py#L237

If we need very good accuracy/performance results, there are longer tests to consider:
https://github.com/openvinotoolkit/nncf/blob/develop/tests/torch/sparsity/movement/test_training.py#L318
If I am not mistaken, those take minutes; @yujiepan-work could probably say the exact time.
