[punet] CI for quantization import/compilation/golden check #76

stellaraccident · 2024-06-26T23:09:50Z

No description provided.

ScottTodd · 2024-06-26T23:31:33Z

Could you take a look at #70 and nod-ai/SHARK-TestSuite#272 ? Those are for llama, and it would be nice to have similar testing for punet and llama. For the first, I'm not particularly attached to the bash script but just wanted something running in this repo somehow. The second distills the program down to a common test format we can use from IREE on presubmit.

stellaraccident · 2024-06-27T00:13:51Z

Oh thanks, will have a look.

* Imports/validates the FP16 model running eagerly. * Imports/validates the int8 model running eagerly. * Exports the models. Progress on #76

Sample run (with failures): https://github.com/nod-ai/sharktank/actions/runs/9752141473/job/26915109873 Definitely room for improvement, but it's a start. Progress on #76

Related to nod-ai/SHARK-Platform#76 * Model source: https://github.com/nod-ai/sharktank/tree/main/sharktank/sharktank/models/punet * Python test: https://github.com/nod-ai/sharktank/blob/main/sharktank/integration/models/punet/integration_test.py * Python run script: https://github.com/nod-ai/sharktank/blob/main/sharktank/sharktank/models/punet/tools/run_punet.py * Weights are labeled in the sharktank sources. The primarily come from https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0 and https://huggingface.co/amd-shark/sdxl-quant-models Only testing compilation and basic execution for now. Result comparisons will follow once they are enabled upstream. TBD if the quantized outputs will be different enough across backends to need extra infrastructure configuration there.

stellaraccident self-assigned this Jun 26, 2024

stellaraccident added the sdxl-int8 label Jun 26, 2024

stellaraccident mentioned this issue Jun 29, 2024

[punet] Add integration tests. #84

Merged

stellaraccident added a commit that referenced this issue Jun 29, 2024

[punet] Add integration tests. (#84)

dbc50eb

* Imports/validates the FP16 model running eagerly. * Imports/validates the int8 model running eagerly. * Exports the models. Progress on #76

ScottTodd mentioned this issue Jul 1, 2024

Add CI job for punet tests, running nightly. #90

Merged

ScottTodd added a commit that referenced this issue Jul 1, 2024

Add CI job for punet tests, running nightly. (#90)

697f8a6

Sample run (with failures): https://github.com/nod-ai/sharktank/actions/runs/9752141473/job/26915109873 Definitely room for improvement, but it's a start. Progress on #76

ScottTodd mentioned this issue Jul 2, 2024

Import punet tests from sharktank to iree_tests. nod-ai/SHARK-TestSuite#277

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[punet] CI for quantization import/compilation/golden check #76

[punet] CI for quantization import/compilation/golden check #76

stellaraccident commented Jun 26, 2024

ScottTodd commented Jun 26, 2024

stellaraccident commented Jun 27, 2024

[punet] CI for quantization import/compilation/golden check #76

[punet] CI for quantization import/compilation/golden check #76

Comments

stellaraccident commented Jun 26, 2024

ScottTodd commented Jun 26, 2024

stellaraccident commented Jun 27, 2024