
[punet] Add integration tests. #84

Merged 1 commit into main on Jun 29, 2024
Conversation

stellaraccident
Contributor

  • Imports/validates the FP16 model running eagerly.
  • Imports/validates the int8 model running eagerly.
  • Exports the models.

Progress on #76

@stellaraccident merged commit dbc50eb into main Jun 29, 2024
3 checks passed
@stellaraccident deleted the punet_integration_test branch June 29, 2024 02:21
Member

@ScottTodd left a comment


Nice!

Comment on lines +23 to +24
REPO_ID = "amd-shark/sharktank-goldens"
REVISION = "230dad4d85fbcb8759a331dcf1d45f0562875abe"

Ooh using huggingface (instead of Azure, GCS, etc.)? Nice!

May want to link https://huggingface.co/amd-shark/sharktank-goldens somewhere for easy referencing

Comment on lines +85 to +90
def get_best_torch_device() -> str:
import torch

if torch.cuda.is_available() and torch.cuda.device_count() > 0:
return "cuda:0"
return "cpu"

May want a way to set this device index via environment variable or flag.

On some of our CI runners we try to only use a specific GPU (like 6), since we and possibly other groups are running multiple jobs on the same "machine" / node.
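One way to act on this suggestion is an environment-variable override layered on top of the PR's `get_best_torch_device` fallback logic. This is a hypothetical sketch, not code from the PR; the variable name `SHARKTANK_TORCH_DEVICE` is an assumption chosen for illustration.

```python
import os


def get_torch_device(env_var: str = "SHARKTANK_TORCH_DEVICE") -> str:
    """Pick a torch device string, honoring an env-var override.

    Hypothetical extension of the PR's get_best_torch_device: a CI runner
    pinned to GPU 6 could export SHARKTANK_TORCH_DEVICE=cuda:6.
    """
    override = os.environ.get(env_var)
    if override:
        return override  # e.g. "cuda:6" to pin a specific GPU
    try:
        import torch

        if torch.cuda.is_available() and torch.cuda.device_count() > 0:
            return "cuda:0"
    except ImportError:
        pass
    return "cpu"
```

A pytest flag (e.g. via `pytest_addoption`) would work equally well; the env var has the advantage of needing no plumbing through the test invocation.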


Until this is running on CI, it would be useful to see the logs produced (e.g. in a gist).

Things I'd look for:

  • Time taken for each step / test case
  • Format of logs on success
  • Format of logs on failure
  • How easy the flow is to understand and reproduce outside of pytest
  • What artifacts are downloaded
  • What artifacts are produced

return "cpu"


def assert_golden_safetensors(actual_path, ref_path):

This function is nice, giving a good reason to stay in Python instead of using bash or just the native tools like iree-run-module.

(I'm still slowly coming to terms with moving infrastructure from C/C++ to python :P)
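For readers without the diff at hand, the comparison core behind a golden-check like `assert_golden_safetensors` can be sketched as follows. This is an illustrative reconstruction, not the PR's actual implementation: the function name, tolerances, and the use of dicts of arrays (as produced by loading `.safetensors` files, e.g. with the `safetensors` library) are all assumptions.

```python
import numpy as np


def assert_tensor_dicts_close(actual: dict, ref: dict, rtol=1e-4, atol=1e-4):
    """Assert two name->array mappings match in names, shapes, and values.

    Hypothetical sketch of a golden-tensor comparison; loading the
    .safetensors files into these dicts is left to the caller.
    """
    # Same set of tensor names on both sides.
    assert set(actual) == set(ref), f"Tensor names differ: {set(actual) ^ set(ref)}"
    for name in sorted(ref):
        a, r = np.asarray(actual[name]), np.asarray(ref[name])
        assert a.shape == r.shape, f"{name}: shape {a.shape} != {r.shape}"
        # Elementwise tolerance check with a per-tensor error message.
        np.testing.assert_allclose(a, r, rtol=rtol, atol=atol, err_msg=name)
```

Doing this in Python rather than shelling out to native tools makes the failure output (which tensor, which elements, by how much) immediately useful in a pytest log.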
