Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tests: benchdnn: add experimental support for sycl graph #2295

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

spalicki
Copy link
Contributor

@spalicki spalicki commented Dec 19, 2024

Adds ability to run benchdnn tests using graph execution by adding --execution-mode=[direct(default)|graph] option. It is currently limited to SyCL GPU engine executing on L0 backend.

@spalicki spalicki self-assigned this Dec 19, 2024
@spalicki spalicki requested a review from a team as a code owner December 19, 2024 18:04
@github-actions github-actions bot added the component:tests Codeowner: @oneapi-src/onednn-arch label Dec 19, 2024
@spalicki spalicki requested review from a team December 19, 2024 20:21
tests/benchdnn/common.hpp Outdated Show resolved Hide resolved
@spalicki spalicki force-pushed the spalicki/sycl_graph_test branch from a7c4af4 to 541256b Compare December 19, 2024 21:07
@spalicki spalicki requested a review from a team as a code owner December 19, 2024 21:07
@github-actions github-actions bot added the documentation A request to change/fix/improve the documentation. Codeowner: @oneapi-src/onednn-doc label Dec 19, 2024
cmake/testing.cmake Outdated Show resolved Hide resolved
Copy link
Contributor

@ranukund ranukund left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few edits suggested, please incorporate as you see fit!

tests/benchdnn/doc/knobs_common.md Outdated Show resolved Hide resolved
tests/benchdnn/doc/knobs_common.md Outdated Show resolved Hide resolved
dnnl_status_t status = dnnl_runtime_error;
bool run_regular_exec = true;
#if DNNL_GPU_RUNTIME == DNNL_RUNTIME_DPCPP
while (use_sycl_graph && is_gpu(engine)) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is while with a breaks preferred here instead of just ifs or a lambda?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the functionality can't be supported, the fallback to a regular execution must happen. Without while it's gonna be a more tangled connection between two or three ifs. Replacing this code with lambda would be identical to this while as we need objects from checking part and executing part, can't break it down into two without passing those objects around which doesn't help with readability. So to day, it's just a style preference.

DNN_SAFE(dnnl_stream_wait(stream), CRIT);

auto exec = graph.finalize();
queue.ext_oneapi_graph(exec).wait();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you give a bit more context on the reasons behind this PR in general? I see it is adding a way to test the SYCL Graph extension with benchdnn. Is the intention to test for correctness and/or measure the impact on performance? Wouldn't it be useful to also be able to re-use the graph n times?
Also the PR says it is an experimental support. Is there a plan for the future of SYCL-Graph in benchdnn?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The main goal is enabling functional validation of oneDNN primitives in SYCL Graph record/replay mode. While use of SYCL Graph does not require changes in oneDNN API it puts some limitations on what the implementation can do internally.

'Experimental' refers to support status of SYCL Graph extension support in the compiler.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, thanks. Would it be safer to re-use the graph a number of times to make sure that the limitations you mentioned are respected? Is this planned for a separate PR maybe?
I think it looks odd to create a graph without re-using it.

@petercad
Copy link
Contributor

General suggestions: for use in CI, it would be nice to be able to apply --use-sycl-graph on a per-layer basis. I'd also suggest a more generic name (perhaps something like --gpu-execution-mode=graph), because other GPU runtimes also support similar functionality.

@dzarukin
Copy link
Contributor

General suggestions: for use in CI, it would be nice to be able to apply --use-sycl-graph on a per-layer basis. I'd also suggest a more generic name (perhaps something like --gpu-execution-mode=graph), because other GPU runtimes also support similar functionality.

By "per layer" do you mean to run like this:

--matmul --gpu-execution-mode=,graph 32x32:32x64

and two problems will be executed, one with sycl-graph, another - not?

@spalicki spalicki force-pushed the spalicki/sycl_graph_test branch from 541256b to 4f14837 Compare December 20, 2024 18:47
@spalicki
Copy link
Contributor Author

General suggestions: for use in CI, it would be nice to be able to apply --use-sycl-graph on a per-layer basis. I'd also suggest a more generic name (perhaps something like --gpu-execution-mode=graph), because other GPU runtimes also support similar functionality.

The function is limited to GPU today due to L0 dependency, but in the end, it should be extended to CPU as well. Maybe then --execution-mode=[direct(default)|graph]?

@petercad
Copy link
Contributor

By "per layer" do you mean to run like this:
--matmul --gpu-execution-mode=,graph 32x32:32x64
and two problems will be executed, one with sycl-graph, another - not?

@dzarukin My main thought was that it would be nice to add a few targeted matmul test cases for SYCL graph (since there are some special SYCL graph code paths), but we don't need to test the whole CI test suite with SYCL graph. By making a per-layer knob we can add some graph test cases to the regular CI test suites instead of creating new test suites for SYCL graph.

The function is limited to GPU today due to L0 dependency, but in the end, it should be extended to CPU as well. Maybe then --execution-mode=[direct(default)|graph]?

That sounds good to me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component:build component:tests Codeowner: @oneapi-src/onednn-arch documentation A request to change/fix/improve the documentation. Codeowner: @oneapi-src/onednn-doc
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants