[SYCL][Graph] Enable L0 optimizations (no profiling mode) #358

mfrancepillois · 2024-02-23T16:46:40Z

Enable in-order cmd-list
Analyze the graph and apply enable the use of in-order command-list for linear graph.
Add a property to finalize function to enable graph profiling.
Update the specification.

Analyze the graph and apply enable the use of in-order command-list for linear graph. Add a property to finalize function to disable this optimization which is not compatible with profiling. Update the specification.

sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc

sycl/source/detail/graph_impl.cpp

sycl/source/detail/graph_impl.hpp

sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc

sycl/include/sycl/ext/oneapi/experimental/graph.hpp

sycl/source/detail/graph_impl.hpp

sycl/test-e2e/Graph/event_profiling_info.cpp

Co-authored-by: Pablo Reble <[email protected]>

…is not passed to finalize(). + Update spec.

sycl/source/detail/graph_impl.hpp

julianmi · 2024-02-28T10:56:55Z

sycl/source/detail/graph_impl.hpp


  /// @return True if the partition contains a host task
  bool isHostTask() const {
    return (MRoots.size() && ((*MRoots.begin()).lock()->MCGType ==
                              sycl::detail::CG::CGTYPE::CodeplayHostTask));
  }

+  /// Checks if the graph is single path, i.e. each node has a single successor.
+  /// If so, the MIsInOrderGraph flag is set.
+  void checkIfGraphIsSinglePath() {


What is the overhead of adding this routine in comparison to the potential in-order optimization?

Its difficult to give an approximation of the overhead for this routine since it depends of the graph typology.
That said, if in-order graph is found we win on both sides: finalization delay (we do not need to create events) and execution time (we do not have to execute events nor synchronization on them).
On my setup (12th Gen Intel(R) Core(TM) i9-12900K, Intel(R) Level-Zero, Intel(R) UHD Graphics 770 1.3 [1.3.28454]), the finalization delay for an 2000 nodes in-order graph is reduced by ~40%. The execution time is reduced by ~15% and the second execution by ~20% compared to execution with event profiling capability disabled (and respectively 20% and 30% with the current implementation (i.e. event profiling enabled)).

Thanks, this sounds promising. I think we should run some microbenchmarks with and without these changes to better understand the overhead for nonlinear graphs.

On my setup, the checkIfGraphIsSinglePath function takes less than 0.01% of the total runtime of finalize for checking 2000 nodes.

Im not sure if just looking at the single function call is a fair metric, because the Schedule of the Graph is already available. That obviously is an implementation detail and won't necessarily take away the concerns about the complexity of this check. Another aspect that might be relevant: What if the check fails, it might be still beneficial to execute Schedule as-is on an in-order CommandList, or interleaving on multiple in-order CommandLists. That brings up the question if a scheduling hint that we'd pass as a property on graph finalization is a better option?

Extending running things in-order to be user controlled with a property or something definitely seems like it could be useful.

But as to the other point of discussion here, do we really need to care that much about small optimizations of finalize()? It is expensive by design and barring any outlandishly slow performance it seems to me it doesn't really matter much at all how it performs.

Regarding the idea of adding a hint/property to enable in-order command lists in more situation, it seems that probably requires some more in-depth discussion and is probably better done as a separate PR to avoid delaying this one too much.

sycl/test-e2e/Graph/event_profiling_info.cpp

sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc

sycl/test-e2e/Graph/event_profiling_info.cpp

sycl/source/detail/event_impl.hpp

Co-authored-by: Ewan Crawford <[email protected]>

EwanC

LGTM 🙂

sycl/source/detail/event_impl.hpp

sycl/include/sycl/ext/oneapi/experimental/graph.hpp

sycl/source/detail/graph_impl.cpp

sycl/source/detail/graph_impl.hpp

sycl/test-e2e/Graph/event_profiling_info.cpp

sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc

Bensuo · 2024-03-21T15:53:54Z

Upstream PR here (draft until UR changes merge): intel#13088

[SYCL][Graph] Enable in-order cmd-list

8c5dea5

Analyze the graph and apply enable the use of in-order command-list for linear graph. Add a property to finalize function to disable this optimization which is not compatible with profiling. Update the specification.

mfrancepillois added the Graph Implementation Related to DPC++ implementation and testing label Feb 23, 2024

mfrancepillois requested review from EwanC, ori-sky, reble, Bensuo and julianmi February 23, 2024 16:46

Add test for cehcking profiling when in-order command-list enabled

f671539

mfrancepillois mentioned this pull request Feb 26, 2024

[EXP][Command-Buffer] Optimize L0 command-buffer submission Bensuo/unified-runtime#9

Closed

mfrancepillois marked this pull request as ready for review February 26, 2024 17:34

EwanC reviewed Feb 27, 2024

View reviewed changes

sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc Outdated Show resolved Hide resolved

EwanC reviewed Feb 27, 2024

View reviewed changes

sycl/source/detail/graph_impl.cpp Outdated Show resolved Hide resolved

sycl/source/detail/graph_impl.cpp Outdated Show resolved Hide resolved

sycl/source/detail/graph_impl.hpp Outdated Show resolved Hide resolved

sycl/source/detail/graph_impl.hpp Outdated Show resolved Hide resolved

mfrancepillois added 2 commits February 27, 2024 10:44

Change property to enable_profiling + typo

15f02c0

Propagate enableProfiling property to UR.

0527464

mfrancepillois changed the title ~~[SYCL][Graph] Enable in-order cmd-list~~ [SYCL][Graph] Enable L0 optimizations (no profiling mode) Feb 27, 2024

Update spec

b55a301

reble reviewed Feb 27, 2024

View reviewed changes

sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc Outdated Show resolved Hide resolved

EwanC reviewed Feb 28, 2024

View reviewed changes

mfrancepillois and others added 2 commits February 28, 2024 09:54

Update sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc

d885aa0

Co-authored-by: Pablo Reble <[email protected]>

Add exception throwing for all backends if enable_profiling property …

b922a77

…is not passed to finalize(). + Update spec.

julianmi reviewed Feb 28, 2024

View reviewed changes

Typos

c084e19

EwanC reviewed Feb 29, 2024

View reviewed changes

mfrancepillois and others added 2 commits February 29, 2024 11:03

Update sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc

49630d8

Co-authored-by: Ewan Crawford <[email protected]>

Move test to unitest + typo

e4ee57e

EwanC approved these changes Mar 4, 2024

View reviewed changes

sycl/source/detail/event_impl.hpp Show resolved Hide resolved

Bensuo reviewed Mar 6, 2024

View reviewed changes

mfrancepillois added 2 commits March 6, 2024 14:26

Pass prop-list to executable_command_graph constructor + typos

0318696

Set enable-profling on nodes rather than graph_exec (finalize)

2127b90

mfrancepillois requested a review from EwanC March 8, 2024 11:07

mfrancepillois requested a review from Bensuo March 8, 2024 11:07

Bensuo approved these changes Mar 12, 2024

View reviewed changes

julianmi approved these changes Mar 14, 2024

View reviewed changes

Bensuo closed this Mar 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL][Graph] Enable L0 optimizations (no profiling mode) #358

[SYCL][Graph] Enable L0 optimizations (no profiling mode) #358

mfrancepillois commented Feb 23, 2024 •

edited

Loading

julianmi Feb 28, 2024

mfrancepillois Feb 28, 2024

julianmi Feb 28, 2024

mfrancepillois Feb 29, 2024

reble Mar 6, 2024

Bensuo Mar 13, 2024

Bensuo Mar 14, 2024

EwanC left a comment

Bensuo commented Mar 21, 2024

[SYCL][Graph] Enable L0 optimizations (no profiling mode) #358

[SYCL][Graph] Enable L0 optimizations (no profiling mode) #358

Conversation

mfrancepillois commented Feb 23, 2024 • edited Loading

julianmi Feb 28, 2024

Choose a reason for hiding this comment

mfrancepillois Feb 28, 2024

Choose a reason for hiding this comment

julianmi Feb 28, 2024

Choose a reason for hiding this comment

mfrancepillois Feb 29, 2024

Choose a reason for hiding this comment

reble Mar 6, 2024

Choose a reason for hiding this comment

Bensuo Mar 13, 2024

Choose a reason for hiding this comment

Bensuo Mar 14, 2024

Choose a reason for hiding this comment

EwanC left a comment

Choose a reason for hiding this comment

Bensuo commented Mar 21, 2024

mfrancepillois commented Feb 23, 2024 •

edited

Loading