Introduce repeat and RepeatOp #3687

Merged · 4 commits into main · Jan 10, 2025

Conversation

@naoyam (Collaborator) commented Jan 9, 2025

Adds repeat as an alias op, as well as the RepeatOp IR node. The repeat op has almost the same semantics as PyTorch's repeat.
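For reference, PyTorch's repeat tiles a tensor along each dimension. A minimal sketch of the intended usage, assuming a C++ signature analogous to `torch.Tensor.repeat` (the exact signature and the `makeSymbolicTensor` test helper are assumptions of this sketch, not confirmed by this thread):

```cpp
// Sketch only: assumes an alias-op signature along the lines of
//   TensorView* repeat(TensorView* inp, const std::vector<int64_t>& repeat_times);
// mirroring torch.Tensor.repeat.
TensorView* tv0 = makeSymbolicTensor(2); // logical domain [i0, i1]
fusion.addInput(tv0);
TensorView* tv1 = repeat(tv0, {2, 3});   // [2*i0, 3*i1], like x.repeat(2, 3)
fusion.addOutput(tv1);
```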

The main motivation is to fix #3682, which is due to #3645, which introduced a preseg pass that detects a repeat pattern and translates it to broadcast, expand and reshape. The issue in #3682 arises because the translation-based approach does not work when a broadcast ID is repeated. I originally just used TensorDomain::flatten (https://github.com/NVIDIA/Fuser/blob/main/csrc/ir/nodes.cpp#L3674-L3740), which simply merges broadcast IDs. However, for reshape, broadcast IDs should not be merged but squeezed. Merging broadcast IDs triggered an assertion in the transpose scheduler, as seen in #3682.

TensorDomain::flatten needs to be fixed (#3691), but that's a separate issue. To fix #3682, since repeating broadcast IDs cannot be translated to the broadcast-expand-reshape pattern anyway, I added the new RepeatOp node. I initially thought it could just be a LoadStoreOp but decided to add a distinct IR node since, unlike the usual LoadStore case, some of the broadcast IDs of a producer become concrete IDs in the corresponding consumer logical domain. I did actually try using LoadStoreOp, but some of the preseg passes complained about the mismatched broadcast pattern.

Repeating non-broadcast IDs is still done via the broadcast-expand-reshape pattern (see the sketch below). Only repetition of broadcast IDs is represented with the RepeatOp node.
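To make the translation concrete: repeating a non-broadcast ID i0 twice means broadcasting to [b, i0], expanding the broadcast to [2, i0], then flattening to [2*i0]. A minimal sketch of that pattern (op names follow nvFuser's public C++ alias ops; the exact signatures here are assumptions):

```cpp
// Broadcast-expand-reshape translation of repeat for a non-broadcast ID.
// Sketch only; exact signatures are assumptions.
TensorView* tv0 = makeSymbolicTensor(1);          // [i0]
TensorView* tv1 = broadcast(tv0, {true, false});  // [b, i0]
TensorView* tv2 = expand(
    tv1,
    {IrBuilder::create<Val>(2L), tv0->axis(0)->extent()}); // [2, i0]
TensorView* tv3 = flatten(tv2, 0, 1);             // [2*i0]
```

When i0 itself is a broadcast ID, the final merge step is exactly what goes wrong, hence the dedicated RepeatOp path for that case.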

Fixes #3682

Almost the same semantics as PyTorch repeat.

Previously, repeat was only partially supported as a translation from a repeat
pattern using concat, which had a bug when repeating broadcast IDs. This
PR fixes the issue by handling broadcast separately using a new IR node,
RepeatOp, which represents repetition of broadcast IDs.
@naoyam (Collaborator, Author) commented Jan 9, 2025

!test

@naoyam marked this pull request as ready for review January 9, 2025 20:00
@naoyam requested a review from wujingyue January 9, 2025 20:01
csrc/ops/alias.cpp (review thread resolved; outdated)
}

std::string RepeatOp::toInlineString(int indent_size) const {
NVF_CHECK(false, "Tensor op can not be printed inline");
Collaborator:

Question for my understanding: is it correct that only IterDomain ops can be printed inline according to our convention?

Collaborator (Author):

Looks like that's the case, but I just copied this from ExpandOp::toInlineString.

csrc/ir/nodes.cpp (review thread resolved; outdated)
sizes.reserve(out()->getLogicalDomain().size());
const auto c2p =
PairwiseLogicalDomainMap(in(), out()).mapConsumerToProducer();
for (const auto i : c10::irange(out()->getLogicalDomain().size())) {
Collaborator:

Skip reduction?

Collaborator (Author):

Since this is the output, it should not have a reduction ID. I added an assertion in the constructor.
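For illustration, a minimal sketch of such a constructor-side check, assuming IterDomain::isReduction and the NVF_ERROR macro (the actual assertion in the merged code may differ):

```cpp
// Sketch: the output's logical domain should contain no reduction IDs.
for (IterDomain* id : out()->getLogicalDomain()) {
  NVF_ERROR(
      !id->isReduction(),
      "RepeatOp output must not have reduction IterDomains");
}
```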

csrc/ir/nodes.cpp (three review threads resolved; outdated)
@@ -504,6 +504,10 @@ class ComputeAtLogicalDomainMapBuilder : private BackwardVisitor {
mapPointwiseLikeOp(op);
}

void handle(RepeatOp* op) override {
mapPointwiseLikeOp(op);
@wujingyue (Collaborator) commented Jan 9, 2025:

Is it supposed to be mapped as pointwise? The input and the output don't even have the same extent.

tests/cpp/test_gpu3.cpp (review thread resolved; outdated)
csrc/ops/alias.cpp (review thread resolved)
csrc/preseg_passes/translate_repeat_to_expand.cpp (review thread resolved; outdated)
tests/cpp/test_preseg_passes.cpp (review thread resolved)
tests/cpp/test_preseg_passes.cpp (review thread resolved; outdated)
@naoyam (Collaborator, Author) commented Jan 10, 2025

!build

@naoyam (Collaborator, Author) commented Jan 10, 2025

!build

@naoyam merged commit 78db5e1 into main on Jan 10, 2025
15 of 16 checks passed
@naoyam deleted the repeat branch January 10, 2025 01:26