Syntactic `impl` declaration matching updates #4762

josh11b · 2025-01-06T23:50:32Z

When checking whether `impl` declarations match, ignore the difference between `Self as` and `as`, and ignore `where` clauses at the end of the `impl` declaration, as specified in Matching redeclarations #3763.
Allow `impl` declarations with different constraint ids to match, as long as the facet type of the constraint has the same `interface_id` and `specific_id`.
Add some TODOs reflecting future facet type resolution.
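
As a rough illustration of the matching rule described above, here is a minimal C++ sketch over a toy token list. It is not the toolchain's implementation, which works on parse tree nodes rather than strings. `Tokens`, `NormalizeForMatch`, and `SyntacticallyMatch` are hypothetical names invented for this example, and bracket counting is an assumed stand-in for real nesting information.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy model (assumption): an impl declaration as a flat list of tokens,
// e.g. {"impl", "Self", "as", "I", "where", ".T", "=", "i32"}.
using Tokens = std::vector<std::string>;

// Returns the tokens that take part in matching. Drops a `Self` that sits
// directly before the top-level `as`, and drops everything from a top-level
// `where` that follows the `as`. A `where` nested in brackets is kept.
inline auto NormalizeForMatch(const Tokens& decl) -> Tokens {
  Tokens result;
  int depth = 0;
  bool seen_as = false;
  for (std::size_t i = 0; i < decl.size(); ++i) {
    const std::string& tok = decl[i];
    if (tok == "(" || tok == "[") {
      ++depth;
    } else if (tok == ")" || tok == "]") {
      --depth;
    } else if (depth == 0 && tok == "as") {
      seen_as = true;
    }
    if (depth == 0 && !seen_as && tok == "Self" && i + 1 < decl.size() &&
        decl[i + 1] == "as") {
      continue;  // `impl Self as` is treated like `impl as`.
    }
    if (depth == 0 && seen_as && tok == "where") {
      break;  // Trailing `where` clause does not take part in matching.
    }
    result.push_back(tok);
  }
  return result;
}

// Two declarations match syntactically if their normalized tokens are equal.
inline auto SyntacticallyMatch(const Tokens& a, const Tokens& b) -> bool {
  return NormalizeForMatch(a) == NormalizeForMatch(b);
}
```

Under this sketch, `impl Self as I where .T = i32` matches `impl as I`, while `impl as J(K where .T = i32)` does not match `impl as J(K)` because the nested `where` is part of the compared tokens.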

zygoloid · 2025-01-06T23:59:13Z

toolchain/check/handle_impl.cpp

+      --last_param_iter;
+      CARBON_CHECK(first_param_iter < last_param_iter);
+    } while (where_operands_to_skip > 0);
+  }


Would it be better to defer this until we build the `DeclParams` objects in `MergeImplRedecl`? Then we wouldn't need to do the scan until we're about to compare two token sequences anyway. That would avoid the scan entirely when the impl isn't in the same bucket as any other impl (which is probably a common case), and would likely increase locality, since we'd be doing two checks of nearby tokens at around the same time.
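
One way to picture the deferral is a value that is computed only on first use and then cached. The sketch below is a generic illustration under that assumption; `LazyValue` is a hypothetical helper, not something in the toolchain, and the real change would need to decide where such a cached result lives.

```cpp
#include <cassert>
#include <functional>
#include <optional>
#include <utility>

// Hypothetical helper: runs `compute` at most once, on the first `Get()`.
template <typename T>
class LazyValue {
 public:
  explicit LazyValue(std::function<T()> compute)
      : compute_(std::move(compute)) {}

  // Returns the cached value, computing it first if needed.
  auto Get() -> const T& {
    if (!value_) {
      value_ = compute_();
    }
    return *value_;
  }

  // True once the computation has run.
  auto computed() const -> bool { return value_.has_value(); }

 private:
  std::function<T()> compute_;
  std::optional<T> value_;
};
```

With something like this, an impl that never shares a bucket with another impl never calls `Get()`, so the scan never runs; an impl compared several times pays for the scan once.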

@jonmeow What do you think?

To try to offer a few upsides to the current approach:

When we're doing it here the nodes will have recently been loaded [for checking]. So from a cache locality perspective, these are the nodes we just checked and may still be in cache. If deferred, it may result in overlapping impls being reevaluated (I don't know how expensive that'd be).

This uses a reverse iteration, whereas `MergeImplRedecl` uses a forward iteration, so it may not cache well if the range turns out to be "large".

By doing it here, does that make it easier to save the results so that the scan won't be repeated?

But maybe I'd bring up a more radical option: in WhereExpr handling, if there's an impl on the stack and it hasn't reached a where clause, had you considered trying to store the where location? Maybe that could be annotated as part of decl_introducer_state_stack. Could that give the result you want without the scan of nodes?

I agree that we could avoid the scan by saving something in the `WhereExpr` handling, but it seems really fragile. There are a lot of places where a `where` clause can occur in an impl declaration that we don't want to truncate at.

impl forall [T:! I where...] ...
impl DynPtr(Container where .ElementType = i32) as ...
impl ... as J(K where ...)
impl ... as (... where ...) where ...

I think the current code is easier to get right since it directly looks in the one place we want to operate on.

In each of the cases you're showing, it seems like there'd be something on the NodeStack indicating the nesting:

`[` at `[T:!`

`(` at `(Container`

`(` at `(K`

`(` at `(... where`

Maybe it's worth thinking about further?

Put differently, is there a characteristic here where the 2 nodes before the `where` must be something like `forall` or `as`?

jonmeow · 2025-01-07T00:28:52Z

toolchain/check/handle_impl.cpp

+      --last_param_iter;
+      CARBON_CHECK(first_param_iter < last_param_iter);
+    } while (where_operands_to_skip > 0);
+  }


jonmeow · 2025-01-07T00:38:06Z

toolchain/check/handle_impl.cpp

@@ -210,23 +210,41 @@ static auto PopImplIntroducerAndParamsAsNameComponent(
                              }));
  }

-  Parse::NodeId first_param_node_id =
-      context.node_stack().PopForSoloNodeId<Parse::NodeKind::ImplIntroducer>();
+  Parse::Tree::PostorderIterator first_param_iter(


It looks like you wrap this just to write the CHECKs. Had you considered instead only doing the iterator construction inside the CHECK?

I'm particularly wondering about this because, whereas `last_param_iter` is mutated, this isn't, so having it be an iterator instead of stored in `first_param_node_id` meant it took me time to understand the code.

I've made the change so you can see how it looks. I'm neutral -- this new version is definitely longer/noisier, but if it makes the code easier to understand, that is an improvement.

Another option I was kind of wondering about is if we should just make NodeId inherit from IndexBase, allowing you to do the comparison directly (converting the iterator to a nodeid with just *). Thoughts?
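
To make that last option concrete, here is a small sketch of what direct comparison could look like. The types below are toy stand-ins with hypothetical names (`ToyIndexBase`, `ToyNodeId`, `ToyPostorderIterator`); they are assumptions for illustration and do not reproduce the toolchain's actual `IndexBase`, `NodeId`, or iterator definitions.

```cpp
#include <cassert>
#include <cstdint>
#include <type_traits>

// Toy stand-in for a base type that holds an index.
struct ToyIndexBase {
  std::int32_t index;
};

// Ordering for any ID type derived from the toy base. Both operands must
// have the same type, so IDs of different kinds cannot be compared.
template <typename IdT,
          typename = typename std::enable_if<
              std::is_base_of<ToyIndexBase, IdT>::value>::type>
constexpr auto operator<(IdT lhs, IdT rhs) -> bool {
  return lhs.index < rhs.index;
}

// Toy node ID that inherits the index and therefore the comparison.
struct ToyNodeId : ToyIndexBase {
  explicit constexpr ToyNodeId(std::int32_t i) : ToyIndexBase{i} {}
};

// Toy iterator: dereferencing yields the node ID it currently points at.
class ToyPostorderIterator {
 public:
  explicit ToyPostorderIterator(ToyNodeId id) : id_(id) {}
  auto operator*() const -> ToyNodeId { return id_; }
  auto operator--() -> ToyPostorderIterator& {
    --id_.index;
    return *this;
  }

 private:
  ToyNodeId id_;
};
```

With these, a check could read `assert(first_param_node_id < *last_param_iter);`, keeping the first parameter as a plain ID and only the mutated end as an iterator.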

jonmeow · 2025-01-07T00:42:31Z

toolchain/check/handle_impl.cpp

+    --last_param_iter;
+    CARBON_CHECK(first_param_iter < last_param_iter);
+    do {
+      node_kind = context.parse_tree().node_kind(*last_param_iter);
+      if (node_kind == Parse::NodeKind::WhereExpr) {
+        // If we have a nested `where`, we need to see another `WhereOperand`
+        // before we find the one that matches our original `WhereExpr` node.
+        ++where_operands_to_skip;
+      } else if (node_kind == Parse::NodeKind::WhereOperand) {
+        --where_operands_to_skip;
+      }
+      --last_param_iter;
+      CARBON_CHECK(first_param_iter < last_param_iter);
+    } while (where_operands_to_skip > 0);


What do you think about using a while to avoid the copied CHECK? You could also move in --last_param_iter; like so:

Suggested change

-    --last_param_iter;
-    CARBON_CHECK(first_param_iter < last_param_iter);
-    do {
-      node_kind = context.parse_tree().node_kind(*last_param_iter);
-      if (node_kind == Parse::NodeKind::WhereExpr) {
-        // If we have a nested `where`, we need to see another `WhereOperand`
-        // before we find the one that matches our original `WhereExpr` node.
-        ++where_operands_to_skip;
-      } else if (node_kind == Parse::NodeKind::WhereOperand) {
-        --where_operands_to_skip;
-      }
-      --last_param_iter;
-      CARBON_CHECK(first_param_iter < last_param_iter);
-    } while (where_operands_to_skip > 0);
+    while (where_operands_to_skip > 0) {
+      --last_param_iter;
+      CARBON_CHECK(first_param_iter < last_param_iter);
+      node_kind = context.parse_tree().node_kind(*last_param_iter);
+      if (node_kind == Parse::NodeKind::WhereExpr) {
+        // If we have a nested `where`, we need to see another `WhereOperand`
+        // before we find the one that matches our original `WhereExpr` node.
+        ++where_operands_to_skip;
+      } else if (node_kind == Parse::NodeKind::WhereOperand) {
+        --where_operands_to_skip;
+      }
+    }

The problem is that this code isn't quite the same, since it doesn't skip the final `WhereOperand` node. We want to skip it so that `Foo where...` compares equal to `Foo` without a `where`. A version that doesn't repeat the decrement and `CHECK`, and still skips that node, is:

    while (true) {
      --last_param_iter;
      CARBON_CHECK(first_param_iter < last_param_iter);
      if (where_operands_to_skip == 0) {
        break;
      }
      node_kind = context.parse_tree().node_kind(*last_param_iter);
      // ...

which isn't shorter, but maybe the lack of repetition makes the code more robust? I'm not sure. Unfortunately, C++ doesn't have a convenient control-flow construct for loop-and-a-half.

FWIW I'd lean towards that, although maybe I'm being biased about do/while. Ultimately I don't have a strong preference (although will Carbon have do/while?)
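
To compare the three loop shapes from this thread side by side, here is a self-contained sketch over a toy postorder list of node kinds. Everything in it is an assumption made for illustration: `Kind`, `Visit`, `KindAt`, and the three function names are hypothetical, plain `assert` stands in for `CARBON_CHECK`, integer indices stand in for iterators, and the counter is assumed to start at 1 for the trailing `WhereExpr` at index `last`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

enum class Kind { Other, WhereOperand, WhereExpr };

// Updates the count of `WhereOperand` nodes still to be passed.
inline void Visit(Kind kind, int& where_operands_to_skip) {
  if (kind == Kind::WhereExpr) {
    ++where_operands_to_skip;  // Nested `where`: one more operand to pass.
  } else if (kind == Kind::WhereOperand) {
    --where_operands_to_skip;
  }
}

inline auto KindAt(const std::vector<Kind>& nodes, int i) -> Kind {
  return nodes[static_cast<std::size_t>(i)];
}

// Shape of the original do/while: decrement and check appear twice.
inline auto SkipWhereDoWhile(const std::vector<Kind>& nodes, int first,
                             int last) -> int {
  int where_operands_to_skip = 1;
  --last;
  assert(first < last);
  do {
    Visit(KindAt(nodes, last), where_operands_to_skip);
    --last;
    assert(first < last);
  } while (where_operands_to_skip > 0);
  return last;
}

// Shape of the plain `while` suggestion: stops ON the matching operand.
inline auto SkipWherePlainWhile(const std::vector<Kind>& nodes, int first,
                                int last) -> int {
  int where_operands_to_skip = 1;
  while (where_operands_to_skip > 0) {
    --last;
    assert(first < last);
    Visit(KindAt(nodes, last), where_operands_to_skip);
  }
  return last;
}

// Loop-and-a-half: one decrement and check, exit in the middle.
inline auto SkipWhereLoopAndAHalf(const std::vector<Kind>& nodes, int first,
                                  int last) -> int {
  int where_operands_to_skip = 1;
  while (true) {
    --last;
    assert(first < last);
    if (where_operands_to_skip == 0) {
      break;
    }
    Visit(KindAt(nodes, last), where_operands_to_skip);
  }
  return last;
}
```

For a toy sequence `{Other, Other, WhereOperand, Other, WhereExpr}` with `first = 0` and `last = 4`, the do/while and loop-and-a-half versions both end at index 1 (just before the `WhereOperand`), while the plain `while` ends at index 2 (on the `WhereOperand`), which is the difference pointed out above.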

toolchain/check/handle_impl.cpp

toolchain/check/merge.cpp

jonmeow

LG -- discussed the as lookup approach offline, approving and merging for parallelism.

josh11b added 30 commits December 27, 2024 17:52

Checkpoint progress.

bd77cfa

Checkpoint progress.

7502868

DumpIfValid -> DumpNameIfValid

8bf2b71

Remove extra newline

f307afb

Merge remote-tracking branch 'upstream/trunk' into dump

8c7404a

Failing tests

415c186

Debug

884821d

Merge branch 'dump' into assoc

0fe446e

Roughly working

edb5c9f

Update tests

a987574

Checkpoint progress.

9e0c236

More diagnostics

f7bbc18

Add TODOs about broken syntactic impl match

2e0014a

Exclude where from syntactic impl match

ab99a06

impl Self as now matches impl as

adfdfe4

Checkpoint progress.

aff650a

Checkpoint progress.

2285d25

add impl_self_as tests

11abc91

Comment

eed17cd

Checkpoint progress.

539474e

Skip difference between Self and Self as in `CheckRedeclParamSyntax` instead

c7e2e18

TODO comments

a68b52d

Checkpoint progress.

ee6fa85

Merge remote-tracking branch 'upstream/trunk' into assoc

f25dda2

Checkpoint progress.

c7d82d6

Checkpoint progress.

e17d9f6

Test updates

4d1155d

Syntactic impl declaration matching updates

95f98c1

Checkpoint progress.

694204e

Checkpoint progress.

8afd387

Checkpoint progress.

1580edb

josh11b requested a review from zygoloid January 6, 2025 23:50

github-actions bot requested a review from geoffromer January 6, 2025 23:50

github-actions bot added the toolchain label Jan 6, 2025

josh11b requested review from jonmeow and removed request for geoffromer January 6, 2025 23:51

zygoloid reviewed Jan 6, 2025

More tests

21c5af6

jonmeow reviewed Jan 7, 2025

Implement suggestions

a320cbf

jonmeow approved these changes Jan 7, 2025

jonmeow added this pull request to the merge queue Jan 7, 2025

Merged via the queue into carbon-language:trunk with commit 1d379ff Jan 7, 2025
8 checks passed

josh11b deleted the syn branch January 7, 2025 23:47

josh11b mentioned this pull request Jan 8, 2025

Save WhereOperand node instead of performing a scan #4772

Draft

jonmeow mentioned this pull request Jan 13, 2025

Save WhereOperand node instead of performing a scan #4795

Closed
