RFC 98: Remote channels for cross-browsing-group communication #98

jgraham · 2021-09-21T10:16:43Z

rfcs/remote_channel.md

jgraham · 2021-09-29T10:20:33Z

Gentle ping here. This has been open for a week, so per the process I could go ahead and merge it, but I'm pretty sure you don't want me to do that ;)

hiroshige-g · 2021-10-05T01:08:14Z

rfcs/remote_channel.md

+field provides the line/column numbers of the original exception,
+where available.
+
+In addition there is a `RemoteWindow.executeScriptNoResult(fn,


Based on my experience on bfcache tests, I feel executeScriptNoResult() might be hard to use, because:

Mixing await executeScriptNoResult() and await executeScript() might complicates the timeline. Having await executeScript() only would be clear, because it serializes the script evaluation on caller and remote context.

The choice between await executeScriptNoResult() and await executeScript() might depend on internals of executeScript(). When I was migrating my bfcache tests from COOP-based framework to test_driver-based framework, I had to make changes around executeScriptNoResult()-equivalent that were hard to explain without knowing the internal structure of the framework, so I switched to the executeScript()-only interface.

So I'd prefer removing executeScriptNoResult().
Still there would remain complexities around navigation: instead of executeScriptNoResult(), I created promises / async callbacks triggered by executeScript() but not blocking async executeScript() in bfcache tests. But by removing executeScriptNoResult() I expect we can make the complexities only navigation-specific, not affecting RemoteWindow interface.

So (for the context of others) the technical concern here is that for bfcache tests in particular you need all sockets to be closed before initiating the navigation. The problem is that if you send a message, execute some script that closes all sockets, and then send the result, you end up re-opening a socket for the result. So there are broadly two choices:

Don't try to send a result at all. postMessage works like this, and executeScriptNoResult does the same: we simply don't provide a channel to send the result on so there's no concern about sockets being reopened.

Send a result, but ensure we navigate after the result has been sent, so nothing will implictly reopen a socket. Practically this means something like setTimeout(async url => {await closeAllSockets(); startNavigation(url)}, 0). The problem with that is that we are currently linting for setTimeout so the infra discourages even usage that's required like this.

I don't quite get the argument about serialization; in the case that you're putting the actual action of the remote script in a timeout the test window can guarantee the script has been sent, but the actual action of the script hasn't happened at the time the value is returned, so it doesn't seem like that method offers particuarly stronger guarantees around ordering. I suppose one advantage is that any exception that happens when evalutaing the script is reported through the return value, rather than just happening in the remote window.

So, I think I'm mostly OK with dropping executeScriptNoResult because, as you say, it's exposing an implementation detail and it seems cleaner to start with a smaller API. My concern is that for some kinds of tests (notably bfcache), understanding that implementation detail is very important, and there's an argument that making it an explicit part of the API will help people write correct tests. So I'd like to hear other opinions here.

hiroshige-g · 2021-10-05T01:09:30Z

rfcs/remote_channel.md

+at the time of navigation, otherwise the page will be excluded from
+bfcache. In the current prototype this is handled as follows:
+
+* A `pause` method on `SendChannel`. This causes a server-initiated


As commented at web-platform-tests/wpt#29803 (comment), I'm not sure how pause works and whether pause is needed (the draft impl doesn't use pause).

So the problem pause is intended to solve is where you have something like:

await remote.executeScript(() => startNavigation()); await remote.executeScript(() => ensureNavigationHappened());

With a poll() based interface this can work as written; the remote simply doesn't call poll() after startNavigation and before the navigation is complete. But with websockets, we don't have any synchronisation between the navigation and sending the ensureNavigationHappened message. If the test sends the second message before the socket is closed on the remote, it can end up being lost. So pause solves this by effectively providing a synchronisation point around the socket being closed:

await remote.executeScript(() => startNavigation()); remote.pause(); // We now know the remote now won't listen for messages until it reconnects to the socket after navigation await remote.executeScript(() => ensureNavigationHappened());

From the point of view of the remote, closeAllSockets() and pause() do race, but the effect of pause() is a server-initiated disconnect of the read channel for the remote, and the effect of closeAllSockets is a client-initiated disconnect of all channel-related sockets, so the ordering isn't very important.

We can avoid requiring pause by not sending requests to a remote after initiating a navigation, before getting a confirmation that the navigation is actually complete. In practice this means that the target page has to be a different RemoteWindow (i.e. have a different uuid) and has to process a message i.e. something like:

await remote.executeScript(() => startNavigation()); await navigationTargetRemote.executeScript(() => goBack()); await remote.executeScript(() => ensureNavigationHappened());

That's what the bfcache tests are doing now, and why pause isn't needed there, but either way the ergonomics are a bit tricky. I think we could drop it from the initial featureset, but these races around navigation are tricky to understand and debug.

rfcs/remote_channel.md

hiroshige-g · 2021-10-05T01:44:03Z

rfcs/remote_channel.md

+  via the main test window, make it hard to build an ergonomic
+  cross-context messaging API.
+
+### Proposal


I'd like to clarify (somewhere in the RFC) what are the dependencies of this RFC in terms of implementation.

IIUC this RFC can be implementable by pure JavaScript outside WPT infra + existing server-side stash, which is good because we can run the tests by running wpt server and bare browsers, without further depending on WPT test infra.
This RFC mentions integration with WebDriver BiDi, test_driver and testharness, but these are out of scope of this RFC itself.

Is my understanding correct?
Why the changes to resources/testharness.js and tools/wptrunner/wptrunner are needed in web-platform-tests/wpt#29803 ?

(Other changes, in JS/HTML files, tools/wptserve/wptserve/stash.py, and websockets/handlers/msg_channel_wsh.py look like "implementable by pure JavaScript outside WPT infra + existing server-side stash")

hiroshige-g · 2021-10-05T01:50:25Z

rfcs/remote_channel.md

+they are unable to communicate. For example windows opened with the
+`noopener` attribute will not have a handle to their parent, nor will
+the parent have a handle to the child. Similarly, cross-origin
+navigations with the `cross-origin-opener-policy` header appropriately


Is this RFC going to replace the framework in the COOP/COEP tests?

I don't expect the RFC to require moving the COOP/COEP tests. However I expect the RFC to provide all the features needed for those tests, and would anticipate the actual migration happening as a followup once the implementation has landed.

hiroshige-g · 2021-10-05T01:53:19Z

rfcs/remote_channel.md

+synchronize the navigation starting (which will close the socket) with
+writing the response.
+
+TODO: the naming here isn't great. In particular a `RemoteWindow`


Doesn't RemoteContext work (as proposed in #91)?

No, because RemoteContext is already used by testharness.js to mean something slightly different:

/* * A RemoteContext listens for test events from a remote test context, such * as another window or a worker. These events are then used to construct * and maintain RemoteTest objects that mirror the tests running in the * remote context. */

In the long term it would make sense to unify these features, but in the short term a naming conflict will cause breakage.

Maybe RemoteGlobalScope?

Would RemoteGlobal work? RemoteGlobalScope is quite a mouthful.

hiroshige-g · 2021-10-05T01:59:29Z

rfcs/remote_channel.md

+the transition may even be seamless.
+
+testdriver integration is possible. For example we could add
+`RemoteContext.testdriver.click` to execute a click in the remote


Would RemoteContext.testdriver.click be a pure JavaScript wrapper like RemoteContext.executeScript(() => test_driver.click())?

That doesn't work as written, because all testdriver commands have to go via the test window since that's the one that webdriver is using. So it would desugar into testdriver.click(RemoteContext.uuid), and testdriver would learn to look up contexts from a uuid parameter in the URL, as well as from the internal identifier it uses today.

hiroshige-g · 2021-10-05T02:10:43Z

rfcs/remote_channel.md

+without providing the API surface that only makes sense in a test
+window.
+
+It may be possible in the future to replace the backend with a


For further integration with WebDriver, testdriver etc., discussion about benefits and risks would be helpful (not necessarily now though, given that this is "Possible Future Additions").

One one hand ("Risk" side), I basically prefer not integrating the framework too much to WebDriver/test_driver etc., to make the framework work with minimum dependencies and work with bare browsers, and not to make debugging harder. We might want to JSON.stringify() instead of WebDriver BiDi format if we want to reduce complexity around serialization, given that the current tests considered so far don't need WebDriver BiDi capability beyond JSON.stringify().
So I'm interested in hearing more about the benefits (from the point of view of test writers and WPT infra maintainers) when we consider the further integration.

Yes, future additions here would be subject to the same RFC process. So the idea was just to sketch out some possibilites in case people felt there were specific parts that should be given additional consideration in the initial design, or would be high value to work on sooner.

In terms of the serialization format, we are already using extra complexity beyond JSON-stringify when serializing a SendChannel for message responses. We obviously could special case that, but in general I think using plain JSON as a serialization format for data where we might want to support non-JSON types is a bad idea (e.g. current WebDriver mostly uses plain JSON but also has super-hacky support for serializing Element objects which doesn't generalise to other types and is generally a mess). Aiming for a featureset closer to structured clone seems more future proof, but structured clone doesn't actually define a serialization. So, given that WebDriver BiDi has very similar requirements, and actually defines the wire format, it seems sensible to use that as a reference point. The fact that, going forward, we could use WebDriver BiDi instead of the stash as the backend in the wptrunner implementation is good, but it's not the primary motivation here.

hiroshige-g · 2021-10-05T02:19:24Z

rfcs/remote_channel.md

+
+TODO: the naming here isn't great. In particular a `RemoteWindow`
+could actually be some other kind of global like a worker, and
+`start_window()` is a pretty nondescript method name.


I did a similar thing to start_window() in my bfcache tests and I was also not sure about the better naming, better semantics, etc. (and thus I just made it const executor = new Executor(uuid); and avoided explicit naming).
So I'm also curious how this should be named.

hiroshige-g · 2021-10-05T02:29:14Z

Commented somehow verbosely to move discussion forward.

As for executeScript(), the RFC basically looks similar to what the current RemoteContext.execute_script() and bfcache tests do, while there are more primitives around navigations (executeScriptNoResult, pause and closeAllChannelSockets in this RFC while the bfcache tests only have closeAllChannelSockets-equivalent, as commented inline above) and I'd like to understand the differences (IIUC there aren't fundamental differences, because I expect switching the impl from Fetch API to websocket doesn't impact so much).
So in terms of executeScript(), this RFC looks like switching from Fetch API to websocket based implementation (compared to the previous RFCs and web-platform-tests/wpt#28950), both pure JavaScript-based, and thus doesn't cause test bodies except for mechanical changes (correct?).

The RFC is more about replacing the existing send() and receive() primitives currently used in COEP/COOP tests, by introducing related API classses, adding more APIs in addition to send()/receive() etc., and I expect test writers (other than me) and WPT infra people might have more interests on these.

cc/ @ArthurSonzogni and @foolip.

jgraham · 2021-10-05T11:31:22Z

@hiroshige-g Thanks for the detailed comments, it's much appreciated! I've responded to the technical concerns, hopefully in a way that also provides context for other reviewers. I'll update the RFC for the more editorial issues. Please let me know if I miss[ed] anything.

Co-authored-by: Ms2ger <[email protected]>

* Remove executeScriptNoResult * Rename `start_window()` to `start_window_channel()` and add `window_channel()` without the auto-connect behaviour. * Change the (de)serialization model to automatically create local objects (like structuredClone) rather than requiring a `toLocal()` call. * Updates from PR feedback.

foolip · 2021-10-12T15:58:43Z

Ping @web-platform-tests/wpt-core-team for review on this. ~~I'll be taking a look tomorrow~~ (I am a liar) but more review is better!

rfcs/remote_channel.md

foolip · 2021-10-25T11:22:02Z

rfcs/remote_channel.md

+[PR 29803](https://github.com/web-platform-tests/wpt/pull/29803)
+contains a prototype implementation of this.
+
+<!--  LocalWords:  UUID WebDriver wptrunner testharness APIs UI


What's LocalWords?

Oh sorry this is emacs embedding the extra dictionary entries.

Do you want to keep them, then?

Well it's convenient for me when editing, but I could remove them once the overall RFC is approved, right before merging.

Co-authored-by: Philip Jägenstedt <[email protected]>

Require a message type. Also support connect and close messages. Provide the message type in the callback.

jgraham · 2021-11-04T18:15:56Z

Ping, again. This has now gone another week since the last round of updates without further review.

foolip · 2021-11-22T11:38:00Z

rfcs/remote_channel.md

-send messages to the remote. Alternatively the `RemoteWindow` may be
-created first and its `uuid` property used when constructing the URL.
+This API is provided by a `RemoteGlobal` object. The `RemoteGlobal`
+object doesn't handle creating the browsing context (or other global


Good clarification!

foolip

Just some missing punctuation.

rfcs/remote_channel.md

jgraham force-pushed the remote_channel branch from edc3703 to ff05826 Compare September 21, 2021 10:21

jgraham force-pushed the remote_channel branch from ff05826 to 65f40ea Compare September 21, 2021 10:28

jgraham changed the title ~~Add proposal for remote channels for cross-browsing-group communication~~ RFC 98: Remote channels for cross-browsing-group communication Sep 21, 2021

jgraham force-pushed the remote_channel branch from 65f40ea to 1f8064c Compare September 21, 2021 10:29

Ms2ger reviewed Sep 21, 2021

View reviewed changes

jgraham mentioned this pull request Sep 23, 2021

[WPT] Introduce RemoteContext.execute_script() and add basic BFCache tests + helpers web-platform-tests/wpt#28950

Merged

hiroshige-g reviewed Oct 5, 2021

View reviewed changes

rfcs/remote_channel.md Show resolved Hide resolved

hiroshige-g reviewed Oct 5, 2021

View reviewed changes

rfcs/remote_channel.md Outdated Show resolved Hide resolved

hiroshige-g reviewed Oct 5, 2021

View reviewed changes

jgraham and others added 3 commits October 11, 2021 16:57

RFC 98: Remote channels for cross-browsing-group communication

c92c15c

Do whatever Ms2ger says

c5232f1

Co-authored-by: Ms2ger <[email protected]>

jgraham force-pushed the remote_channel branch from 85856e3 to a5d3c2b Compare October 11, 2021 15:57

foolip reviewed Oct 25, 2021

View reviewed changes

Apply suggestions from code review

add0a81

Co-authored-by: Philip Jägenstedt <[email protected]>

jgraham added 8 commits October 26, 2021 20:22

Rename RemoteWindow to RemoteGlobal

7c7924f

Rename executeScript to call

b025716

Make the Channel.addEventListener funtion act more like platform APIs

31d0b85

Require a message type. Also support connect and close messages. Provide the message type in the callback.

Rename pause() to disconnectReader()

f43b556

Clarify behaviour with multiple UUIDs

3552773

Rename next() to nextMessage()

6051fd0

RemoteObject objectId must be defined

89c39b6

Cleanup

a658a7d

jgraham requested a review from foolip October 27, 2021 10:24

Clarify the nested value representation

9843a71

foolip reviewed Nov 22, 2021

View reviewed changes

foolip approved these changes Nov 22, 2021

View reviewed changes

rfcs/remote_channel.md Outdated Show resolved Hide resolved

Update rfcs/remote_channel.md

f84f0b5

foolip merged commit 49d43a7 into master Nov 22, 2021

foolip deleted the remote_channel branch November 22, 2021 11:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC 98: Remote channels for cross-browsing-group communication #98

RFC 98: Remote channels for cross-browsing-group communication #98

jgraham commented Sep 21, 2021 •

edited

Loading

jgraham commented Sep 29, 2021

hiroshige-g Oct 5, 2021

jgraham Oct 5, 2021

hiroshige-g Oct 5, 2021

jgraham Oct 5, 2021

hiroshige-g Oct 5, 2021

hiroshige-g Oct 5, 2021

jgraham Oct 5, 2021

hiroshige-g Oct 5, 2021

jgraham Oct 5, 2021

foolip Oct 25, 2021

jgraham Oct 26, 2021

hiroshige-g Oct 5, 2021

jgraham Oct 5, 2021

hiroshige-g Oct 5, 2021

jgraham Oct 5, 2021

hiroshige-g Oct 5, 2021

hiroshige-g commented Oct 5, 2021

jgraham commented Oct 5, 2021 •

edited

Loading

foolip commented Oct 12, 2021 •

edited

Loading

foolip Oct 25, 2021

jgraham Oct 26, 2021

foolip Nov 22, 2021

jgraham Nov 22, 2021

jgraham commented Nov 4, 2021

foolip Nov 22, 2021

foolip left a comment

RFC 98: Remote channels for cross-browsing-group communication #98

RFC 98: Remote channels for cross-browsing-group communication #98

Conversation

jgraham commented Sep 21, 2021 • edited Loading

jgraham commented Sep 29, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hiroshige-g commented Oct 5, 2021

jgraham commented Oct 5, 2021 • edited Loading

foolip commented Oct 12, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jgraham commented Nov 4, 2021

Choose a reason for hiding this comment

foolip left a comment

Choose a reason for hiding this comment

jgraham commented Sep 21, 2021 •

edited

Loading

jgraham commented Oct 5, 2021 •

edited

Loading

foolip commented Oct 12, 2021 •

edited

Loading