Fix Infrequent Polling sample #150

drewhoskins-temporal · 2024-11-21T00:56:23Z

What was changed

Fix the polling/infrequent sample. It wasn't correctly calculating attempts and was therefore running forever.

Why?

Checklist

Test plan:

Verify that it works:
poetry run pytest tests/polling/infrequent/workflow_test.py --workflow-environment time-skipping

Verify that it skips in non-time-skipping mode, to avoid slow test:

poetry run pytest tests/polling/infrequent/workflow_test.py

Any docs updates needed?

tests/polling/infrequent/workflow_test.py

dandavison · 2024-11-21T01:18:32Z

polling/test_service.py

-        )
-        self.try_attempts += 1
-        if self.try_attempts % self.error_attempts == 0:
+    async def get_service_result(self, input, attempt: int):


Suggested change

-    async def get_service_result(self, input, attempt: int):
+    async def get_service_result(self, input: ComposeGreetingInput, attempt: int):

I went a couple turns down this rabbit hole, but it turns out to require a shockingly large refactoring, which is not worth doing.

dandavison · 2024-11-21T03:33:58Z

polling/frequent/activities.py

    while True:
        try:
            try:
-                result = await test_service.get_service_result(input)
+                result = await test_service.get_service_result(input, attempt)


I think it's desirable to keep the sample's previous behavior of not passing the attempt number to the test service, since in real-world examples the service in question won't want to know about our attempt counting.

In other words, the way it is now will make some readers think that the solution to the polling-from-activity problem involves passing an attempt count to a service.

Is there a way to fix the algorithm without explicitly passing the attempt number to the service?

I understand and like the meta-point of wanting samples to look real-world, though this one feels shippable anyway. I could use a global, but it wouldn't work out-of-process, and anyway I'm not inspired to spend the time to change this.

OK, I do think it's worth finding a way to make your test pass without making the sample misleading. Here's a global variable version: #152 (I'm starting to see why the code had that modulo-arithmetic return condition!)

Yeah, exactly, this doesn't work predictably when it's run multiple times on the same worker. I really don't think anybody's going to be misled by it as it is, and I would greatly prefer predictability and test isolation as a user.
That said, I wouldn't block your approach if you feel strongly enough.
Can you either accept my PR and then add on top of it, or just leave it as I have it?
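For readers following the thread, here is a minimal sketch of the trade-off under discussion, assuming a module-level counter with a modulo return condition. It is not the code from either PR; `ERROR_ATTEMPTS`, `ServiceNotReady`, `poll`, and `main` are hypothetical names, and plain asyncio stands in for the activity and worker.

```python
import asyncio

# Hypothetical setting: the toy service fails four calls, then succeeds on the fifth.
ERROR_ATTEMPTS = 5

# Module-level state, shared by every caller in this process.
_try_attempts = 0


class ServiceNotReady(Exception):
    """Raised while the toy service is still 'coming up'."""


async def get_service_result(name: str) -> str:
    """Succeed on every ERROR_ATTEMPTS-th call, counted across all callers."""
    global _try_attempts
    _try_attempts += 1
    if _try_attempts % ERROR_ATTEMPTS == 0:
        return f"Hello, {name}!"
    raise ServiceNotReady()


async def poll(name: str) -> int:
    """Poll until the service answers; return how many calls this poller made."""
    calls = 0
    while True:
        calls += 1
        try:
            await get_service_result(name)
            return calls
        except ServiceNotReady:
            # Yield to other tasks; a real poller would sleep between attempts.
            await asyncio.sleep(0)


async def main():
    # Sequential runs: the modulo condition lets the second run succeed as well.
    first = await poll("a")
    second = await poll("b")
    # Concurrent runs share the counter, so the calls are split between them.
    concurrent = await asyncio.gather(poll("c"), poll("d"))
    return first, second, concurrent


if __name__ == "__main__":
    print(asyncio.run(main()))
```

Run sequentially, each poller makes five calls, which is why the modulo condition helps. Run concurrently, the two pollers advance the same counter, so neither one sees its own five attempts; that is the shared-state unpredictability mentioned above.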

GitHub was having problems with this PR after I brought it up-to-date with main and it's closed currently. Are you able to reopen it?

The code in this sample falls into two categories:

1. Code that we are providing to users, essentially saying "follow this closely; this is what your code should look like". Note that users may well copy and paste code in this category. In particular: workflows.py, activities.py

2. Private implementation detail of the sample, which exists only to make the sample work. Users will not copy and paste this code because, in their use cases, its role will be played by something specific to their domain. In particular: test_service.py

For (2), the basic idea of the sample is to create a toy service that is stateful, only responding after a few attempts. But that statefulness needs to be private to the test_service implementation, since it's modeling real-world slowness-to-come-up. The one thing we don't want to do is give users the impression that they need to count their attempts in their activity code and send them to a service. The sample in its current form takes care not to do that, and we don't need to make the sample worse in that respect.

There are a few options I think:

1. We could move the counter in the test_service to the module level and use modulo arithmetic to address the issue that it might be used by multiple workflows.

2. We could move the line test_service = TestService() in activities.py to the top level and use modulo arithmetic to address the issue that it might be used by multiple workflows.

3. We could use a dict keyed by activity.info().workflow_id for the counter in test_service

All those would teach the lesson we're trying to teach correctly. (1) and (2) are suboptimal in that they'd result in some potentially confusing shared state if someone were to run multiple instances of the sample concurrently. So (3) is perhaps the best choice, seeing as it's hardly more complex than (1) and (2).

I've implemented option (3) in #152, which builds upon this PR (i.e. using the test you added).
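A rough sketch of what option (3) could look like, for illustration only; it is not necessarily what #152 does. To keep it runnable without a Temporal worker, the workflow ID comes from a hypothetical `key_fn` callable (in the sample this role would be played by `activity.info().workflow_id`). The fields of `ComposeGreetingInput`, the `ServiceDownError` exception, and the `poll_until_ready` helper are also assumptions.

```python
import asyncio
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable


@dataclass
class ComposeGreetingInput:
    # Assumed fields; the real dataclass lives in the sample.
    greeting: str
    name: str


class ServiceDownError(Exception):
    """Hypothetical error raised while the toy service is not ready."""


class TestService:
    def __init__(self, key_fn: Callable[[], str], error_attempts: int = 5) -> None:
        # key_fn is a stand-in for reading activity.info().workflow_id.
        self._key_fn = key_fn
        self.error_attempts = error_attempts
        # One private counter per workflow ID; callers never see or pass it.
        self._attempts_by_workflow: dict = defaultdict(int)

    async def get_service_result(self, input: ComposeGreetingInput) -> str:
        workflow_id = self._key_fn()
        self._attempts_by_workflow[workflow_id] += 1
        # Modulo keeps the service usable if the same workflow ID polls again.
        if self._attempts_by_workflow[workflow_id] % self.error_attempts == 0:
            return f"{input.greeting}, {input.name}!"
        raise ServiceDownError("service is down")


async def poll_until_ready(service: TestService, input: ComposeGreetingInput):
    """Call the service until it answers; return (number of calls, result)."""
    calls = 0
    while True:
        calls += 1
        try:
            return calls, await service.get_service_result(input)
        except ServiceDownError:
            # A real poller would sleep or rely on activity retries here.
            await asyncio.sleep(0)
```

The point of this shape is that the activity still calls `get_service_result(input)` with no attempt count, while pollers with different workflow IDs no longer disturb each other's counters.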

Co-authored-by: Dan Davison <[email protected]>

drewhoskins-temporal assigned Sushisource Nov 21, 2024

drewhoskins-temporal force-pushed the drewhoskins_polling branch from 1573d7c to 0a183ab Compare November 21, 2024 00:57

drewhoskins-temporal assigned dandavison and unassigned Sushisource Nov 21, 2024

drewhoskins-temporal commented Nov 21, 2024

tests/polling/infrequent/workflow_test.py (resolved)

dandavison reviewed Nov 21, 2024

polling/test_service.py (outdated, resolved)

dandavison reviewed Nov 21, 2024

polling/test_service.py (outdated, resolved)

drewhoskins-temporal force-pushed the drewhoskins_polling branch 2 times, most recently from 9441578 to b054fd8 Compare November 21, 2024 02:29

dandavison reviewed Nov 21, 2024

dandavison mentioned this pull request Nov 21, 2024

Fix polling sample 2 #152

Merged

drewhoskins-temporal and others added 4 commits November 22, 2024 18:07

Fix Infrequent Polling sample

fb2f927

poe format

0204af5

Add __init__.py

61cb1bf

Apply suggestions from code review

f77df0a

Co-authored-by: Dan Davison <[email protected]>

dandavison force-pushed the drewhoskins_polling branch from b054fd8 to f77df0a Compare November 22, 2024 23:07

dandavison closed this Nov 25, 2024
