Refactor Schedule Many #49

mjp41 · 2024-09-17T13:58:06Z

The code in schedule many has been extended by two features:

Both of these add complexity, and the schedule_many code now does a lot of work. It was originally factored into two phases:

acquire phase
release phase

This effectively replicates the 2PL (two-phase locking) that occurs on enqueue. However, the acquire phase now does a lot of work, which both hurts readability and potentially lengthens the time spent in the 2PL, reducing potential throughput.

I propose we split schedule_many into four phases:

prepare phase - Create all the chains that should be added to the DAG, and do all the calculation that can be done before the first exchange.
acquire phase - Exchange in the set of segments, and track any segments that had no predecessor.
release phase - Mark slots as completed to enable subsequent work to enqueue.
process phase - Resolve any segments that had no predecessor.
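
The four phases above could be sketched roughly as follows. This is a minimal illustration, not the runtime's actual code: the `Cown` and `Slot` types, their fields, and this `schedule_many` signature are all hypothetical and heavily simplified, and the sketch assumes the caller passes distinct cowns and owns the slot storage.

```cpp
#include <algorithm>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <functional>
#include <thread>
#include <vector>

// Hypothetical, simplified types; not the runtime's real data structures.
struct Slot;

struct Cown
{
  std::atomic<Slot*> last{nullptr}; // most recently enqueued slot
};

struct Slot
{
  Cown* cown = nullptr;
  Slot* prev = nullptr;              // predecessor seen by the exchange
  std::atomic<bool> released{false}; // set in the release phase
  bool processed = false;            // set in the process phase
};

// Enqueues one slot per cown; `slots` is caller-owned storage of the same
// size as `cowns`. Returns how many slots had no predecessor.
std::size_t schedule_many(std::vector<Cown*> cowns, std::vector<Slot>& slots)
{
  // Prepare phase: touches no shared state. Sort so that every scheduler
  // thread acquires cowns in the same order, and allocate up front.
  assert(slots.size() == cowns.size());
  std::sort(cowns.begin(), cowns.end(), std::less<Cown*>());
  for (std::size_t i = 0; i < cowns.size(); i++)
    slots[i].cown = cowns[i];
  std::vector<Slot*> no_predecessor;
  no_predecessor.reserve(slots.size());

  // Acquire phase: exchange each slot in and track those with no
  // predecessor. Only the exchange and the wait happen here.
  for (Slot& s : slots)
  {
    s.prev = s.cown->last.exchange(&s, std::memory_order_acq_rel);
    if (s.prev == nullptr)
      no_predecessor.push_back(&s);
    else
      while (!s.prev->released.load(std::memory_order_acquire))
        std::this_thread::yield(); // predecessor has not released yet
  }

  // Release phase: mark slots as completed so later work can enqueue.
  for (Slot& s : slots)
    s.released.store(true, std::memory_order_release);

  // Process phase: deal with the segments that had no predecessor.
  for (Slot* s : no_predecessor)
    s->processed = true; // a real runtime would resolve the segment here
  return no_predecessor.size();
}
```

In this sketch, sorting and allocation are in the prepare phase and the handling of predecessor-free slots is in the process phase, so the span between the first exchange and the last release contains only the exchanges, the waits, and the stores.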

I believe this factoring will help with both readability and performance, as it moves work before or after the critical acquire/release phases, reducing the time spent blocking other scheduler threads.

I believe this factoring would also make it easier to address the missing feature interactions between the two PRs mentioned above.

@vishalgupta97 @marioskogias, you have both recently touched/reviewed this code. Does this refactoring make sense to you?


mjp41 · 2024-09-23T20:40:48Z

After discussing with @marioskogias, we should probably call the last phase the "resolve phase".

vishalgupta97 · 2024-09-24T18:22:52Z

Yeah, the refactoring looks good. Small points:

Optimisation opportunity: maybe free the extra references on the cown in the prepare phase (if any). The opportunity is limited, though, as we can't know the exact ref count until after the exchange.
Maybe the cown's reference counting can be removed from the schedule_many/release functions and moved into separate functions. For reads, it can be done in add_rcount and del_rcount. Similarly, there can be a wrapper around the xchg (in schedule_many) and the CAS (in release) to do the ref counting. This would reduce the bug surface that comes from incorrect ref counting.
Acquiring read/write cowns across bodies (using the + operator) is not implemented yet.
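
As a rough illustration of the second point, the wrappers around the xchg and the CAS might look like the sketch below. Everything here is an assumption made for the example: the `Cown` and `Slot` types, the names `enqueue_exchange` and `release_cas`, and in particular the policy that the queue holds exactly one reference on the cown while it is non-empty. The runtime's real ref counting rules may differ.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>

// Hypothetical, simplified types; not the runtime's real data structures.
struct Slot
{};

struct Cown
{
  std::atomic<std::size_t> rc{1};   // reference count, starts with the owner
  std::atomic<Slot*> last{nullptr}; // most recently enqueued slot
};

// Wrapper around the exchange used when enqueuing. Assumed policy: if the
// queue was empty, it now takes one reference on the cown.
Slot* enqueue_exchange(Cown& c, Slot* s)
{
  assert(s != nullptr);
  Slot* prev = c.last.exchange(s, std::memory_order_acq_rel);
  if (prev == nullptr)
    c.rc.fetch_add(1, std::memory_order_relaxed);
  return prev;
}

// Wrapper around the CAS used when releasing. If `s` is still the last
// slot, the queue becomes empty and drops its reference. Returns false if
// another slot was enqueued behind `s`, in which case the count is untouched.
bool release_cas(Cown& c, Slot* s)
{
  assert(s != nullptr);
  Slot* expected = s;
  if (!c.last.compare_exchange_strong(
        expected, nullptr, std::memory_order_acq_rel))
    return false;
  c.rc.fetch_sub(1, std::memory_order_acq_rel);
  return true;
}
```

With the count adjusted only inside these two functions, callers in schedule_many and release never touch `rc` directly, which is the reduction in bug surface the comment is after.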

mjp41 mentioned this issue Sep 25, 2024

Read-only cown v2 #45

Merged

marioskogias mentioned this issue Sep 27, 2024

Split schedule many #54

Merged
