On-the-fly load balancing #1269
-
Dear OceanParcels team, I've recently been using Parcels and really enjoying it; congratulations and keep up the good work! I'm interested in plastic pollution and computational models, and one of the main features that piqued my interest in the framework was the MPI parallelization scheme, which I was eager to try! I created a simple benchmark that advects particles in the South Atlantic for 90 days, and I wanted to test how the parallel implementation scales up. I noticed a trend of increasing memory usage for longer advection periods, just as the docs state in the MPI section. This is most likely caused by the dispersion of the particles and the resulting need for larger chunk sizes, which in turn causes load imbalance across the MPI processes. Here are my recorded results for 10^6 particles:
I checked previous issues and PRs, and saw that the initial distribution is done through K-means, but the on-the-fly redistribution is yet to be implemented. I'm very interested in contributing to and improving this great package, so I would like to test a few spatially aware load-balancing algorithms. One of my questions is: how can I manually re-assign the group of particles that each processing unit should take in its ParticleSet? I noticed that the initial distribution is done in the ParticleCollectionSOA constructor. Thanks in advance.
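For reference, here is a minimal sketch of the kind of benchmark script described at the top of this comment, assuming the Parcels v2-style API; the file name, NetCDF variable/dimension names, and release box are hypothetical placeholders:

```python
from datetime import timedelta
import numpy as np
from parcels import AdvectionRK4, FieldSet, JITParticle, ParticleSet

# Hypothetical hydrodynamic input; variable and dimension names depend on the dataset.
fieldset = FieldSet.from_netcdf(
    "south_atlantic_uv.nc",
    variables={"U": "uo", "V": "vo"},
    dimensions={"lon": "longitude", "lat": "latitude", "time": "time"},
)

# Release 10^6 particles uniformly in an (illustrative) South Atlantic box.
npart = 1_000_000
pset = ParticleSet(
    fieldset=fieldset,
    pclass=JITParticle,
    lon=np.random.uniform(-40, 10, npart),
    lat=np.random.uniform(-40, -10, npart),
)

# Under MPI (e.g. `mpirun -np 8 python benchmark.py`), Parcels splits the
# particles over the ranks before running the advection loop.
pset.execute(AdvectionRK4, runtime=timedelta(days=90), dt=timedelta(minutes=10))
```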
-
Thanks @joaoluisro, for initiating this discussion! It would be great if you could further explore on-the-fly load balancing, which is indeed something that has been on the Parcels development wishlist for a while. I think the simplest implementation of on-the-fly load balancing would be to move the code for the KMeans distribution into its own method (e.g. ParticleCollectionSOA.loadbalance()) and then call that immediately after the code that writes out particle locations in ParticleSet.execute(), with an if-statement to test whether the particles are too unbalanced. To avoid running this test too often, you could also introduce a new balancedt argument to pset.execute() that only does the test at that interval. Does that help? Can you try something with this?
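As a rough illustration of what such a loadbalance() method could do, here is a self-contained sketch assuming mpi4py and scikit-learn. It mirrors the gather-cluster-broadcast idea but is illustrative only, not the actual Parcels code:

```python
import numpy as np
from mpi4py import MPI
from sklearn.cluster import KMeans

def loadbalance(lon, lat):
    """Return the global indices of the particles this rank should own."""
    comm = MPI.COMM_WORLD
    rank, size = comm.Get_rank(), comm.Get_size()

    # Gather all particle positions on the root process ...
    coords = np.column_stack([lon, lat])
    gathered = comm.gather(coords, root=0)

    # ... cluster them into one spatially compact group per MPI rank ...
    if rank == 0:
        allcoords = np.concatenate(gathered)
        labels = KMeans(n_clusters=size, n_init=10).fit_predict(allcoords)
    else:
        labels = None

    # ... and broadcast the labels so every rank knows which particles it owns.
    labels = comm.bcast(labels, root=0)
    return np.where(labels == rank)[0]
```

Fitting KMeans only on the root and broadcasting the labels (rather than fitting independently on every rank) avoids ranks disagreeing about the partition due to KMeans' random initialization.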
-
Hi @erikvansebille, I implemented a basic prototype following your advice: a routine in the collectionSOA object that periodically re-assigns the ParticleSet data after a pre-determined number of iterations, right after writing particle locations. It does this by gathering the whole data set in the root process and broadcasting it back to the assigned processes based on a partitioning method. For testing I reinstated KMeans and also kept the initialization procedure, which seemed to have its own condition. In some experiments I got bad terminations from MPI with exit code 9, which implies incorrect memory management in my implementation. I also got a lot of … I am still very unfamiliar with the codebase and the overall Parcels structure, so I have a couple of questions:
Thanks in advance.
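The prototype above rebalances on a fixed iteration count; the if-statement Erik suggested would instead rebalance only when an explicit imbalance test fires. A minimal, self-contained sketch of such a test (the threshold is an arbitrary choice, not anything from Parcels):

```python
import numpy as np

def too_unbalanced(particles_per_rank, tol=0.25):
    """True if the busiest rank holds more than (1 + tol) times the mean load."""
    counts = np.asarray(particles_per_rank)
    return counts.max() > (1.0 + tol) * counts.mean()

# Example: one of four ranks has accumulated far more particles than the rest.
print(too_unbalanced([250_000, 240_000, 260_000, 400_000]))  # True
```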
-
As someone who took early stabs at optimal particle positioning, I would also monitor the time the processes are waiting for disk reads to complete. This strongly interacts with how the ocean data files that Parcels is reading are chunked (especially if the data is compressed). You will find much better performance if the input files are chunked (I can send you ncks scripts to do so). The details depend on the problem, but for the Mercator 1/12th-of-a-degree global model I find that chunking x and y by 256 and z by 1 works best. When you chunk, the code only reads the part of the data file that contains the needed data. Thus, if the particles tracked in each MPI process are spatially nearby, the IO time will be reduced.

This quickly becomes a complex problem space, with the optimal partitioning depending on how the data is stored.
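(The ncks scripts mentioned above are not in the thread; as one possible alternative, here is a hypothetical rechunking sketch in xarray along the suggested lines, 256 in the horizontal and 1 in depth. File, variable, and dimension names are placeholders.)

```python
import xarray as xr

ds = xr.open_dataset("mercator_uv.nc")  # hypothetical input file
target = {"time": 1, "depth": 1, "latitude": 256, "longitude": 256}

encoding = {}
for name, var in ds.data_vars.items():
    # Clip each chunk size to the dimension length so short dimensions still work.
    chunks = tuple(min(target.get(d, ds.sizes[d]), ds.sizes[d]) for d in var.dims)
    encoding[name] = {"chunksizes": chunks, "zlib": True, "complevel": 1}

ds.to_netcdf("mercator_uv_chunked.nc", encoding=encoding)
```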
Jamie Pringle
…On Tue, Dec 20, 2022 at 9:28 AM Erik van Sebille wrote:
Thanks @etodt, for following up. Yes, peak memory use and (especially) processing time would be the most important metrics.
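A small sketch (not from the thread) of how one might record those two metrics on every MPI rank, assuming mpi4py; note that ru_maxrss units differ by platform:

```python
import resource
import time
from mpi4py import MPI

comm = MPI.COMM_WORLD
start = time.perf_counter()

# ... run pset.execute(...) here ...

elapsed = time.perf_counter() - start
# ru_maxrss is reported in KiB on Linux, but in bytes on macOS.
peak_kib = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
print(f"rank {comm.Get_rank()}: {elapsed:.1f} s, peak {peak_kib / 1024:.0f} MiB")
```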