MPI distribution for an ocean model: what's the best way to deal with processes that are completely on land?

navidcy · 18 July 2023 15:57

@ssilvest is working on further optimizing the distributed Oceananigans. We came across the following connundrum. When we distribute the model across several MPI processes there are a few processes are have only land on them (see eg the schematic below for a 48x16 ranks configuration).

So the processes that correspond to regions that are completely over land do not contribute anything. While we don’t compute any tendencies for the points on land, still, we do load the corresponding MPI processes that ends up allocating a GPU (or CPU) for those regions. And that GPU/CPU ends up doing no work at all!

If there was a way to avoid starting an MPI process on those regions it would help. But it seems like a complicated logistically since then we’d have to skip some MPI processes and renumber the ranks and the way communication is done…

Question: How do we handle this in ACCESS-OM2 – if we are handling it at all? Or we just ignore it and just launch MPI processes on land?

cc @glwagner @Aidan @dougiesquire

micael · 18 July 2023 22:34

I’m afraid there’s no “simple” trick here. The solution is indeed to renumber ranks and change the way how you do communications. AFAIK, this is how it’s done in both MOM5 and MOM6.

But even that solution is not optimal, as you will still large load-imbalances because of the domains that intersect the coastline. To solve that, you need to allow for domain of arbitrary shapes. This increases the code complexity another notch, but based on my experience I would say it’s worth it. I don’t know if there are codes in fluid-dynamics that do this, but I’ve develop a finite-differences code that does precisely that, and its parallel scalability is extremely good.

angus-g · 19 July 2023 00:31

The FMS-based models, i.e. ACCESS-OM2 and MOM6, rely on the land mask preprocessing to solve the problem of not starting a process which would lie completely on the land. There’s a lot of (probably significantly overcomplicated) logic within FMS itself to then handle the domain connectivity.

There are indeed unstructured mesh models, like FVCOM and MPAS-Ocean. I also think that would have been a potentially nicer way to go about things, especially with the wealth of libraries for handling domain decomposition on unstructured meshes, etc. I do wonder how it would impact the ease and efficiency of analysis however…

micael · 19 July 2023 01:00

Interesting. What about codes using structured grids?

angus-g · 19 July 2023 01:12

Do you mean structured grids, but arbitrary shape domains? I’m not aware of anything, although that isn’t to say it doesn’t exist!

aekiss · 19 July 2023 05:26

CICE has the ability to subdivide the computational domain horizontally into tiles (termed “blocks”), and then parallelise by allocating several blocks to each CPU. This can improve load balancing if a similar number of ice-containing and ice-free blocks are allocated to each CPU. Land-only blocks aren’t allocated to a CPU at all. See Craig et al., (2015).

navidcy · 30 July 2023 13:43

Thank you all for these. Very useful.

Topic		Replies	Views
Session 5: Model development options CMIP7 Workshop workshop , cmip7	22	421	2 March 2023
Changing land-sea mask in ACCESS-ESM1.5 Working Group	34	530	4 September 2023
ACCESS-coupled N48 for deep paleo Earth System palaeogeography	44	819	25 August 2023
ACCESS-coupled N48 issue #1: change topography, bathymetry and land-mask Earth System	3	364	21 March 2023
Issue creating an idealised channel configuration mimicking the Antarctic shelves-margins Technical mom6	6	150	24 April 2024

MPI distribution for an ocean model: what's the best way to deal with processes that are completely on land?

Related topics