We ran the Score-P (‘scorep’) profiler on an MPI test run of the CABLE-POP model (aka the Canberra version). The profiler measures how much time the model spends in each subroutine. It showed that more than 80% of the run time was spent on communication between the master and the workers; in particular, the MPI_Recv call appears to be very time-consuming.
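For reference, the pattern being described (rank 0 collating results from the workers with blocking receives) looks roughly like the sketch below. This is not the actual CABLE-POP MPI driver, just a minimal C illustration of why a profiler attributes the master's waiting time to MPI_Recv; the field size, tag and time loop are placeholders.

```c
/* Minimal master-worker sketch (NOT CABLE's MPI driver) showing why a
 * profiler can attribute most of the runtime to MPI_Recv: rank 0 sits in
 * a blocking receive while it waits for each worker's fields.
 * Build with e.g.  mpicc recv_sketch.c -o recv_sketch
 * (or "scorep mpicc ..." to instrument it with Score-P).                 */
#include <mpi.h>

#define NFIELD 1000   /* placeholder size of one worker's output fields */

int main(int argc, char **argv)
{
    int rank, nprocs;
    double buf[NFIELD];

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    for (int step = 0; step < 10; ++step) {          /* pretend time loop */
        if (rank == 0) {
            /* Master: one blocking receive per worker per step.  The time
             * spent here grows with the worker count and with any load
             * imbalance among the workers.                               */
            for (int src = 1; src < nprocs; ++src)
                MPI_Recv(buf, NFIELD, MPI_DOUBLE, src, 0,
                         MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            /* ... master would collate / write the fields here ... */
        } else {
            /* Worker: stand-in for real local work, then send results back. */
            for (int i = 0; i < NFIELD; ++i)
                buf[i] = rank + 0.001 * i;
            MPI_Send(buf, NFIELD, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD);
        }
    }

    MPI_Finalize();
    return 0;
}
```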
Several years ago I was profiling CABLE; I think it was even pre-CMIP5, given the reason I eventually put it down. Even without POP, the MPI drivers were the main culprits (as you also found). This didn't surprise me a great deal: the I/O at the front end is where the model spends a huge chunk of its time, and then it keeps coming back to the head node to collate all the fields. It would be nice to have a reliable measure of how many extra cores still improve performance before everything just gets swamped by network traffic (a back-of-envelope take on this is sketched below).
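One very crude first estimate is Amdahl's law: treat the head-node work (I/O plus collating the fields) as a fixed non-parallel fraction f and see how quickly the speedup curve flattens as workers are added. The sketch below is only that back-of-envelope calculation; the f = 0.8 is just the communication share reported by the Score-P run used as a stand-in, not a measured serial fraction, and in reality the communication cost itself grows with the worker count, so this is an optimistic upper bound.

```c
/* Back-of-envelope scaling estimate (Amdahl's law), not a measurement of
 * CABLE itself.  If a fraction f of the runtime cannot be parallelised,
 * the best possible speedup on n cores is
 *     S(n) = 1 / (f + (1 - f) / n)
 * so extra cores stop paying off once (1 - f)/n drops well below f.      */
#include <stdio.h>

int main(void)
{
    const double f = 0.8;                 /* assumed non-parallel fraction */
    const int cores[] = {1, 2, 4, 8, 16, 32, 64};

    printf("cores  speedup  efficiency\n");
    for (size_t i = 0; i < sizeof cores / sizeof cores[0]; ++i) {
        int n = cores[i];
        double s = 1.0 / (f + (1.0 - f) / n);
        printf("%5d  %7.2f  %9.2f\n", n, s, s / n);
    }
    return 0;
}
```

With f = 0.8 the predicted speedup tops out near 1.25x no matter how many cores are added, which is consistent with communication dominating the profile.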
So, given I didn't really learn anything new there, I profiled the serial model. The model was spending a huge chunk of its time in “canopy”, which is no surprise as there are two major loops in it. It would be nice to improve on this, but we've never had the time. Coupled in ACCESS, the LSM only accounts for one to a few percent of the runtime, hence I put the issue down. The imbalance is likely even worse now that the UM has adopted ENDGAME dynamics, doubled the atmospheric resolution, etc.