It is resolved on my end. Just last time it happened Spencer went looking for the underlying cause (if there was one). Just thought you might want an extra example to check the logs if you want.
1 Like
Aidan
(Aidan Heerdegen, ACCESS-NRI Release Team Lead)
Assigned spencerwong
4
Assigned to Spencer so he can take a look when he is back from leave. Does not require resolution as such.
I’ve taken a look at the output logs. In each of the failing years, the collation job ran out of walltime when combining the MOM output files. The UM netCDF conversion is done after this step, and so never ran for these years.
For most years, the collation appears to have finished in ~20 minutes, but for some reason it didn’t complete within an hour for the failing years. If this problem keeps coming up, it might be worth increasing the collation walltime in the config.yaml: