Community Talks 1: Aidan Heerdegen (ACCESS-NRI) RRR: Reliability, Replicability, Reproducibility for Climate Models

Aidan · 30 August 2024 08:16

Community Talk: Aidan Heerdegen (ACCESS-NRI)

RRR: Reliability, Replicability, Reproducibility for Climate Models

Abstract

It is difficult to reliably build climate models, reproduce results and so replicate scientific findings. Modern software engineering coupled with the right tools can make this easier.
Some sources of complexity that make this a difficult problem:
Climate models are an imperfect translation of extremely complex scientific understanding into computer code. Imperfect because many assumptions are made to make the problems tractable.
Climate models are typically a number of separate models of different realms of the earth system, which run independently while exchanging information at their boundaries.
Building a climate model means building multiple completely separate models and their many dependencies, all with varying standards of software engineering and architecture.
Computational complexity requires high performance computing (HPC) centres, which contain exotic hardware utilising specially tuned software.
ACCESS-NRI uses spack, a build-from-source package manager that targets HPC, and which gives full build provenance and guaranteed build reproducibility. This makes building climate models easier and more reliable. Continuous integration testing of build correctness and reproducibility, model replicability, and scientific reproducibility eliminates a source of complexity and uncertainty. The model is guaranteed to produce the same results from the same code, or from modified code when the changes should not alter answers.
Scientists can be confident that any variation in their climate model experiments is due to factors under their control, rather than changes in software dependencies, or the tools used to build the model.
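As a rough illustration of the kind of reproducibility check described above, here is a minimal sketch that compares two output directories by file checksum. The function names and the choice of SHA-256 over raw file bytes are assumptions made for this example, not the ACCESS-NRI implementation. A real test would more likely compare checksums of the model fields themselves, since output files can embed timestamps or other metadata that differ between otherwise identical runs.

```python
import hashlib
from pathlib import Path


def file_checksum(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's bytes (hypothetical helper)."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large output files are not loaded whole.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def directory_checksums(root: Path) -> dict:
    """Map each file's path relative to root onto its checksum."""
    return {
        str(p.relative_to(root)): file_checksum(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def outputs_match(reference_dir: Path, candidate_dir: Path) -> bool:
    """True only if both runs produced the same files with identical contents."""
    return directory_checksums(reference_dir) == directory_checksums(candidate_dir)
```

A CI job could run the same configuration twice, or before and after a code change, and fail if `outputs_match` returns `False` when answers were not expected to change.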

Please use this thread for further discussion on this talk.

Aidan · 2 September 2024 23:17

Some of the relevant repositories mentioned in the talk:

Model configurations

Model deployment repositories

Build Infrastructure

mdsumner · 3 September 2024 05:57

really enjoyed this talk @Aidan! Looking forward to meeting hopefully tomorrow

Aidan · 3 September 2024 06:51

Thanks @mdsumner. Keen to chat!

Aidan · 9 September 2024 06:51

@rmholmes asked me afterwards if changing code related to model output diagnostics would be an example of a code change that would only change the minor version, i.e. not change the reproducibility of the model.

The answer to that question is YES! Code changes that only affect diagnosed outputs, e.g. fixing the units of a field, or adding an entirely new diagnostic, are not something you would ordinarily expect to change the reproducibility of a configuration.

Some examples of other changes that also might not change the reproducibility of a model configuration:

Some PBS run options: walltime
Metadata updates
Collation options
restart_freq
Diagnostic output options (changing diagnostic profiles)
Changing run time debugging/logging options
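To make the versioning rule concrete, here is a small sketch of how a version bump could follow from a reproducibility test result. It assumes a two-part `MAJOR.MINOR` version string and assumes that a change which does alter answers bumps the major version; neither detail is stated in this thread, and the function name `next_version` is hypothetical.

```python
def next_version(current: str, reproducible: bool) -> str:
    """Return the next configuration version.

    Assumed scheme: bump MINOR when answers are unchanged,
    otherwise bump MAJOR and reset MINOR to 0.
    """
    major, minor = (int(part) for part in current.split("."))
    if reproducible:
        return f"{major}.{minor + 1}"
    return f"{major + 1}.0"
```

Under these assumptions, a change to walltime or to a diagnostic profile on version `1.2` would give `1.3`, while a change that alters model answers would give `2.0`.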
