How do I start a new perturbation experiment?

Aidan · 9 December 2022 06:08

How do I start a new perturbation experiment?

User story

As a new user I want to be able to start a new perturbation experiment from an existing ACCESS-OM2 control run.

How do I identify which control run to choose?
Which experiment do I clone?
How do I know where to branch my experiment?
How do I determine which restart files to use for my chosen branch point?
Where can I find those restart files on disk?
How do I configure payu to use the correct restart files?
How do I know I’ve done things correctly?

Background

At the COSIMA meeting discussing the scope of an ACCESS-NRI release of ACCESS-OM2 a use-case that would be useful for ACCESS-NRI to assist with was getting new users up and running new experiments from existing control runs.

Current workflow:

Talk to supervisor
Ask data owner
Ask data owner
Ask data owner: they will tell user to consult the git log of the experiment repository which has the run number, and check in restart manifest for restart file directory
Ask data owner where restart files are
Ask supervisor who says read the access-om2 wiki

Aidan · 9 December 2022 06:13

I have made this a wiki, with the intention that it should be edited to better reflect the experience of a new user (which I am not). So feel free to dive in and change as required.

I encourage anyone to create more user stories, and not just related to COSIMA, but just put them in the correct category. User stories are a great way to capture workflows, how they might be blocked or inefficient and we can improve them.

ACCESS-NRI would like to use user-stories as qualitative measures of impact and improvement for the community. In many cases what we might do can’t be well measured in metrics, but the community will “just know” it is a lot easier than it used to be. User stories are a way to capture this improvement, by documenting the improvement in the workflow for a particular user story.

Aidan · 12 January 2023 04:18

@adele-morrison and @rmholmes Does this user story accurately reflect the “use case” discussed in that COSIMA meeting?

rmholmes · 12 January 2023 04:44

Looks good to me Aidan. One additional thing ~~that could be added to the first list~~ I’ve now added to the first list:

How do I know I’ve done things correctly (i.e. the only difference between my new simulation and the previous control simulation is what I want)? [side note: Are our simulations bitwise reproducible? I can’t remember.].

adele-morrison · 12 January 2023 04:58

[side note: Are our simulations bitwise reproducible? I can’t remember.].

Usually yes, but not guaranteed, see this discussion.

Aidan · 12 January 2023 05:17

Yes they should be bitwise reproducible in a deterministic sense: the same model configuration with the same inputs will produce the same outputs. Some of the models are known to be not bitwise reproducible if processor layout is changed for example, but that is a more stringent reproducibility criteria.

The issue @adele-morrison linked to is weird, and I would say anomalous, but we just don’t know why.

Aidan · 12 January 2023 05:20

I think we should be adding an initial step for every forked experiment where the experiment is forked and run without changes and confirm the outputs are unchanged. That confirms that when changes are made that the control run is a valid comparison.

rmholmes · 12 January 2023 06:01

Yes I agree. It’s pretty common to run the control simulation forward anyway as often the diagnostics that we want aren’t available.

Aidan · 12 January 2023 06:05

That’s an important piece of information too. Check the diagnostics required are present in the control. If they’re not, and you have to re-run, need to factor in the compute and storage required.

adele-morrison · 12 January 2023 06:23

Rerunning the control all the time will get very expensive for 1/10deg or higher res!

rmholmes · 12 January 2023 06:52

Yes, but it depends on how long your perturbation runs are. If you’re just running one perturbation then it doubles the cost (more perturbations it becomes relatively cheaper).

aekiss · 11 October 2023 01:49

Disconcertingly, we now have a 2nd example of a non-reproducible run. It’s unclear how often this occurs, as we don’t discover it unless we re-run and check the restarts.

The lesson is: always check the md5 hashes in the manifests in a re-run - they should match those from the original run (the only exception being ocean_barotropic.res.nc.* which always differ - but this doesn’t affect any other restarts so is presumably benign, maybe a datestamp in the file or something).

Topic		Replies	Views
Perturbation experiments forked from ACCESS-OM2-025 RYF control run on ik11 Ocean help , access-om2 , solved	20	116	5 November 2024
Helping new users run ACCESS-OM2 from an existing control run Ocean	8	121	12 March 2025
Easy forcing perturbation experiments in ACCESS-OM2 + Tutorial COSIMA access-om2 , perturbation	1	45	27 March 2025
Feedback requested: ACCESS-OM2 RYF 0.1-degree Configuration Workflow Updates COSIMA	23	132	12 December 2024
How to run ACCESS-OM2-RYF from an existing control run? Experiments help , access-om2	26	137	19 May 2025

How do I start a new perturbation experiment?

How do I start a new perturbation experiment?

User story

Background

Related topics