COSIMA TWG Meeting Minutes 2025

cbull · 14 January 2025 05:58

Minutes for 2025, please set new posts to ‘Wiki’ for easier editing.

Previous TWG minutes: COSIMA TWG Announce
2024 TWG minutes: COSIMA TWG Meeting Minutes 2024
2025 OSIT minutes: https://forum.access-hive.org.au/t/nri-ocean-meeting-minutes-2025/4053
2024 OSIT minutes: https://forum.access-hive.org.au/t/nri-ocean-meeting-minutes-2024/3700

anton · 15 January 2025 01:37

TWG meeting - 15 Jan

Attendees: @helen, @anton, @cbull, @aekiss, @Minghangli, @KieranRicardo, @Martindix, @Angus-g, @PaulLeopardi

Landmask error:
Kieran has been running the CM3 prototype with the new 0.25 grids

Some points in Africa look like they should be land and are currently ocean

@aekiss will follow up, and mask out those cells.

Interesting that it only came up in CM3 and OM3 didn’t give an issue.

Kieren will try smoothing within the mediator for runoff.

Mom symmetric:

For mom6 regional , the recommendation is to use MOM “symmetric memory” but for access-om3 global modelling we haven’t been using this (NCAR is doing the same). In non-symmetric, all arrays are the same size but for symmetric the velocity arrays are one larger because they include all four edges for all cells (rather than just the typical north and east edges). The mom-ocean tests check that symmetric and non symmetric results are the same, and rotating domains give the same results. The halos for each “rank” are large enough that the missing cells (default 3 cells) for velocity won’t impact the results of any calculation.

Angus suggested that we could check the mom-ocean tests are testing the same parameters we are using for access-om3. In an overall sense, symmetric and non-symmetric should be bitwise identical.

@anton: will do a bfb test using mom_symmetric vs current for om3, with plan to turn on mom_symmetric.
@helen has raised an issue for the change

OM3-0.25 Project Board - check in on this. No updates/blockers from anyone

Run-off

Anton has some fixes to the OM3 configuration to conserve run-off. See Runoff not conserved · Issue #231 · COSIMA/access-om3 · GitHub

Will use first order conservative remapping for moving from JRA grid to the ocean grid. Any new grids need to go to atleast 80 degrees south to capture all runoff.

To account for differing landmasks, Anton suggested just mapping any water in land cells to the nearest ocean cell. We noted the limitations of this, which include runoff isn’t moved to the mouth of any bay that is landmasked, it is just moved into the nearest ocean. There is also no spreading of run-off, we need to test if EPBL + vertical river mix is stable and sensible with this configuration. We will go ahead with the simplistic just mapping to nearest ocean cell, and see how the results look.

Pedro at UNSW has been inserting frozen runoff at depth in OM2 iceberg spreading projects

Speaker for the Community Symposium on Sea Ice Modelling

ACCESS has been invited to present, Andrew could say something for the last seven years? or maybe someone from CSIRO has more historical perspective.

@anton to ask how long is the talk and follow up

GitHub Desktop and GitKraken - @cbull

GitHub Desktop

Provides a clicky interface to cherry-pick commits, reorder commits, squash, create branches, commits etc
Also shows local changes and other git stuff

GitKraken does similar + shows history graphs

MOM6 testing on the access-nri branch

The automated testing errors are hard to follow for folks who are not the authors. In future, we should check that tests pass as they are committed.

These tests are failing on 2025.01 branch can be resolved when @dougiesquire / @ezhilsabareesh8 is back. Further details of outstanding issues here.

ACCESS-OM3 0.4.0 Release

New components versions - updated MOM6/CIC6 and CMEPS versions.

Andrew & Chris to review (details)

MOM6

Chris is to start attending MOM6 developer moeetings, they are on fortnightly. And Dougie will resume attending as well.

Access-nri model transformation team will start working on a MOM6 GPU implementation. Led by @edoyango and @micael

@claireyung has put her work to track the MOM6 panAn/regional ice shelf model development (see issues for current status)

COSIMA training program

Starting soon, program is here:

https://anu365-my.sharepoint.com/:x:/g/personal/u1164007_anu_edu_au/ESNybi3AdkFHrQJGyE3phtIBVhLEcqCw8craAhP4FBL8OA?e=WWsP69

Please let Chris know if there’s a particular session/topic you’d like to present on.

COSIMA talk upcoming (Thursday 13th Feb)

@minghangli present a brief introduction into the expt manager tool & save a more detailed tutorial at training program (23 May)
Need some content for the rest of the session (payu ?)

Andrew is Chair and Minghang in minutes for next TWG (29th Jan)

minghangli · 29 January 2025 06:43

TWG meeting - 29 Jan

Attendees: @micael, @CharlesTurner, @helen, @anton, @cbull, @aekiss, @Minghangli, @Martindix, @dougiesquire

Agenda can be found in this link COSIMA TWG Announce - #55 by aekiss.

ACCESS-NRI COSIMA training program and upcoming cosima ‘main’ training session - @CharlesTurner

For the Cosima meeting:

@minghangli will present a 5 minute overview of the parameter manager tool and reference his training session in May.
A 20-30 minute talk on new Payu features will be delivered by @anton, @Aidan, or @jo-basevi.

Training program outpline:

create users’ own datastore
some basics using intake
Q&A
searchable coordinates for the ocean files (current issues where grid-related searches are not indexed by Intake).

@CharlesTurner has almost finished the cli for making intake datastores. He’s currently writing the tests. He’s also working on the coordniate search within intake ESM, allowing catalog to support searchable coordinates.

For the training program, it is good to have breakout rooms to allow people to retain knowledge better and allow informal discussion. There may also people jump in to ask other questions related to intake, it is better to leave some time for that as well.

@CharlesTurner is also working on the model autodetection through CLI when buidling a catalog. @dougiesquire raised concerns about validation risks, as model names are not strictly prescribed.

Choice of compiler for ACCESS-OM3, moving to ifx - @micael

IFX provides better optimisation for Sapphire Rapids compared to classic Intel compilers.
For spack builds, when the release team has made available configuration to use IFX on Gadi, it is easy to make a switch by simply changing the name of the compiler used in the spack environment.
@MartinDix has tested UM with IFX a while ago and the performance is pretty much the same as with IFORT, and is considering moving CM3 to IFX.
All model components need to be compiled with IFX due to Fortran module compatibility constraints.

OM3 0.25deg project board
3.1 0.25deg ocean mask error

@MartinDix found differences on the tripolar edge between @aekiss and @ezhilsabareesh8 versions. @aekiss suggested waiting for @ezhilsabareesh8to return for further discussion.

3.2 diag_table
Discussions on diag_table daily / monthly output frequency and potential performance concerns when switching to daily output.
Ongoing discussions on managing diag_table and diag_table_source.yaml, where the yaml configuration file along with GitHub - COSIMA/make_diag_table: Python script to generate MOM diag_table is used to generate the diag_table. An issue is created Diagnostic table files management · Issue #259 · COSIMA/access-om3 · GitHub, where there will be further discussions there.

3.3 KPP and epbl with fixed runoff
@minghangli analysed salinity comparisons at the Amazon river mouth over time and across z* coordinates. Results show similar trends with KPP. The rivermixdepth parameter (40m or 20m) has minimal impact. Seasonal cycles are stronger, but salinity remains low (~0.02 PSU).
epbl with/without fixed runoff has velocity truncation errors, but KPP does not. Before the runoff fix, truncation errors appeared around Antarctica. Since runoff is a constant in time, errors may be circulation-related. @minghangli will provide a spatial plot of the new truncation errors with runoff fix. One thing to note, before runoff fix, truncations happened in a row of 10 years and seemed persistent.
@MartinDix mentioend that Kieran has been testing the horizontal spreading without epbl.

Next TWG is assigned: Chair: Dougie. Minutes: Andrew K. Andrew will give summary of this TWG at next COSIMA meeting.

aekiss · 11 February 2025 23:06

TWG meeting 12 Feb

Present: @aekiss, @anton, @MartinDix, @dougiesquire, @cbull, @minghangli, @manodeep

ACCESS-OM3-025 project board

ePBL

ML: Using latest Riechl et al (2024) ePBL parameters but we get truncation errors. To maintain runtime performance comparable to OM2, we’ve set a tracer timestep of 3 hours, which is longer than GFDL’s 2-hour timestep. He’ll test the alpha release (0.4.0) since it includes fixes with more numerically stable schemes that might help address this issue.
CB: ePBL preferred over KPP in COSIMA meeting
CB: what are other groups using?
ML: GFDL are using ePBL with same params
ML: currently tuning params but still not working

other items (project board)

Many items will be resolved with config updates

Repro issue

also on zulip

ML: OM3 build & config passed CI repro test but aren’t actually reproducing - CI test now fixed
but still have no repro between 0.3.0 and 0.4.0
also no repro in MOM6 standalone using MOM6 driver (not NUOPC, no sea ice)
default MOM6 parameters changed but have been fixed.
DS: only one of the long list of parameters is different, and this change won’t affect our model
ML: still no repro even with parameter issue fixed
DS: how did this get through the MOM6 repro testing?
DS: unclear whether we expect MOM6 repro given the version change - would have to check through all the
repro tests don’t even run on 0.4.0 config due to truncation
AK: would be good to have a way to tell if we expect repro between any 2 commits
MS: is run deterministic?
ML: yes
DS: do we want to dig into this to find out why it all changed
CB: would want to know whether we expect reproducibility. If repro not expected, do longer run to see if results are plausible. But gap between the 2 MOM6 version, so not a good use of our time to dig into all those commits.
DS: but how to know if repro is exected with digging into commits
CB: did Marshall mention this a month ago?
DS: might have been discussed at a MOM6 dev meeting we didn’t attend - ask Angus?
AS: ask Marshall
also go back to 0.3.0, add one PR and check whether it’s something in our process that is breaking things
CB: ok will ask Marshall
DS: also check ML’s standalone runs to see if they reach the same conclusions.
This problem will crop up with other components
AS: we could have CI run 1deg for 20yr on every update

MS: for every CI, make it fail to check it works

DS: there are many bugfixes in MOM6 that are turned off by default for repro but which we should turn on, breaking repro

CB: could any of the patches on patches involved in 0.3.0 → 0.4.0 be a problem?
DS: possibly, but unlikely, and not in

MOM6 dev meeting on Tuesday

CB: notes on zulip

bug discovered (doesn’t affect us)

update to MARBL, will change cap, may affect WOMBAT - asked us to check

Marshall gave presentation on GPU work - see link to his notes. Impressively fast progress, eg pressure solve running on GPU. Targeting momentum solver first.
Our software team also contributing
Ed has long todo list
Some things in specified GPU coding style are unsupported by hardware; hard to get vendor support; NVIDIA won’t look at code due to license use (LGPL) - looking to move to more commercially-friendly Apache, which does not oblige disclosue of code changes (see table here). Asking ~90 contributors to approve license change.

DS: Generic tracer: code moving out of mom into ???, may affect us.

DS: Next set of changes will alter defaults - need to keep an eye on this for repro.
AK: good reason to storeMOM_parameter_doc.* in repo
DS: release team suggest doing via payu
MS: do via pre-commit hook to run model for 1 timestep?
AS: then add repro CI test to fail on diff between these files
MS: set up cron to regularly check?
DS: but want to know immediately that defaults have changed
MS: belt and braces - cron job to pick it up in case repro test was forgotten in commit
AS: on release there are more stringent tests than commit, so that would pick it up too
AK: would be nice to also be able to do this with CICE
DS: are we talking about just committing the docs (easy, we can do in payu) or CI repro test against branch to merge in (harder, involved release team)
CB: would be happy to have a go at this but AS is probably better positioned, AS will write something and CB can help review/have a chat.

COSIMA twg update tomorrow

CB: Dougie to give TWG update to Thurs COSIMA meeting
ML: increase tracer timestep may be part of problem with ePBL; reducing from 3 to 2hr (matching GFDL) fixes truncation errors, but performance is worse. Truncation occurs at particular places around Antarctica.
DS: truncations in 0.30 and 0.4.0?
ML: haven’t tested in 0.4
DS: one of the new MOM6 changes (off by default) helps improve model stability
ML: should I discuss this at CSIMA meeting tomorrw, or spend some time working on it first?
CB, AK: might be better to discuss ePBL in offline meeting with the few people who can give good input
ML: Wilton found ePBL performed similarly to KPP but without vertical resolution dependence

Timestep at 1 deg

DS: mom dynamic timesetep is quote short (30 min) and also differs from coupling, unlike 0.25
DS: probably inherited from CESM

Release team meeting update (Tommy, Aidan, Lachlan, Spencer, Jo, Dougie and Chris)

DS: CI to automatically create diffs (like the ones in README) between PR branches and all config branches. DS is coordinating. Some interest from Cable/land teams.

Bluelink invite

Invitation to present at Blue Link in late March on OM3 development and/or high-resolution development.

To be discussed offline

NCRIS/board update

CB: due Friday. @minghangli please write an update on the OM2 new control experiments.

ACCESS-NRI COSIMA 2025 training program

CB: starting next week on Fri 21st Feb (discussion link, draft program). What’s the status AS?
AS: may need to use xp65 env. Will have some slides. Possibly a notebook

Next time…

@dougiesquire on for the agenda for next Wednesday’s OSIT and for the next TWG: Chair: Anton. Minutes: Dougie.

cbull · 27 February 2025 00:25

Date: February 26, 2025
Present: @AndyHoggANU, @aekiss, @MartinDix, @sofarrell, @ezhilsabareesh8, @minghangli, @anton, @helen
Apologies: @dougiesquire
Chair: @anton
Minutes: @cbull

Key points

non nuopc MOM6-standalone

Some existing executables are about a year old and broken (SIS and FMS but being used as standalone – other components are turned off at run time). Helen is currently suggesting people try executables where it’s build and origin is unknown (possibly one Angus, one Micael but Andy H/Andrew K unsure).

Question: what is the best way for the community to proceed? Anton: can we ask the community to just use NUOPC? Helen: yes, long term, but current notebook workflow does not. Next actions: ask Angus to make a new one that works/we have provenance, point towards NUOPC instructions, ask NCAR if instructions can be more generalised (consider abstracting access-nri/ncar parts).

025 minimum depths (Kieran)

Was discussed at the CM meeting this afternoon, related issue

Bathymetry at the moment is based on the top level 4 layers, gives minimum depth of 5m but runtime option is just as a configuration change (which would be applied everywhere) and create a disconnect between input bathymetry and what the model does.

Kieran asked Andrew about the hand edits discussed here. Andrew: thought that many were unlikely to be needed here. Andy H: integrated salinity restoring over the Persian Gulf/Red Sea in the 0.25 might tell you that the sill depth isn’t deep enough (concept: can be inferred from ocean model run and then avoid one having to run the CM). Andrew: there are differences in the two setups that could make it more complicated (e.g. rainfall and run-off fluxes being in different locations).

Minghang re-showed the “salinity restoring flux” plots to consider Andy’s suggestion. Minghang: note OM2 had a restoring cap (not present in om3). Andy/Andrew/Siobhan: next action, run the CM model further to see how the restoring we are seeing in 025 OM plays out. (i.e. current restoring plots are inconclusive and it’s likely faster to just run CM)

OM3-025 project board & efficiency blockers (Chris)
Re: efficiency blockers for OM3-025 release.
Anton: too many side projects (e.g. ice shelves, training program, wombat etc)
Andy: would like to see the sensitivity tests run out and shared with the community asap (sensed this was imminent a few months ago?)
Chris: thinks there has been an over emphasis at this stage on bitwise repro’.

Side follow ups:
Minghang: stuck with epbl but is running parallel sensitivity tests. Andrew are these runs in a shared place with scripts that the community could access? Minghang: not currently but will look into it.
Andrew: bitwise repro is important as you say in some contexts and there’s a time for that. However, restart repro’ is still broken? Chris: still being looked into.
Ezhil: will other parameter tests require changes in the bathymetry? Andy: it’s iterative (helps to get community input early!).

CICE+WW3 regular international meetings ? (Anton + )
Ezhil: met with Luke, Alessandro, Alberto, Noah, Siobhan at Uni Melb last week and presented the status of the current model. There is community interest in using/developing the model! Chris suggested we could have a semi-regular meeting with the core group and a wider (less regular) meeting from the wider community. Anton: do you imagine targeting the development community/science focused outcomes. Ezhil’s interest, focus first on the communities that can benefit our WW3 development.

MOM consortium update (Chris)
See here for long summary. Chris gave brief update on:

change MOM6 to use the Apache license? (Such a decision must be unanimous.)
Changes to the NUOPC cap to permit the use of coupled generic tracers and coordination with other generic tracer interface changes
Sharing the ACCESS Cmake build system for MOM6

OM3 evaluation paper in terms of OM3 025 release? (Chris)
Chris: what’s the plan? How do people cite OM3 025 once it’s released? Do we wait till high-res om3 before writing eval paper?
Andy: historical perspective re: om2 was that the grant funded the high-res model so all configs were developed at the same time. In this instance, we should consider writing up the configs as they become available. E.g. om3 025 could have it’s own evaluation paper.
Andrew: last time tech doc’ was used as a starting point.

“How to do a perturbation experiment in ACCESS om2” and Aidan’s guinea pig (Chris) link
Chris: would think this session will have more community value if co-presented by the community. Andrew: what is being perturbed? Perturbation to the forcings are common (run-off, atmosphere) but involved. Chris: imagining a brief overview to some of the more common kinds of perturbations (perhaps om2 architecture session would come beforehand?). Andrew and Chris can follow up offline. Anton: consider asking community folks e.g. @hrsdawson?

International Travel in 2025
Chris: exec want to know if anyone has any planned? Please let Chris know.
No responses.

Shifting some dates on the training program re: Anzac day. (Chris)
No complaints.

@Andrew. Bluelink meeting, do Helen and Chris start booking travel?
Andrew: yes can book. Would perhaps be good to coordinate. Chris: Yes, let’s do that on Zulip.

Next week is an OSIT, Chris is rostered. (Dougie will take minutes in two TWG’s time.)

anton · 13 March 2025 03:00

Date: March 12, 2025

Attendees:

@anton
@dougiesquire
@minghangli
@ezhilsabareesh8
@MartinDix
@aekiss
@AndyHoggANU
@helen
@cbull

Meeting Schedule:
Most attendees will be at Bluelink meeting next week.

Apologies - we forgot to send an invite this week.

OM3-025 Project Board Priorities:
Discussion on the priorities for the OM3-025 release.
Addition of a Docs issue to the project board.
We are close to doing an alpha release and there was some discussion about what (if any) changes are needed before this point. Anton is advocating github should reflect what we have been testing with - initial conditions / grids / bathymetry / run-off fix. We will also allow extra truncations with this.
We talked about the changing MOM parameters - eg dt therm , KPP etc but decided only change dt_therm (MOM tracer timestep) in dev-025deg_jra55do_ryf config and leave KPP as is. Previous similar configs have been run for 30 yeas with current KPP and dt_therm.

Users - Andy would advocate a group of testers for the release. Chris suggested maybe there will be an OM3 group at how to run a model training sessions next week. Minghang has a list of sensitivity testing that maybe users could engage with.
Need to stress to uses what alpha release means re support and not-for-science use
@Cbull will make docs for access-hive for OM3

OneAPI Compiler:
Anton is close to having a build ready with the OneAPI compiler.
Discussion on the need for performance checks and the potential impact on stability. But ultimately we were happy to proceed to minimum sanity testing only as we don’t have a current baseline set of answers / climate to compare to and at this point in development changing answers is ok.

Mom Symmetric Issue:
Turning on Mom Symmetric (in 2025.01.0) breaks restart reproducibility in the main configs and breaks determinism in BGC configs. We have reverted this change for now, but it is needed for regional work. We will provided this as a pre-release in the shorter term, with a plan to investigate and fix the issue, with Dougie Squire leading the investigation.

Upstream Model Component Updates:
We reviewed changes to upstream model components since 2025.01.0 release, and these appear fairly small. We have recently built the model without trouble with latest CMEPS/CDEPS/share. We can just update this now, but probably wouldn’t do a release.

Blue Link Meeting Preparation:
Andrew Kiss and Helen will present at the Blue Link meeting.
Discussion on the content of the presentation, including grid and topography updates, dynamic tracer time step, and potential collaboration with Blue Link.

Action Items:

@dougiesquire: Investigate the Mom Symmetric issue.
@cbull: access-hive style docs for ACCESS-OM3 (025 ryf)

ezhilsabareesh8 · 26 March 2025 07:19

Meeting Summary & Minutes

Date: March 26, 2025
Attendees: @cbull, @dougiesquire, @anton, @aekiss, @AndyHoggAnu, @pearseb, @helen, @MartinDix, @spencerwong

ACCESS OM3 Config Docs
Chris demonstrated the config documentation system. The docs are versioned, written in Markdown for easy transferability, and will support citability with an automatically updated DOI upon each config release. Chris also proposed generating a full PDF version. The implementation involves a repository with a structured skeleton where the actual documentation is stored in Markdown files.

Current ACCESS OM3 config docs are located here: access-om3-configs/docs at davide/docs-setup · ACCESS-NRI/access-om3-configs · GitHub

Related issues:

New Website Structuring Approach
Chris proposed a new website structure where docs live alongside the code, with the main branch holding general info and config details in separate branches, to minimise redundancy and supports community edits.

Repo Search & Configs Discussion
Andrew explained the use of repo search links in the docs. Chris and Andy discussed dynamic linking, MathJax for equations, and a standardized config subdomain.

Chris emphasized proper work attribution and linking commits to work.

Versioning & Updates
Chris and Andy highlighted the need for flexible versioning to handle updates across releases while ensuring accuracy for older configurations.

Restart Reproducibility
Dougie identified that the Mom symmetric restart reproducibility issue comes from changes ACCESS-NRI have made to the MOM NUOPC cap. Hopefully an easy fix.

Andy inquired about patches, and Dougie confirmed they are in a PR candidate branch, queued for upstreaming.

Incoming MOM6 PR from NCAR

There’s an open PR to MOM6 main that impacts OM3
Dougie has suggested changes to that PR. One accepted already, one hopefully accepted soon.

OM3 configs consolidation

All OM3 configurations are now in access-om3-configs.
access-om3-configs and access-om3-wav-configs were merged.
access-om3-wav-configs repo has been archived.
All issues have been moved from COSIMA/access-om3 to ACCESS-NRI/access-om3-configs
Moving issues to access-om3-configs sent an email to everyone that engaged in an issue for every issue.
Dougie suggested mentioning emails in COSIMA WG meeting tomorrow in case anyone is confused.

One API Release:
Anton plans to skip the old COSIMA build system and transition to the new one, as test results were identical.

Action Items

Chris: ~~Create a release checklist template for the ocean team on the forum.~~
Dougie: Fix the restart reproducibility issue in MOM symmetric.

Note: I might have missed something, please feel free to edit.

ezhilsabareesh8 · 9 April 2025 06:34

Meeting Summary & Minutes

Date: April 9, 2025
Attendees: @cbull, @dougiesquire, @aekiss, @helen, @MartinDix, @sofarrell, @ezhilabareesh8
Minutes: @ezhilabareesh8

1. Restart Reproducibility Issue

Dougie shared an update on debugging restart reproducibility in MOM6. Initial suspicions around cap changes were ruled out, as the issue persists without them.

2. MOM6 dev meeting update

ACCESS-NRI was officially added to the reviewer list, increasing the number of reviewers from 6 to 7. An academic license may help expand the reviewer pool.
Large PRs are straining the review process; the team agreed to submit smaller, more frequent changes.
Plans to improve automated testing coverage were discussed.
GFDL will add FMAs to their test suite to check reproducibility.
Incoming PR from NCAR is now compatible with OM3, but will change answers.

3. Bottom Roughness Method Discussion

Minghang discussed the current bottom roughness method based on Jayne & St. Laurent (2001) and compared results with the OM2 file.
Siobhan noted discrepancies over the Indian Ocean compared to OM2.
The method is computationally slow and needs further testing.

4. Other updates

Chris provided updates on the beta release and plans to streamline the June release of the 25 km configuration.
Regarding staying in Canberra after the ACCESS-NRI staff retreat, suggestion is for OSIT team to fly back on Friday allowing Wed-Thursday for discussion.

ezhilsabareesh8 · 24 April 2025 02:00

Meeting Summary & Minutes

Date: April 24, 2025
Attendees: @cbull, @dougiesquire, @helen, @AndyHoggAnu, @ezhilabareesh8, @minghangli
Minutes: @ezhilabareesh8

GFDL PR Update: (Dougie)

The upcoming PR from GFD includes around 30,000 lines of changes. These changes need to be reviewed to ensure they integrate well with ACCESS-OM3.
Marshall is revisiting the Fortran do concurrent construct, which had issues a few years ago but has since seen improvements. This construct could make GPU porting more seamless.
A hybrid approach is being considered, using OpenMP for memory management and do concurrent for loop parallelization. This approach aims to manage memory more efficiently by specifying which arrays need to be passed to the GPU.
Testing is required to ensure the changes work do not alter results, and do not impact CPU runtime (likely that it will improve runtime performance given the barotropic solver performance changes). Could be worth considering using for the beta June release.

Restart Reproducibility Issues: (Dougie)

Challenges with restart reproducibility were discussed, particularly with the LEITH parameterization and the use of the use_leithy parameter, which turns on harmonic backscatter.
The combination of LEITH_AH = True and USE_LEITHY = False breaks restart reproducibility. One potential solution is to keep the LEITH parameterization and turn on harmonic backscatter, which Dougie has shown fixes reproducibility.
Another option is to move to Smagorinsky, which also fixes the restart reproducibility issue.
Dougie discovered that the checksums written into the MOM restart file are not reliable for checking if answers have changed, due to differences in negative and positive zero representations.

Truncation Errors: (Minghang, Ezhil and Chris)

Minghang Li reported ongoing truncation errors, particularly near the Kara strait, Kerguelen Plateau and West Antarctica.
Various tests have been conducted, including changes to atmospheric forcing and bathymetry adjustments, and using GFDL’s OM5 MOM_input parameters.
Reducing the time step to 900s is recommended to see if fixes the truncations.
The truncation errors tend to occur at high latitudes with small grid cells. The team is considering whether to tolerate some level of truncations or implement local engineering fixes.

Configuration Updates and Documentation: (Chris)

Chris mentioned the deployment of new configuration documentation and requested no further additions to the old Wiki.
Plans for a more permanent URL and integration with ACCESS-NRI documentation are underway.
The new documentation will include general information about ACCESS-OM3 configurations, with specific configuration details provided in separate sections.

GitHub backup: (Chris)
Chris outlined strategy that the team have discussed and are planning on implementing. @AndyHoggANU suggested that it would be okay if COSIMA was included in the routine backup but is yet to be implemented by Micael, Aidan etc.

ezhilsabareesh8 · 8 May 2025 01:10

Meeting Summary & Minutes

Date: May 7, 2025
Attendees: @cbull, @dougiesquire, @helen, @ezhilabareesh8, @minghangli, @MartinDix
Minutes: @ezhilabareesh8

Agenda:

Mom Fork Update
Mom 6 Dev Call Update
Restarting OM3 25k Control Run
Historical Repro and Component Layout Changes
New Build System and Upcoming Control Run
Upstream Model Component Update

Mom Fork Update (Dougie)

Changes have been made in managing the MOM fork. All users should familiarize themselves with the new documentation in the wiki.
Feedback is important to continuously improve the workflow.

Mom 6 Dev Call Update

Mentioned the default parameter changes in the pipeline and the opportunity for feedback.
Proposal to move SIS2 into the MOM6 repo to simplify coupling work.
Los Alamos is clearing up copyright and licensing issues, allowing NOAA to participate in CICE development.

Restarting OM3 25k Control Run

Issues when restarting with the updated bathymetry, which has fewer ocean cells than the old one, causing errors during restarts.
Suggested testing both 2025.3.13 and 2025.3.25 bathymetries to better understand the errors and following the OM2 wiki method on updating restarts for new bathymetry, as documented here
To run standalone MOM6 tests to confirm the source of errors.

Historical Repro and Component Layout Changes

Minghang Li: Noted that changing the component layout (number of cores) changes the answers, even after disabling the auto-masker table.
Dougie Squire: Suggested that changing the layout within MOM should not change answers, but it needs further testing.

New Build System and Upcoming Control Run

Chris: Discussed the plan to use the new build system for the upcoming control run and sensitivity tests.
Anton: Confirmed that load balancing is runtime configuration and does not require a new build.
Dougie Squire: Mentioned to wait for the current MOM PR to be merged before updating.

Upstream Model Component Update

Anton: Plans to follow the convention from the last update, mainly looking at versions used by CESM.
Dougie Squire: Suggested waiting for the current MOM PR to be merged to avoid unnecessary updates.

ezhilsabareesh8 · 22 May 2025 01:13

Date: May 21, 2025
Chair: @cbull
Attendees: @cbull, @dougiesquire, @helen, @ezhilabareesh8, @anton, @MartinDix, @AndyHoggAnu, @PaulSpence, @aekiss, @sofarrell

Ocean Team Survey

Ocean team survey is available here
All staff, including those not formally in the ocean team, are encouraged to complete it.
Purpose: To gather feedback on team collaboration and workflows. Responses will inform potential changes.

Model Development Meeting Planning

Config Documentation Presentation (Anton): To support other teams needing configuration documentation.
Release Definitions Discussion: Clarify terms like alpha, beta, and release across the organization.
Config documentation discussion should be delayed until the documentation team finalizes a template.
The release discussion could be informal and framed as sharing the ocean team’s experience.
Potential to invite someone from the release team to participate.

Work Plan and Community Engagement

Hive forum discussion on workload and priorities is here
SAC Feedback: Emphasis on prioritizing an 8 km resolution model.
Concerns Raised:
- Risk of overcommitting to an 8 km alpha release by end of FY26.
- Need to balance 25 km evaluation (for CM3) with community interest in 8 km.
Considerations:
- 25 km model is foundational for CM3 and must be evaluated first.
- Community is more engaged with high-resolution Antarctic modeling (8 km).
- Discussion on whether to proceed with 25 km and 8 km in parallel or sequentially.

Evaluation Paper Planning

Purpose: To formalize model evaluation, engage the community, and provide documentation.
Scope Discussion:
- Whether to focus on 25 km, 8 km, or include other configurations (e.g., WaveWatch III, Wombat).
- Comparisons between OM2 and OM3, and potentially between OM3 resolutions.
- Whether to publish a peer-reviewed paper or a technical report with DOI.
Community Engagement:
- Paper could encourage ECRs to contribute.
- Need to ensure contributors to 25 km scripts are credited in future 8 km papers.
Action Items:
- Draft agenda for next week’s meeting to scope the paper.
- Use Hive forum here to coordinate and collect agenda items.

Alpha Release Status

Goal: Release a new alpha version this week to support Minghang’s work.
Status Update:
- Generic tracers: Complete
- Build flags: Pending confirmation
- MOM6 update: Awaiting merge
- MOM6 metrics: Delayed
Next Steps: Sprint planned to finalize configuration and documentation.

Mediterranean Sea Salinity Issue

Problem: Excessive salinity in the eastern Mediterranean in ESM 1.6 due to limited salt transport.
Findings:
- Bathymetry shows unrealistic depth and width at the Strait of Sicily.
- Maximum depth is only 50m in UV grid, compared to ~300m in ACCESS-OM2 01
Recommendations:
- Adjust Sicily strait depth and possibly width to improve flow.
- Compare salt transport and Nile runoff with ACCESS-OM2 01

Next Steps

Finalize alpha release
Paul to invite broader community to the evaluation paper scoping meeting.
Coordinate agenda and participation for evaluation paper meeting.

aekiss · 22 May 2025 01:27

Thanks @ezhilsabareesh8 - I’ve edited this to focus on the Strait of Sicily, not Gibraltar. @AndyHoggANU did I get the gist right?

cbull · 22 May 2025 03:42

@aekiss. Note there was also this follow up discussion.

minghangli · 11 June 2025 23:33

Date: 11 Jun 2025
Chair: @cbull
Minutes keeper: @minghangli
Attendees: @dougiesquire, @helen, @MartinDix, @AndyHoggAnu, @aekiss, @sofarrell

Agenda:

Sea Surface Height in MOM6: SSH vs zos
ACCESS-NRI workshop abstracts
compute use for the rest of the quarter

Meeting Minutes

New control run:

Configuration: 25km_JRA_RYF,
Build: 2025.05.001
Current runlength: 72 years completed (ongoing)

Sea Surface Height in MOM6: SSH vs zos

See previous discussion.
Agreement reached to use zos (CMOR-compliant variable) as the standard output.
zos includes corrections for sea ice inverse barometer and is most appropriate for sea level analysis and inter-model comparison. (Looked like both ssh and zos have open ocean inverse barometer effects.)

ACCESS-NRI workshop abstracts

Abstract submission deadline is 15 June.
All are encouraged to submit abstracts with a focus on scientific outcomes and research impact. @AndyHoggANU notes that the program committee will triage so don’t worry about being too coordinated – science talks will be prioritised.
@kieranricardo is likely putting something in for CM3.
@aekiss is open to talking/presenting om3 (others are welcome to join in)
@cbull will start a zulip thread/poll.

Compute use for the rest of the quarter

Remaining allocation: 4.25 MSU

$ nci_account -P tm70

Usage Report: Project=tm70 Period=2025.q2
=============================================================
    Grant:     9.75 MSU
     Used:     5.42 MSU
 Reserved:    81.79 KSU
    Avail:     4.25 MSU

@minghangli to conduct three ~60-year runs (1 control + 2 sensitivity), ~2 MSU expected.
@dougiesquire to initiate BGC runs at 25km and 100km resolutions.
@cbull may run simulations with the OM2-01-RYF configuration.
@anton to initiate C-Grid-CICE runs at 25km resolution.

Performance and Optimisation

Comparative runtime performance between OM2 and OM3 models.

Config	HGrid	VGrid	Cores	kSU/Year	Years/Day	kSU / 10¹⁰ Cells / Day / Core
ACCESS-OM2-025	1440×1080	50 levels	1824	7.2	12.2	6.19
ACCESS-OM3-025	1440×1142	75 levels	1664	10	8.28	4.03

Current OM3 demonstrates improved compute efficiency, requiring ~35% fewer core-hours per 10¹⁰ grid cells per day than OM2. Further tuning including load balancing and I/O efficiency is ongoing.
@minghangli will draft detailed documentation on model optimisation strategies and performance benchmarks.
@minghangli Ongoing coordination with the Software Transformation team to support performance enhancements will continue.

anton · 25 June 2025 22:55

Date: 25 Jun 2025
Chair: @cbull
Minutes keeper: @anton
Attendees: @dougiesquire @helen @sofarrell @ezhilsabareesh8

Minutes

Sub-mesoscale parameterisation

See [Parameter Testing] Sub-mesoscale parameterisation · Issue #435 · ACCESS-NRI/access-om3-configs · GitHub

AS showed some results from turning on a sub-mesoscale paramaterisations, showing low salinity in Arctic.

Its hard to say if low salinity is the confirmed cause of the sea-ice thermodynamics issue, but seems likely.

Suggestions to check:

is bathymetry shallow
how does model surface salinity compare to WOA23
check runoff volume at location
investigate turning off MASK_SRESTORE_UNDER_ICE
could use mushy rather than BL99 sea ice thermodynamics for better handling of v. low salinity
global SST plot compared to the baseline run for overall impact of changing parameterisation

SST bias:

@aekiss identified the SST biases in OM3 25km config seem higher and sometimes different sign than the OM2-025 biases. Needs investigation and longer run.

DATM/DROF time interpolation

AS showed Fix/check time in data atm/rof stream files · Issue #437 · ACCESS-NRI/access-om3-configs · GitHub

agreed to linear interpolate runoff and raise an issue to implement an improved runoff scheme

rho coordinate in ocean

Deferred discusson until AH is available. Dougie to organise a meeting.

github.com/ACCESS-NRI/access-om3-configs

Density layer diagnostics

opened 04:55AM - 02 Jun 25 UTC

AndyHoggANU

OM3-25k:P0

We should ensure that all MOM6-based configs have the capability for outputting …on density layers. To do this, we need to add something like the following to `MOM_input`: ``` NUM_DIAG_COORDS = 1 DIAG_COORDS = "rho2 RHO2 RHO" DIAG_COORD_DEF_RHO2 = "FILE:diag_rho2.nc,interfaces=rho2" ``` Note that we could choose to define the rho2 coordinate in a different way, like: ``` DIAG_COORD_DEF_RHO2 = "RFNC1:35,999.5,1028,1028.5,8.,1038.,0.0078125" ``` In addition, we'll need to alter the `diag_table` to allow for alternative output models, which needs the following at a minimum: ``` "ocean_month_rho2", 1, "months", 1, "days", "time" "ocean_model_rho2", "thkcello", "thkcello", "ocean_month_rho2", "all", "mean", "none",2 "ocean_model_rho2","umo", "umo", "ocean_month_rho2", "all", "mean", "none",2 "ocean_model_rho2","vmo", "vmo", "ocean_month_rho2", "all", "mean", "none",2 ``` I'm not even sure that `make_diag_table.py` can handle the above - if not, we may need to alter the script.

Compute

Quarter ends monday night - looks like its ok

ESM1.6

@ezhilsabareesh8 has updated mediteranean sea bathymetry - deepened sicily straight. There is some impact, but not enough yet. We probably need to try doing a test run with some widening of the straits but also compare model results to observations if possible.

SoF will have a look at the output from CICE5 runs as well. DB might be on leave this week.

anton · 9 July 2025 23:14

Date: 9 July 2025
Chair: @cbull
Minutes: @anton
Attendees: @ezhilsabareesh8 @minghangli @sofarrell @dougiesquire @MartinDix @helen @aekiss

Minutes

Initial conditions

Should we change the initial conditions to match the equaton of state?

Community preference for roquet_rho (also known as TEOS10), but we currently use wright.

We do have a pertubation run for roquet_rho, we think performance is worse but don’t have a number for this. Generally targetting a release for next week, which would be much simpler to release with the status quo.

Current initial conditions are conservative temperature and practical salinity which is a pre-TEOS10 implementation that we wouldn’t revert to and is incorrect for both wright and roquet_rho.

Switching on roquet_rho introduces some uncertainty about model stability.

Plan: Note this as a known issue in the release note but remain with the status quo.

Enthalpy terms.

@aekiss found two heat content diagnostics which are zero. @dougiesquire investigated

The config relies on the mediator to calculate some of the enthalpy terms, but they are only calculated when ALL the possible water flux fields are connected, but we only connect and use some of the fields. There fore there is no enthalpy fluxes for all water exchange through the coupler.

We can configure OM3 to calculate the enthalpy in MOM, and a future update to MOM allows this to be done in the mediator. The two approaches will have similar answers.

Aim to update the MOM config before 25km-1.0 beta release.

Surface pressure under the ice

Should we be setting MAX_P_SURF in MOM due to something other than 0. There is possible for ocean-sea-ice instability.

Probably turn off the limit on this. We didn’t have this set before bringing in OM5 parameters.

Dougie will do a longish run before before 25km-1.0 beta release if possible.

GM

Needs more investigation. Its currently off but we probably want it to be on. It was on for the first 19 years of the long control / spin-up run.

Experience from Access-OM2 at quarter degree says we should be using it. [Chris: afterthought it should definitely be on in the 1 degree, I suppose it isn’t currently?]

Turning on GM seems a possible contender to improve our biases (early runs with this on have SST trend of the opposite sign to the current config but this comparison involves lots of other parameter changes) . Once GM is turned on, likely need some tuning for co-efficient / choice of GM.

Config docs

Stubs for individual branches should ready to merge soon. @atteggiani is working on a centralised model/template for config docs in different repositories.

Can we have a test for dead links in the config-docs ? We think access-hive docs have this. We’ll ask @atteggiani

Thanks to Ezhil for his work over the last two years - he is moving on to a postdoc in Germany

next meeting - two weeks

Addendum by @dougiesquire

Based on the discussions in the TWG and subsequently (10/7/25 with @dougiesquire @anton @aekiss @minghangli @cbull ), I’ve started three sensitivity test runs (+1 cold start control). All are based on this configuration with the following changes to MOM_input:

No changes

ENTHALPY_FROM_COUPLER = False   !   [Boolean] default = False
                                ! If True, the heat (enthalpy) associated with mass entering/leaving the ocean
                                ! is provided via coupler.

MAX_P_SURF = -1.0               !   [Pa] default = -1.0
                                ! The maximum surface pressure that can be exerted by the atmosphere and
                                ! floating sea-ice or ice shelves. This is needed because the FMS coupling
                                ! structure does not limit the water that can be frozen out of the ocean and the
                                ! ice-ocean heat fluxes are treated explicitly.  No limit is applied if a
                                ! negative value is used.
ENTHALPY_FROM_COUPLER = False   !   [Boolean] default = False
                                ! If True, the heat (enthalpy) associated with mass entering/leaving the ocean
                                ! is provided via coupler.

MAX_P_SURF = -1.0               !   [Pa] default = -1.0
                                ! The maximum surface pressure that can be exerted by the atmosphere and
                                ! floating sea-ice or ice shelves. This is needed because the FMS coupling
                                ! structure does not limit the water that can be frozen out of the ocean and the
                                ! ice-ocean heat fluxes are treated explicitly.  No limit is applied if a
                                ! negative value is used.
ENTHALPY_FROM_COUPLER = False   !   [Boolean] default = False
                                ! If True, the heat (enthalpy) associated with mass entering/leaving the ocean
                                ! is provided via coupler.
USE_CONTEMP_ABSSAL = True       !   [Boolean] default = False
                                ! If true, the prognostics T&S are the conservative temperature and absolute
                                ! salinity. Care should be taken to convert them to potential temperature and
                                ! practical salinity before exchanging them with the coupler and/or reporting
                                ! T&S diagnostics.
USE_PSURF_IN_EOS = True         !   [Boolean] default = True
                                ! If true, always include the surface pressure contributions in equation of
                                ! state calculations.
EQN_OF_STATE = "ROQUET_RHO"     ! default = "WRIGHT"
                                ! EQN_OF_STATE determines which ocean equation of state should be used.
                                ! Currently, the valid choices are "LINEAR", "UNESCO", "JACKETT_MCD", "WRIGHT",
                                ! "WRIGHT_REDUCED", "WRIGHT_FULL", "NEMO", "ROQUET_RHO", "ROQUET_SPV" and
                                ! "TEOS10".  This is only used if USE_EOS is true.
TFREEZE_FORM = "TEOS_POLY"      ! default = "LINEAR"
                                ! TFREEZE_FORM determines which expression should be used for the freezing
                                ! point.  Currently, the valid choices are "LINEAR", "MILLERO_78", "TEOS_POLY",
                                ! "TEOS10"

minghangli · 23 July 2025 07:49

Date: 23 July 2025
Chair: @anton
Minutes: @minghangli
Attendees: @AndyHoggANU @cbull @claireyung @sofarrell @dougiesquire @MartinDix @helen @aekiss

Minutes

Repository Strategy for region configurations

Currently there are +50 branches crowding in access-om3-configs. There will be several new panan and WOMBAT runs will add another 3-5 branches soon. Three options were proposed,

create a separate regional-only repo to avoid clutter,
use personal forks for experiment work, then merge back via PR,
keep current repo for dev and release branches only,
main CI wont run on external forks.

The current decision/action is to,

continue to use access-om3-configs, everyone keeps an eye on deleting temporary branches once merged / obsolete.
Most preferred the 2nd option, while @anton to test CI-on-forks feasibility then report next meeting.
@helen will push two WOMBAT branches prefix branch names with helen/, and delete when work is merged.

Freezing temperature consistency between mom6 and cice

In the upcoming beta release of the 25km RYF configuration, MOM now uses TEOS-10 EOS with TEOS-Poly freezing form, while CICE continues to use a linear salinity-pressure relation. There are potential mismatch at the ocean-seaice interface. This might also create spurious sensisble heat fluxes or frazil artefects, especially under ice shelves.

Currently TEOS-10 in MOM6 provides 2 freezing forms, one is TEOS (exact) and the other is TEOS_POLY which is a 23-term polynomial fit (cheaper).

During coupling, MOM checks if the ocean cell is super-cooled. If so, it removes the latent heat of frazil ice and sends it as a flux to CICE, which at the same time, has its own routine to decide what the ice-ocean freezing point is. So if the two routines disagree, there will be a small mismatch.

For CICE, it uses a linear tfrz_option, which gives temperature only as a function of salinity. However for TEOS_poly in MOM, it evaluates the full TEOS-10 polynomial as a function of salinity and pressure and hence changes with pressure as well.

CICE is unaware of ocean pressure unless explictly passed. When near the sea surface, pressure is nearly 0 so pressure terms are negligible but inside deep cavities, pressure can be hundreds of bars which makes TEOS_poly more essential.

Some conclusions:

In short term, make both components use the same simple linear freezing form.
In long term, port the TEOS_POLY routine into CICE so both sides share the same formulation.

Sea-ice Initial condition

@aekiss found a bug in temperature and salinity initialisation, where there’s an unrealistic stripe at start-up. The reason behind is due to the default cice initial condition we are currently using. @anton will test using none instead of default to check if the simulation would crash.

For the 25km RYF beta release, it will retain current default initial ice for reproducibility. An issue will document the bias and the zero-ice spin-ups for research focuses (none initial ice).

COSIMA workshop talk (29 July)

@aekiss is going to extend AMOS talk: include config rationale, preliminary beta metrics, and (if ready) GPU performance notes from @edoyango.

25km RYF configuration beta release

@anton outlined the remaining tasks for the 25km RYF release as follows,

finalise citation.cff
merge to release branch,
update release note on github
finish/merge access-hive docs PR
make release note public on hive forum
make feedback thread public on hive forum.

We agreed to proceed with the current configuration without modifying the layout. The model will be run for 20–30 years, with a current performance of 5 model years per day. @minghangli is working on tuning the layout but these changes will be included in the next version, not the current release.

Last but not the least, a big thans to @anton for his leadership and dedication in driving the release process forward!

Topic		Replies	Views
COSIMA TWG Meeting Minutes 2024 TWG meeting , twg , notes , minutes	20	875	4 December 2024
COSIMA TWG Meeting Minutes 2023 TWG meeting , twg , notes , minutes	15	1750	11 December 2023
Ocean Team Workshop (12-15th November, 2024) TWG workshop , ocean	13	602	28 November 2024
COSIMA Working Group Meeting Minutes Working Group meeting , notes	25	2449	2 July 2025
COSIMA TWG Announce TWG cosima , twg , minutes	67	2765	21 July 2025

COSIMA TWG Meeting Minutes 2025

TWG meeting 12 Feb

Present: @aekiss, @anton, @MartinDix, @dougiesquire, @cbull, @minghangli, @manodeep

ACCESS-OM3-025 project board

ePBL

other items (project board)

Repro issue

MOM6 dev meeting on Tuesday

COSIMA twg update tomorrow

Timestep at 1 deg

Release team meeting update (Tommy, Aidan, Lachlan, Spencer, Jo, Dougie and Chris)

Bluelink invite

NCRIS/board update

ACCESS-NRI COSIMA 2025 training program

Next time…

Key points

1. Restart Reproducibility Issue

2. MOM6 dev meeting update

3. Bottom Roughness Method Discussion

4. Other updates

Agenda:

Mom Fork Update (Dougie)

Mom 6 Dev Call Update

Restarting OM3 25k Control Run

Historical Repro and Component Layout Changes

New Build System and Upcoming Control Run

Upstream Model Component Update

Ocean Team Survey

Model Development Meeting Planning

Work Plan and Community Engagement

Evaluation Paper Planning

Alpha Release Status

Mediterranean Sea Salinity Issue

Next Steps

Agenda:

Meeting Minutes

Minutes

Sub-mesoscale parameterisation

SST bias:

DATM/DROF time interpolation

rho coordinate in ocean

Compute

ESM1.6

Minutes

Initial conditions

Enthalpy terms.

Surface pressure under the ice

GM

Config docs

Addendum by @dougiesquire

Minutes

Repository Strategy for region configurations

Freezing temperature consistency between mom6 and cice

Sea-ice Initial condition

COSIMA workshop talk (29 July)

25km RYF configuration beta release

Related topics