Issues loading ACCESS-OM2-01 data from cycle 4

Hello everyone! I am reposting here a question I asked via the CLEX Slack channel.

I am trying to access sea ice concentration data from the 4th cycle of ACCESS-OM2-01. I am using the COSIMA cookbook to do this, but it is taking a long time to do anything. No data is loaded even after waiting for as long as 20 minutes, and restarting the kernel several times has not helped. I also tried to load monthly sea ice concentration data, and after waiting for about 15 minutes I still do not have the data loaded into my Jupyter notebook.

For reference, loading data usually took 2-5 minutes using exactly the same method. I have also tried using two different conda environments (22.07 and 22.10) and that has not made a difference either.

I used the following code:

import cosima_cookbook as cc
import xarray

session = cc.database.create_session()
sic = cc.querying.getvar('01deg_jra55v140_iaf_cycle4', 'aice', session, start_time='1968')

I am using the gadi_jupyter script to access GADI. As I mentioned, I have done this before without any issues and did not have to wait this long. Are there any known issues with GADI or the COSIMA cookbook? Or should I use ARE to run my scripts instead?

Additionally, there is at least one other person who experienced the same issue while loading data today. They decided to switch tasks because the cookbook was not loading any of the data they needed. Unlike me, they were using ARE to access GADI.

Any help will be appreciated.

Denisse


I’m using analysis3-unstable; this is also taking a really long time for me and consuming an unreasonable amount of memory (over 25 GB).

CPU times: user 17min 27s, sys: 21min 5s, total: 38min 32s
Wall time: 38min 24s

By adding a few arguments to skip the coordinate verification, this was relatively fast for me (on a login node):

In [42]: %time sis = cc.querying.getvar(
                       "01deg_jra55v140_iaf_cycle4", "aice", s,
                       start_time="1968", compat="override", coords="minimal"
                     )    
CPU times: user 2min 45s, sys: 1min 40s, total: 4min 26s
Wall time: 2min 54s

The long sys time indicates that it spent a long time doing IO, probably because it’s reading four 2D grids out of every file and then comparing them all. Maybe this doesn’t come up as badly for the ocean data because it’s not output on a curvilinear grid?
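
For illustration (the file name here is just a placeholder; any single ice history file from the experiment would do), each CICE output file carries the same four 2D grid arrays as coordinate variables, which the default open_mfdataset checks read and compare for every file:

import xarray as xr

# Open one CICE history file (placeholder name) and list its coordinates;
# expect time plus the 2D TLON, TLAT, ULON and ULAT grid arrays.
ds = xr.open_dataset("iceh.1968-01.nc")
print(ds.coords)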

This was brought up before:

The suggested fix there was to use decode_coords=False, as demonstrated in the ice plotting recipe. This gives me very similar timing to the above:

In [61]: %time sis = cc.querying.getvar(
                       "01deg_jra55v140_iaf_cycle4", "aice", s,
                       start_time="1968", decode_coords=False
                     )
CPU times: user 2min 39s, sys: 1min 31s, total: 4min 11s
Wall time: 2min 34s

The difference between the two methods is that the first one will give you TLON, TLAT, ULON, ULAT (in addition to time) as coordinates on the resulting DataArray; whereas the second one will only give you a time coordinate – the spatial dimensions just have integer indices.
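
If you take the decode_coords=False route but still want the geographic coordinates back for plotting, one option is to pull the 2D grid arrays from a single file afterwards and attach them to the result. This is only a rough, untested sketch with a placeholder file name:

import xarray as xr

# Placeholder path: any one ice history file from the experiment.
grid = xr.open_dataset("iceh.1968-01.nc")

# Attach the 2D grid coordinates to the DataArray loaded with
# decode_coords=False, skipping the per-file comparison cost.
sis = sis.assign_coords(
    TLON=grid["TLON"], TLAT=grid["TLAT"],
    ULON=grid["ULON"], ULAT=grid["ULAT"],
)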


Wondering if we can test whether this is:

  1. a database problem, in that the cookbook is taking a long time to find the files/metadata; or
  2. a structural problem with the way ice data is saved (either in cycle 4, or more generally?)

@rbeucher and @dougiesquire are looking at ways to improve data catalogues, and it would be nice to know if this problem depends on the database itself …

In this case, it’s option 2: it’s just the way CICE’s sea ice data works (the GitHub issue has a little more technical detail – it would more generally apply to any output with 2D coordinates that get brought in). We could work around it in the cookbook, by doing things like:

  1. different defaults for coords, compat, etc. options for open_mfdataset
  2. default to decode_coords=False (I don’t think this is a good idea)
  3. detect if a variable’s coordinates are 2D at query time and throw a warning, with suggestions for options to speed up the load and/or a link to a notebook demonstrating the difference (a rough sketch of such a check follows below)
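
For option 3, something along these lines could run at query time. This is only a sketch (not existing cookbook code, and the function name is made up):

import warnings
import xarray as xr

def warn_if_2d_coords(path, variable):
    # Peek at one file for the queried variable and warn if any of its
    # coordinates are 2D, since that makes open_mfdataset's default
    # compatibility checks expensive.
    with xr.open_dataset(path) as ds:
        two_d = [name for name, coord in ds[variable].coords.items()
                 if coord.ndim >= 2]
    if two_d:
        warnings.warn(
            f"{variable} has 2D coordinates {two_d}; consider passing "
            "compat='override', coords='minimal' or decode_coords=False "
            "to speed up loading."
        )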

Part of the issue is that maybe we can’t (shouldn’t?) necessarily rely on all the files within an experiment being self-consistent to the point that we can use the nested method of concatenating files and assuming that the coordinates all line up. That probably falls more on the data cataloguing side of things.

Yes, I wonder if checking that all the data in an experiment can be concatenated could be something that is done once when the experiment is added to the database? Then compat="override" and coords="minimal" could safely be used by default.
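
As a rough sketch (not existing cookbook code), such a one-off check at indexing time might look like the following: confirm the 2D grid coordinates are identical across all of an experiment's files, after which compat="override" and coords="minimal" would be safe defaults for that experiment.

import xarray as xr

def grid_coords_consistent(paths, coord_names=("TLON", "TLAT", "ULON", "ULAT")):
    # Compare the 2D grid coordinates in every file against the first one.
    with xr.open_dataset(paths[0]) as first:
        reference = {name: first[name].load() for name in coord_names}
    for path in paths[1:]:
        with xr.open_dataset(path) as ds:
            if not all(reference[name].equals(ds[name]) for name in coord_names):
                return False
    return True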