We discussed briefly during the breakout sessions possible improvements to the cookbook:
- It currently does not scale well to higher-resolution models (1/10° and finer). Could this be improved?
- Many experiments in the database are not well documented and lack metadata. We should enforce mandatory metadata for the experiments indexed in the database.
Please add any other points for improvement to the list.
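On the metadata point, one option is a validation gate at index time, so incomplete experiments are rejected rather than silently indexed. A minimal sketch (the required field names here are illustrative assumptions, not the database's actual schema):

```python
# Sketch of a metadata gate for indexing: experiments missing any
# required field are refused rather than silently indexed.
# The field names below are illustrative, not the real schema.

REQUIRED_FIELDS = ("experiment", "contact", "description")

def validate_metadata(metadata: dict) -> list:
    """Return the required fields that are missing or empty."""
    return [f for f in REQUIRED_FIELDS
            if not str(metadata.get(f, "")).strip()]

def index_experiment(metadata: dict) -> bool:
    """Index the experiment only if its metadata is complete."""
    missing = validate_metadata(metadata)
    if missing:
        print(f"Refusing to index: missing {', '.join(missing)}")
        return False
    # ... hand off to the real indexer here ...
    return True
```

The same check could run as a warning-only pass over the existing database first, to see how many experiments would currently fail.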
Good points @JuliaN.
We’ve long aspired to have some test codes to use as a basis for optimisation, but it hasn’t gone very far. Still a worthy goal!
The idea of only indexing data with sufficient metadata has been floated before, and I agree, it is long overdue.
Hi @JuliaN ,
Do you have a specific recipe in mind?
I am looking at setting up an environment on Gadi to do that sort of scaling exercise.
I do think that is something that we need to run routinely.
A use case to get us started would be great!
We also discussed that we want some sort of automation to ensure that all notebooks run with the latest conda-analysis environment and that no output data has been moved or deleted, etc.
This is related to Automating jupyterbook running with CI tools - #3 by navidcy
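The CI side could start very simply: collect every notebook in the repo and execute each one, failing the job on the first broken recipe. A rough sketch (assumes `jupyter nbconvert` is on PATH in the CI environment; this is not an existing cookbook script):

```python
# Sketch of a CI check: find every notebook in the repo and execute
# each one, reporting any that error out. Assumes `jupyter nbconvert`
# is available on PATH in the CI environment.

import subprocess
from pathlib import Path

def find_notebooks(root: str) -> list:
    """All .ipynb files under root, skipping checkpoint copies."""
    return sorted(p for p in Path(root).rglob("*.ipynb")
                  if ".ipynb_checkpoints" not in p.parts)

def run_notebook(path: Path) -> bool:
    """Execute one notebook in place; True if it ran cleanly."""
    result = subprocess.run(
        ["jupyter", "nbconvert", "--to", "notebook",
         "--execute", "--inplace", str(path)],
        capture_output=True, text=True)
    return result.returncode == 0
```

A nightly job calling this over the repo would also catch the moved/deleted-output problem, since those notebooks fail to execute.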
Yes I second using overturning as a test example. Calculating time series of overturning for a couple of hundred years of the RYF takes a long time and is a metric that is used a lot.
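The metric itself is simple to state: the overturning streamfunction is the meridional transport accumulated in depth, roughly ψ(y, z) = Σ over depths above z of Σ over longitudes of v·dx·dz. A toy pure-Python version of that accumulation, just to pin down the benchmark target (real recipes would use xarray/dask on the actual model grid; the regular grid and cell sizes here are illustrative):

```python
def overturning(v, dx, dz):
    """Toy overturning streamfunction psi[k][j]: meridional velocity
    v[k][j][i] (depth, lat, lon) summed zonally, then accumulated
    from the surface down. Assumes uniform cell sizes dx and dz."""
    nk, nj = len(v), len(v[0])
    psi = [[0.0] * nj for _ in range(nk)]
    for j in range(nj):
        running = 0.0
        for k in range(nk):
            running += sum(v[k][j]) * dx * dz  # zonal sum, then depth accumulation
            psi[k][j] = running
    return psi
```

Timing this calculation over a few hundred years of RYF output is exactly the kind of routine scaling exercise mentioned above.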
This can be done relatively easily with an automated job that updates the kernel version in the notebook file, without even having to run it. It is definitely worthwhile to automate so that recipes are always correct when run at NCI.
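That kernel bump is just a small JSON edit to each notebook's `kernelspec` metadata. A sketch of what the automated job might do (the kernel name passed in would be whatever the current conda-analysis kernel is called; nothing here is an existing cookbook tool):

```python
# Sketch: rewrite a notebook's kernelspec metadata in place, without
# executing the notebook. The caller supplies the target kernel name.

import json
from pathlib import Path

def update_kernel(nb_path, kernel_name, display_name):
    """Point the notebook at kernel_name; True if the file changed."""
    path = Path(nb_path)
    nb = json.loads(path.read_text())
    spec = nb.setdefault("metadata", {}).setdefault("kernelspec", {})
    if spec.get("name") == kernel_name:
        return False  # already up to date
    spec.update(name=kernel_name,
                display_name=display_name,
                language="python")
    path.write_text(json.dumps(nb, indent=1))
    return True
```

Run over all notebooks whenever a new analysis environment is released, this keeps every recipe pointing at the right kernel with no execution cost.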
OK, I’ll add that one to my to-do list then.