Zarr inodes

Following on from the discussion at today's working group meeting about reducing Zarr inode usage, and in relation to Experiment Proposal: Processing Global km-scale Hackathon Data.

If anyone has had success storing Zarr with shards or in a zip format, could you please respond in this thread with how you did it? I’ve had mixed success with this so far, so it would be very useful to know what others are doing.
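
For reference, a sharded store is one way to cut the file count. Below is a minimal sketch, assuming zarr-python >= 3 (where sharding is exposed through the shards= argument of zarr.create_array); the store path, shapes and chunk/shard sizes are only placeholders.

import numpy as np
import zarr

# Group 100x100 chunks into 1000x1000 shards, so each shard is written as a
# single file containing 100 chunks rather than 100 separate chunk files.
arr = zarr.create_array(
    store="example_sharded.zarr",
    shape=(2_000, 2_000),
    chunks=(100, 100),      # read/decompression granularity
    shards=(1_000, 1_000),  # on-disk file granularity
    dtype="float32",
)
arr[:] = np.random.default_rng(0).random((2_000, 2_000), dtype=np.float32)

With these numbers the store holds 4 shard files instead of 400 chunk files, so inode usage scales with the number of shards rather than the number of chunks.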

Hi Sam,

I have used code like this:

import os
import zipfile


def zip_zarr(zarr_filename, zip_filename):
    """Zip a zarr collection.

    Parameters
    ----------
    zarr_filename : str
        Path to (unzipped) zarr collection
    zip_filename : str
        Path to output zipped zarr collection
    """
    # Store without re-compression: Zarr chunks are typically already compressed.
    with zipfile.ZipFile(
        zip_filename, "w", compression=zipfile.ZIP_STORED, allowZip64=True
    ) as fh:
        for root, _, filenames in os.walk(zarr_filename):
            for each_filename in filenames:
                each_filename = os.path.join(root, each_filename)
                fh.write(each_filename, os.path.relpath(each_filename, zarr_filename))

to zip an existing Zarr directory. This, I believe, reduces the inode usage to 1.
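
For example (these paths are just placeholders):

zip_zarr("dataset.zarr", "dataset.zarr.zip")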

I can then re-open the zipped Zarr file with

import xarray as xr
import zarr

inputs = zarr.storage.ZipStore(zip_filename, mode='r')
ds = xr.open_zarr(inputs)

However, zipping the Zarr store up comes with a penalty when reading it back in: in my testing, reads took roughly twice as long as from the original, unzipped Zarr store.

GDAL can access data within a zip via /vsizip, and that performs better than a plain zip if the archive is SOZip-enabled (seek-optimized ZIP). I’m working on a GDAL backend for xarray (the biggest improvement being the ability to use the multidimensional API), so I will try this out at some point.
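
As a rough sketch of what that access path could look like, assuming GDAL >= 3.7 (which ships the sozip utility and the Zarr driver) and the osgeo Python bindings; the file names and the internal zip path are placeholders and depend on how the archive was built:

from osgeo import gdal

gdal.UseExceptions()

# A seek-optimized archive can be built beforehand with GDAL's sozip utility,
# e.g. `sozip -r dataset.zarr.zip dataset.zarr`.
# Open the Zarr store inside the zip through /vsizip, using the Zarr driver's
# multidimensional API.
ds = gdal.OpenEx('ZARR:"/vsizip/dataset.zarr.zip/dataset.zarr"', gdal.OF_MULTIDIM_RASTER)
root = ds.GetRootGroup()
print(root.GetMDArrayNames())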
