(Re-posting from the Pangeo discourse)
With release 2023.3.1, dask offers a new method for rechunking arrays called p2p that rechunks large arrays at constant memory. This enables workloads that were previously un-runnable, or that required you to rechunk datasets ahead of time using tools like rechunker.
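A minimal sketch of opting in to p2p, assuming the `array.rechunk.method` config key and a running `distributed` client (p2p needs the distributed scheduler). The helper name `rechunk_with_p2p`, the array shapes, and the in-process `Client(processes=False)` are illustrative choices of mine, not something from the release notes:

```python
import dask
import dask.array as da
from distributed import Client


def rechunk_with_p2p(arr, chunks):
    """Hypothetical helper: build a rechunk graph with the p2p method selected."""
    # The method is picked up from config when rechunk() is called,
    # so the call has to happen inside the context manager.
    with dask.config.set({"array.rechunk.method": "p2p"}):
        return arr.rechunk(chunks)


if __name__ == "__main__":
    # In-process cluster, only for a quick local try-out (assumption).
    client = Client(processes=False)

    # Column-wise chunks rechunked to row-wise chunks: an all-to-all pattern.
    x = da.random.random((1000, 1000), chunks=(1000, 10))
    y = rechunk_with_p2p(x, (10, 1000))

    print(y.chunks)
    print(y.sum().compute())

    client.close()
```

If your dask version supports it, passing `method="p2p"` directly to `rechunk` should be equivalent; check the docs for the release you are on.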
See "Rechunking large data at constant memory in Dask [experimental]" in the HPC category on the Pangeo discourse.