xarray icon indicating copy to clipboard operation
xarray copied to clipboard

Concurrent loading of coordinate arrays from Zarr

Open shoyer opened this issue 4 years ago • 1 comments

When you open a dataset with Zarr, xarray loads coordinate arrays corresponding to indexes in serial. This can be slow (multiple seconds) even with only a handful of such arrays if they are stored in a remote filesystem (e.g., cloud object stores). This is similar to the use-cases for consolidated metadata.

In principle, we could speed up loading datasets from Zarr into Xarray significantly by reading the data corresponding to these arrays in parallel (e.g., in multiple threads).

shoyer avatar Mar 30 '21 02:03 shoyer