Dmitri Chubarov

Results 8 comments of Dmitri Chubarov

Can confirm the issue with xarray 2022.6.0 and dask 2022.9.2. The latest versions available on conda-forge. The issue might be related to netcdf4 and hdf5 libraries. Will try to update...

libnetcdf, netcdf4 and hdf5 are at their latest versions available on conda-forge

Adding a gdb [stackrace.txt](https://github.com/pydata/xarray/files/9741351/stackrace.txt) from corefile obtained with ``` docker run -v /mnt/fs:/my_s3_fs -it --rm --ulimit core=-1 --privileged netcdf:latest /bin/bash ``` and ``` sudo sysctl -w kernel.core_pattern=/tmp/core-%e.%p.%h.%t python mcve.py ```

Will try to reproduce this with h5py. For the bug to show up the file has to be large enough. That is why my example has a 2D array variable...

The first one results in a segfault: ```python import numpy as np import xarray as xr import pandas as pd N_TIMES = 48 time_vals = pd.date_range("2022-10-06", freq="20 min", periods=N_TIMES) ds...

It seems that we need the time variable to reproduce the problem. The following code does not fail: ``` import numpy as np import xarray as xr import pandas as...

datatype seems to be not important. But the two variables are required to get a segfault. The following with just floats produces a segfault ``` import numpy as np import...

I had to change ints and floats to doubles to reproduce the issue. ```python import h5py N_TIMES = 48 with h5py.File("/my_s3_fs/test.nc", mode="w") as f: time = f.create_dataset("time", (N_TIMES,), dtype="d") time[:]...