Tom Nicholas

Results 182 issues of Tom Nicholas

When combining along a new dimension using `coo_map`, `MultiZarrToZarr` fails to concatenate dimension coordinate variables, despite concatenating coordinate variables just fine. Minimal example: ```python from kerchunk.hdf import SingleHdf5ToZarr # Set...

I'm seeing what looks like non-deterministic behaviour with `MultiZarrToZarr`. It happens both with and without writing to parquet. Sometimes I get the expected result written, sometimes I don't. See the...

Kerchunk doesn't properly decode the JSON for zarr array-level attributes, instead leaving dictionaries as long strings. For example: ```python # create example netCDF4 file xr.tutorial.open_dataset('air_temperature').to_netcdf('air.nc') kerchunk.backends.SingleHdf5ToZarr('air.nc', inline_threshold=300).translate() ``` ```python {'version':...

### Problem Kerchunk user code currently passes around an obscure multiply-nested "reference dict" object. This is hard to read, interrogate, validate, or reason about. ### Suggestion Instead create a new...

### Problem The [API for Kerchunk's file format backend openers](https://fsspec.github.io/kerchunk/reference.html#file-format-backends) doesn't follow a consistent pattern. ### Suggestion Change the openers to each be a function returning a `VirtualZarrStore` (see #375),...

Here are two toy datasets designed to represent sections of a dataset that has variables living on a staggered grid. This type of dataset is common in fluid modelling (handling...

I'm trying to learn more about how fsspec works (so I can better understand the IO part of the Pangeo stack), but I'm confused by the structure of `fsspec/filesystem_spec`. Despite...

Hi, is there a way to retrospectively label PRs? We have thousands of unlabeled previous PRs and it might be nice to add labels so that we can search through...

Basically same question as https://github.com/gauteh/hidefix/issues/38#issue-2236324824 but for this library 😁 I'm building [VirtualiZarr](https://github.com/TomNicholas/VirtualiZarr), an evolution of [kerchunk](https://github.com/fsspec/kerchunk), that allows you to determine byte ranges of chunks in netCDF files, but...

help wanted
feature

**Description of Feature:** Auto-generate a DOI for each published repository. **What value is this feature adding to Source Cooperative?** Makes it easier for me to cite the location where the...

proposed feature