Tom Nicholas

Results 182 issues of Tom Nicholas

#221 introduced `merge_chunks`, a special-case of `rechunk` that can be implemented using `blockwise`. I noticed that whilst `reduction` calls `merge_chunks` directly, inside `ops.rechunk` the [primitive rechunk is always called](https://github.com/tomwhite/cubed/blob/93ad984e7b0445164ab11b3c3f3a3b7db6c3bc97/cubed/core/ops.py#L631C11-L631C11). Shouldn't...

core
optimization

We briefly discussed the difficulty of maintaining multiple executors. In https://github.com/tomwhite/cubed/pull/168#issuecomment-1542892933 I suggested expanding the CI to run different test jobs with different executors installed. I also saw [this dask...

runtime

I tried to set up the problem of calculating the anomaly with respect to the group mean from https://github.com/pangeo-data/distributed-array-examples/issues/4. I used fake data with the same chunking scheme instead of...

xarray-integration
benchmarks

To subset my data whilst getting around #196 I tried slicing using xarray's lazy indexing machinery before converting to cubed arrays using `.chunk` (a trick which when used with dask...

I was looking deeper into how to make https://github.com/pydata/xarray/issues/7813 work. So looks like the nodes are named when they are created by `Plan._new`. Q's: - Would it make sense to...

xarray-integration

![Screenshot from 2023-05-04 16-00-19](https://user-images.githubusercontent.com/35968931/236316223-14e7d144-97c0-4975-9ee9-cd76d7c180bc.png) returns ```python ╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮ │ /tmp/ipykernel_495009/1422487590.py:1 in │ │ │ │ │ │ /home/tom/Documents/Work/Code/cubed/cubed/core/array.py:112 in compute │ │ │ │ 111...

FYI this might be useful in cubed's tests too https://github.com/dask/dask/pull/9374

@yuvipanda pointed out that the name "cubed" is almost impossible to Google - there are so many other projects called cubed. (In our field "cubed" would also likely get confused...

- [x] Closes #9016 - [x] Tests added - [ ] User visible changes (including notable bug fixes) are documented in `whats-new.rst` - [ ] New functions/methods are listed in...

I'm building [VirtualiZarr](https://github.com/TomNicholas/VirtualiZarr), an evolution of [kerchunk](https://github.com/fsspec/kerchunk), that allows you to determine byte ranges of chunks in netCDF files, but then concatenate the virtual representation of those chunks using xarray's...