cubed issues

np.nanmean executes eagerly on cubed arrays but lazily on dask arrays

4

See image for demonstration. ![Screenshot from 2023-03-14 19-26-13](https://user-images.githubusercontent.com/35968931/225164219-0125df14-e3f9-46ee-85c2-8ec523093ec1.png) `np.nanmean` is called by xarray's `.mean()` method when `skipna=True`, which is the default.

TomNicholas

Create a Lithops executor that uses Python asyncio

Lithops uses an async programming model, but not Python asyncio. It would be nice if we could bridge the two, as then we'd be able to use the common asyncio...

tomwhite

runtime

Add unit test for Coiled executor

It should run in CI too, like the Modal one does.

tomwhite

runtime

Test with Zarr V3

1

Add a GH Actions workflow that runs Cubed tests using Zarr V3 storage. This will ensure that the upcoming work to update Zarr to support the V3 spec (see https://github.com/zarr-developers/zarr-python/discussions/1480)...

tomwhite

zarr

Executor feature comparison

6

(I wrote this to help track what works needs to be done on the executors, but it might be useful to add to the user docs at some point.) This...

tomwhite

documentation

runtime

Defer to merge_chunks in special cases of rechunk

4

#221 introduced `merge_chunks`, a special-case of `rechunk` that can be implemented using `blockwise`. I noticed that whilst `reduction` calls `merge_chunks` directly, inside `ops.rechunk` the [primitive rechunk is always called](https://github.com/tomwhite/cubed/blob/93ad984e7b0445164ab11b3c3f3a3b7db6c3bc97/cubed/core/ops.py#L631C11-L631C11). Shouldn't...

TomNicholas

core

optimization

Introduce a class to model parts of an operation's memory usage

Consider a simple blockwise operation with one input, where each task carries out the following steps: 1. read compressed Zarr chunk 2. decompress Zarr chunk to produce the input array...

tomwhite

memory

optimization

Blockwise fusion should use `numblocks` not `num_tasks`

See https://github.com/tomwhite/cubed/issues/284#issuecomment-1660425647

tomwhite

bug

optimization

Add scan / prefix sum primitive

13

If you're looking for something to do :), then scans would be a good thing to add. Dask calls this "cumreduction" (terrible name!) : and its a quite useful primitive...

dcherian

array api

core

Calculating the cost of a computation

2

It would be useful to provide numbers for actual resources used after a computation is complete so its cost can be calculated. We probably need: 1. total worker seconds 2....

tomwhite

enhancement

runtime

cubed
cubed copied to clipboard

Metadata

np.nanmean executes eagerly on cubed arrays but lazily on dask arrays

Create a Lithops executor that uses Python asyncio

Add unit test for Coiled executor

Test with Zarr V3

Executor feature comparison

Defer to merge_chunks in special cases of rechunk

Introduce a class to model parts of an operation's memory usage

Blockwise fusion should use `numblocks` not `num_tasks`

Add scan / prefix sum primitive

Calculating the cost of a computation

← Metadata

Owner

Metadata

cubed cubed copied to clipboard

Metadata

← Metadata

Owner

Metadata

cubed
cubed copied to clipboard