cubed
Bounded-memory serverless distributed N-dimensional array processing
We briefly discussed the difficulty of maintaining multiple executors. In https://github.com/tomwhite/cubed/pull/168#issuecomment-1542892933 I suggested expanding the CI to run different test jobs with different executors installed. I also saw [this dask...
We know the peak memory used by every task, so if this value ever exceeds the user-specified `max_mem` then we can issue a warning....
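A minimal sketch of such a check, assuming a hypothetical helper `check_peak_mem` (the name and signature are illustrative, not Cubed's actual API):

```python
import warnings


def check_peak_mem(task_name, peak_mem, max_mem):
    """Warn if a task's measured peak memory exceeded the user's max_mem.

    Hypothetical helper: ``task_name``, ``peak_mem`` and ``max_mem`` are
    illustrative parameters, with both memory values given in bytes.
    """
    if peak_mem > max_mem:
        warnings.warn(
            f"Task {task_name!r} used {peak_mem} bytes of memory, "
            f"which exceeds max_mem of {max_mem} bytes"
        )
```

The warning points the user at the offending task, so they can raise `max_mem` or rechunk before the computation silently runs out of memory.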
For very large computations, where the number of tasks for an array is much greater than the number of workers, it may be desirable to have more control over task...
I tried to set up the problem of calculating the anomaly with respect to the group mean from https://github.com/pangeo-data/distributed-array-examples/issues/4. I used fake data with the same chunking scheme instead of...
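The computation in question can be illustrated with plain NumPy on fake data (this is only a sketch of the groupby-anomaly pattern, not the Cubed/Xarray code from the linked issue; the group labels and shapes are made up):

```python
import numpy as np

# Fake data standing in for the real dataset, as in the issue:
# compute each element's anomaly from its group's mean.
rng = np.random.default_rng(0)
data = rng.standard_normal((6, 4))
groups = np.array([0, 0, 1, 1, 2, 2])  # group label per row

# Mean over each group of rows, then broadcast back and subtract.
group_means = np.stack(
    [data[groups == g].mean(axis=0) for g in np.unique(groups)]
)
anomaly = data - group_means[groups]

# Within each group, anomalies sum to ~0 by construction.
assert np.allclose(anomaly[groups == 0].sum(axis=0), 0)
```

With real data the same pattern is expressed through Xarray's `groupby`, which is what makes the chunking scheme the interesting variable here.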
This issue is to explore running the "Transformed Eulerian Mean diagnostic" example in https://github.com/dcherian/ncar-challenge-suite/blob/main/tem.ipynb using Cubed. It uses Xarray, so needs https://github.com/pydata/xarray/pull/7019
Currently we use Zarr structured arrays, which are likely to be slow (since they are not column-oriented) - although that would be worth checking first. https://github.com/tomwhite/cubed/blob/400dc9adcf21c8b468fce9f24e8d4b8cb9ef2f11/cubed/array_api/statistical_functions.py#L18-L48
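To illustrate why structured arrays appear in the reduction at all, here is a sketch (NumPy only, not Cubed's actual code) of computing a mean via paired per-chunk intermediates held in one structured array:

```python
import numpy as np

# A structured (record) array holds the paired intermediates for a mean:
# per-chunk totals and counts, combined at the end. Field access pulls
# whole "columns", but the storage is row-interleaved, which is the
# performance concern raised above.
chunks = [np.arange(4.0), np.arange(4.0, 10.0)]
partial = np.empty(len(chunks), dtype=[("total", "f8"), ("n", "i8")])
for i, chunk in enumerate(chunks):
    partial[i] = (chunk.sum(), chunk.size)

mean = partial["total"].sum() / partial["n"].sum()
assert mean == np.arange(10.0).mean()
```

An alternative would be two separate Zarr arrays (one per field), trading the single-object convenience for contiguous per-field storage; benchmarking would show whether that matters in practice.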
To subset my data whilst getting around #196 I tried slicing using xarray's lazy indexing machinery before converting to cubed arrays using `.chunk` (a trick which, when used with dask...
With #176 you can run Cubed on Windows, but `peak_measured_mem` isn't implemented there. To implement it we could use `psutil`, but we should make it an optional dependency so that other platforms aren't required...
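One way the optional dependency could look, as a sketch (the function name mirrors the issue, but the fallback behaviour and the use of psutil's Windows-only `peak_wset` field are assumptions, not the actual implementation):

```python
def peak_measured_mem():
    """Best-effort peak memory (bytes) for this process, or None.

    Sketch only: psutil is imported lazily so it stays an optional
    dependency, and ``peak_wset`` is a Windows-only field of
    ``psutil.Process().memory_info()``; on other platforms (or when
    psutil is missing) this returns None rather than raising.
    """
    try:
        import psutil
    except ImportError:
        return None  # psutil not installed: feature unavailable
    info = psutil.Process().memory_info()
    # peak_wset exists only on Windows; elsewhere fall back to None.
    return getattr(info, "peak_wset", None)
```

On POSIX platforms the existing `resource.getrusage` path would still be used, so only Windows callers pay for (or require) psutil.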
I was looking deeper into how to make https://github.com/pydata/xarray/issues/7813 work. It looks like the nodes are named when they are created by `Plan._new`. Questions: - Would it make sense to...