cubed
cubed copied to clipboard
Bounded-memory serverless distributed N-dimensional array processing
Use the `threads` executor to run some computations with free-threaded Python and see if it works. Ref: * [Free-threaded CPython is ready to experiment with!](https://labs.quansight.org/blog/free-threaded-python-rollout) * [py-free-threading.github.io](https://py-free-threading.github.io/) See also #497
There are a lot of parallel frameworks that Cubed Plans could be converted to (e.g. #499). We have executors for dask & beam, but instead of trying to write more...
Some possibilities: * Job arrays: https://slurm.schedmd.com/job_array.html (pointed out by @jeromekelleher) * funcX: https://funcx.org/ * Parsl: https://parsl-project.org/ cc @TomNicholas
In addition to accelerator support (e.g. via #304), Cubed could benefit ML users by providing [ray](https://github.com/ray-project/ray) executor: https://docs.ray.io/en/latest/ray-core/walkthrough.html Since Cubed is a serverless model, I bet it could get away...
We currently use a single Zarr array with a [structured data type](https://zarr-specs.readthedocs.io/en/latest/v2/v2.0.html#data-type-encoding) for storing intermediate outputs with multiple fields (such as the total and count when [computing the mean](https://github.com/cubed-dev/cubed/blob/e9de1aec8b1f86da20c128f4021a54351ccadcf6/cubed/array_api/statistical_functions.py#L34-L76)). Structured...
This is to support `nanmean`, `nansum` (see #153), and `pad` (#461). The previous attempt at this in #468 was reverted in #469, since it was causing problems when used with...
This is a possible solution to #418 Our model aims to predict the optimal `split_every` value that makes the reduction as fast as possible. This parameter affects the input data...
The following functions are currently implemented using `map_direct`, which assumes that the input arrays have been materialized as Zarr arrays so they can be accessed directly, typically in a way...
This came up in the context of sgkit where we want to load Zarr arrays that have an object dtype (even if we don't do any processing on them).