
Investigate/add examples of datasets in Zarr format to the scivision catalog

Open acocac opened this issue 2 years ago • 3 comments

Zarr is a relatively new, cloud-optimized data format designed to improve access to N-dimensional arrays.

Zarr stands out because it stores arrays in chunks, which supports parallel reads and parallel writes, and it plugs easily into different storage systems such as cloud object stores.
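To make the chunking idea concrete, here is a toy sketch in plain Python. It is not the zarr library or its actual storage format; the names `SHAPE`, `CHUNKS`, `chunk_key`, `write_chunk` and `read_element` are hypothetical and exist only for illustration. It assumes each chunk is stored under its own key in a key-value store, which is roughly why chunks can be read and written independently and why the backing store can be swapped out.

```python
import json
from concurrent.futures import ThreadPoolExecutor
from itertools import product

# Toy model only: this is NOT the zarr library or its on-disk format.
SHAPE = (4, 6)   # full array: 4 rows, 6 columns (hypothetical)
CHUNKS = (2, 3)  # each chunk: 2 rows, 3 columns (hypothetical)


def chunk_key(ci, cj):
    """Name a chunk by its position in the chunk grid, e.g. '1.0'."""
    return f"{ci}.{cj}"


def value(r, c):
    """Value stored at row r, column c (a simple running index)."""
    return r * SHAPE[1] + c


def write_chunk(store, ci, cj):
    """Encode one chunk and store it under its own key."""
    rows = range(ci * CHUNKS[0], (ci + 1) * CHUNKS[0])
    cols = range(cj * CHUNKS[1], (cj + 1) * CHUNKS[1])
    block = [[value(r, c) for c in cols] for r in rows]
    store[chunk_key(ci, cj)] = json.dumps(block).encode()


def read_element(store, r, c):
    """Fetch one element by decoding only the chunk that holds it."""
    ci, cj = r // CHUNKS[0], c // CHUNKS[1]
    block = json.loads(store[chunk_key(ci, cj)])
    return block[r % CHUNKS[0]][c % CHUNKS[1]]


# Any mapping from keys to bytes can act as the store; a dict stands in
# for a directory or an object-store bucket here.
store = {}
grid = list(product(range(SHAPE[0] // CHUNKS[0]),
                    range(SHAPE[1] // CHUNKS[1])))

# Chunks are independent of each other, so they can be written concurrently.
with ThreadPoolExecutor() as pool:
    list(pool.map(lambda idx: write_chunk(store, idx[0], idx[1]), grid))

print(sorted(store))              # ['0.0', '0.1', '1.0', '1.1']
print(read_element(store, 3, 4))  # 22
```

Reading element `(3, 4)` touches only the chunk stored under `'1.1'`; the other three chunks are never decoded.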

It'd be interesting to look for benchmark datasets across domains that are available in this format and incorporate them into the Scivision catalog.

Some references:

acocac avatar Aug 08 '23 09:08 acocac

I started setting up a zarr dataset in this repo, but didn't have a good place to host the data (other than github): https://github.com/lowe-lab-ucl/cell-cycle-scivision

quantumjot avatar Aug 08 '23 09:08 quantumjot

> I started setting up a zarr dataset in this repo, but didn't have a good place to host the data (other than github): https://github.com/lowe-lab-ucl/cell-cycle-scivision

Have you uploaded the data to GitHub? I've found the data.yml, but not the dataset.

acocac avatar Aug 08 '23 09:08 acocac

Include in recommendations for data providers? https://github.com/alan-turing-institute/scivision/issues/343

ots22 avatar Aug 22 '23 09:08 ots22