zarr-python
zarr-python copied to clipboard
An implementation of chunked, compressed, N-dimensional arrays for Python.
### Zarr version 3.0.0.beta ### Numcodecs version 0.13 ### Python Version 3.11 ### Operating System Mac ### Installation pip ### Description Zarr-Python 2 exposed `compressor` and `filters` as attributes on...
I'd like to open the conversation about what Zarr's core dependencies are for 3.0. Currently, this looks like: https://github.com/zarr-developers/zarr-python/blob/11312534ebe683d73cbbcc2da9e88933cb00cc14/pyproject.toml#L25-L34 Some of these are not used anymore (`asciitree` and `fasteners`) so...
#### Minimal, reproducible code sample, a copy-pastable example if possible Run the following twice: ```python import zarr if __name__ == "__main__": foo = zarr.open_group("/tmp/foo.zarr") if "bla" in foo.attrs: print("after loading:",...
Over in #1746 @dcherian discovered the following bug: ```python import zarr from zarr.array import Array from zarr.group import Group from zarr.store import MemoryStore import numpy as np store = MemoryStore(mode="w")...
### Describe the issue linked to the documentation The Group.create_array docstring lists the following for "compressor": ``` compressor: dict[str, JSON] | None = None The compressor for the array. ```...
Users come to Zarr with a variety of array-like objects -- numpy arrays, or dask arrays, or xarray DataArrays, zarr v2 arrays, zarr v3 arrays, etc. Imagine a venn diagram...
Similar to #2024, the "configuration" key of `chunk_key_encoding` is optional. If it's missing, a ValueError is raised: ``` File "/Users/jamoore/opt/challenge/zarr-python/src/zarr/common.py", line 130, in parse_named_configuration raise ValueError(f"Named configuration does not have...
The goal of this PR is to demonstrate one strategy to simplify the creation of arrays that use sharding. Don't consider merging this until we get a good look at...
#### Summary This is a ~~partial~~ implementation for zarr-developers/zarr-python#1773. So far, I have done the following: 1. Make `Metadata` abc generic over `Mapping[str, object]` 2. Define `TypedDict` classes for the...
The discussion in https://github.com/zarr-developers/zeps/pull/47 got me thinking: what if, instead of turning numpy arrays into bytes, we turn them into self-describing [Arrow Record Batches](https://arrow.apache.org/docs/python/generated/pyarrow.RecordBatch.html) and serialize them using the Arrow...