Stephan Hoyer
Stephan Hoyer
> We could raise an error asking the user to switch to `swap_dims`. This seems like a good idea In the long term, we like to decouple indexes from coordinate,...
Sounds good to me. The challenge will be defining a parallel computing API that works across all these projects, with their slightly different models.
I think the right word for this may be "schema". For applications and models (rather than data analysis), these sort of conventions can be super-valuable. I like the idea of...
`ncdump -h` (`xarray.Dataset.info()`) is one existing schema of sorts, but it's hardly machine readable.
> [Pandera](https://pandera.readthedocs.io) provides machine-readable schemas for Pandas and, as of version 0.7, [panderas has decoupled pandera and pandas types](https://github.com/pandera-dev/pandera/pull/559) to make pandera more useful for things like xarray. I haven't...
+1 for adding this to xarray. `to_ncml` would also be nice to have.
I have not thought much about APIs yet.
Maybe a separate project in xarray-contrib would make sense? I would be reluctant to add this into Xarray proper if we need a new external dependency for reading XML files....
To clarify: why is it a problem for you to get floating point values like 21.939998626708984 instead of 21.940000000000001? Is it a loss of precision in some downstream calculation? Both...
> A float32 values has 24 bits of precision in the significand, which is more than enough to store the 16-bits in in the original data; the exponent (8 bits)...