dataframe-api
RFC document, tooling and other content related to the dataframe API standard
From the discussions at EuroScipy with scikit-learn developers (cc @ogrisel), the following use case came to mind: assume you have a method that _transforms_ your data, a workflow could be:...
One of the "to be decided" items at https://github.com/data-apis/dataframe-api/blob/dataframe-interchange-protocol/protocol/dataframe_protocol_summary.md#to-be-decided is: _**Should there be a standard from_dataframe constructor function?** This isn't completely necessary, however it's expected that a full dataframe API...
When researching all possible dtypes with missing values in Vaex and observing how this is handled in Pandas implementation I found that there is a BooleanDtype in Pandas that gives...
I couldn't work out if the *interchange* dataframe (i.e. the dataframe returned from `__dataframe__()`) should also have a `__dataframe__()` method, e.g. ```python >>> import pandas as pd >>> df =...
hi all, great to see some continued work on this project after the original discussion from last year. I still think it's useful to allow libraries to "throw data over...
This is a follow up of the discussions in: - https://github.com/pydata-apis/dataframe-api/issues/6#issuecomment-639754182 - #11 (question: `pandas has parameters (bool_only, numeric_only) to let only apply the operation over columns of certain types...
We currently list "datetime support" in the [design document](https://data-apis.org/dataframe-protocol/latest/design_requirements.html), and also listed it in the dtype docstring: https://github.com/data-apis/dataframe-api/blob/27b8e1cb676bf10704d1dfc3dca0d0d806e2e802/protocol/dataframe_protocol.py#L142 But at the moment the spec doesn't say anything about *how* the...
I think it's useful to think through concrete use cases on how the interchange protocol could be used, to see if it covers those use cases / the desired APIs...
Is its use similar to that in Arrow, such that if you slice a string array, you still back it by the same buffers, but the offset and length of...
Validity mask is a missing value representation that depends on the Column in the protocol. If `describe_null()` is meant to describe missing values at the column level for a given...
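To make the two questions above about sliced string arrays and validity masks more concrete, here is a minimal pure-Python sketch of an Arrow-style layout. It is an illustration only: `StringColumn`, `from_pylist`, and all field names are hypothetical and are not part of the interchange protocol or of any library. It assumes an LSB-first validity bitmask where a set bit means "valid", which is the Arrow convention, not necessarily what a given protocol implementation uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StringColumn:
    """Hypothetical Arrow-style string column (illustration only)."""
    data: bytes      # concatenated UTF-8 bytes of all strings
    offsets: list    # offsets[i]..offsets[i + 1] delimit string i in `data`
    validity: bytes  # bitmask, LSB-first; bit i set means row i is valid
    offset: int = 0  # logical start of this column within the buffers
    length: int = 0  # number of logical rows

    def slice(self, start, stop):
        # Zero-copy slice: the buffers are reused, only offset/length change.
        start = max(0, min(start, self.length))
        stop = max(start, min(stop, self.length))
        return StringColumn(self.data, self.offsets, self.validity,
                            self.offset + start, stop - start)

    def is_valid(self, i):
        # Look up the validity bit at the physical position offset + i.
        j = self.offset + i
        return bool((self.validity[j // 8] >> (j % 8)) & 1)

    def to_pylist(self):
        out = []
        for i in range(self.length):
            if not self.is_valid(i):
                out.append(None)
                continue
            j = self.offset + i
            raw = self.data[self.offsets[j]:self.offsets[j + 1]]
            out.append(raw.decode("utf-8"))
        return out


def from_pylist(values):
    """Build the three buffers from a list of str-or-None values."""
    data = bytearray()
    offsets = [0]
    mask = bytearray((len(values) + 7) // 8)
    for i, v in enumerate(values):
        if v is not None:
            data += v.encode("utf-8")
            mask[i // 8] |= 1 << (i % 8)
        offsets.append(len(data))
    return StringColumn(bytes(data), offsets, bytes(mask), 0, len(values))


col = from_pylist(["a", None, "ccc", "dd"])
sliced = col.slice(1, 3)  # shares col's buffers; only offset/length differ
```

In this sketch the slice never copies `data`, `offsets`, or `validity`. A consumer therefore has to apply the column's offset to every buffer, including the mask, which is why the missing-value description and the offset semantics need to be specified together.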