pandas2 icon indicating copy to clipboard operation
pandas2 copied to clipboard

Design documents and code for the pandas 2.0 effort.

Results 58 pandas2 issues
Sort by recently updated
recently updated
newest added

Obvious / currently supported [see here](https://wesm.github.io/pandas2-design/internal-architecture.html#high-level-logical-type-proposal) xref #20 - integer - unsigned integer - float - complex - boolean - datetime (ns) - datetime w/tz (ns) - timedelta (ns) -...

dtypes

nuff said at least where possible / documented

compat

Rs "new" pipes combined with easily added functions more or less made Rs data handling much easier to read and to extend than pandas. The advantage is IMO twofold: -...

API

pandas already supports lazy evaluation via numexpr. adding numba allows the possibility of generation of lazy expressions (and recent versions support ahead-of-time compilation). I would simply make these requirements; they...

performance

I will work on a full document for this to get the conversation started, but this can be the placeholder for our discussion about COW

performance
API
compat

I really like the proposal so far 👍 Are there any plans to provide a semi-stable C++/Cython API that could be used by other projects for things beyond simple pandas...

compat

I was poking around pandas to see if someone had implemented the equivalent of a vectorized if-then-else. For example, similar to `np.where`, but pandas friendly ``` python df['category'].isin(values).ifelse(df.group_a, df.group_b) ```...

performance
API

xref #9 Maybe we can collect a list of pandas issues that have happened in and around this. - https://github.com/pydata/pandas/issues/14170 I've found it's valuable to be able to consistently compute...

missing data

pandas would potentially benefit from a more efficient decimal dtype, possibly using libmpdec (what CPython uses internally) for the internal implementation.

enhancement
dtypes