Wes McKinney

Results 59 issues of Wes McKinney

What can pandas provide in the way of a C/C++/Cython API to better enable upstack ML / statistical libraries? @ogrisel @amueller, who might have some good perspectives?

We'll want to develop a microbenchmark suite to compare pandas 2.0 perf versus 1.x, especially in microbenchmarks. What's the right tool for this? asv, vbench?

Placeholder for this part of the project

dtypes
API

I will work on a full document for this to get the conversation started, but this can be the placeholder for our discussion about COW

performance
API
compat

I was poking around pandas to see if someone had implemented the equivalent of a vectorized if-then-else. For example, similar to `np.where`, but pandas friendly ``` python df['category'].isin(values).ifelse(df.group_a, df.group_b) ```...

performance
API

xref #9 Maybe we can collect a list of pandas issues that have happened in and around this. - https://github.com/pydata/pandas/issues/14170 I've found it's valuable to be able to consistently compute...

missing data

pandas would potentially benefit from a more efficient decimal dtype, possibly using libmpdec (what CPython uses internally) for the internal implementation.

enhancement
dtypes

xref #15 I brought this up at SciPy 2015, but there's a significant performance win available in expressions like: ``` df[boolean_cond].groupby(grouping_exprs).agg(agg_expr) ``` If you do this currently, it will produce...

performance
memory-use

pandas's row indexes introduces a level of semantic incompatibility with other systems that occasionally causes problems for users who are using both pandas and some other system. Functionally, this mainly...

API