pandas2 icon indicating copy to clipboard operation
pandas2 copied to clipboard

Design documents and code for the pandas 2.0 effort.

Results 58 pandas2 issues
Sort by recently updated
recently updated
newest added

On some contemplation, I am thinking it may lead to overall cleaner C++ code if we use exceptions for error reporting instead of status codes. These exceptions should be truly...

https://github.com/pythonql/pythonql/wiki/PythonQL-Intro-and-Tutorial#13-pythonql-and-pandas looks interesting

API

What can pandas provide in the way of a C/C++/Cython API to better enable upstack ML / statistical libraries? @ogrisel @amueller, who might have some good perspectives?

We'll want to develop a microbenchmark suite to compare pandas 2.0 perf versus 1.x, especially in microbenchmarks. What's the right tool for this? asv, vbench?

https://github.com/llllllllll/libpy looks like interesting way to construct expressions

parallel `.apply`, see [here](https://github.com/pydata/pandas/issues/13111) xref [dask](https://github.com/dask/dask) & [distributed](https://github.com/dask/distributed) This issue is a placeholder for discussion w.r.t. how much pandas 2.0 should be in charge of out-of-core / parallel operations. For...

performance

IIRC this from the design docs, but wanted to make an issue to remember. We want to have a set of lazily computed array attributes. Sometimes these can be set...

performance

examine approach & integrations for the IO subsystems. pay attention to copies, memory mapping, and parallelization. let this be a master issue, with specific discussions in separate issues as needed....

performance
IO