pandas2
pandas2 copied to clipboard
Design documents and code for the pandas 2.0 effort.
On some contemplation, I am thinking it may lead to overall cleaner C++ code if we use exceptions for error reporting instead of status codes. These exceptions should be truly...
https://github.com/pythonql/pythonql/wiki/PythonQL-Intro-and-Tutorial#13-pythonql-and-pandas looks interesting
http://quantstack.net/xtensor
What can pandas provide in the way of a C/C++/Cython API to better enable upstack ML / statistical libraries? @ogrisel @amueller, who might have some good perspectives?
We'll want to develop a microbenchmark suite to compare pandas 2.0 perf versus 1.x, especially in microbenchmarks. What's the right tool for this? asv, vbench?
https://github.com/llllllllll/libpy looks like interesting way to construct expressions
parallel `.apply`, see [here](https://github.com/pydata/pandas/issues/13111) xref [dask](https://github.com/dask/dask) & [distributed](https://github.com/dask/distributed) This issue is a placeholder for discussion w.r.t. how much pandas 2.0 should be in charge of out-of-core / parallel operations. For...
IIRC this from the design docs, but wanted to make an issue to remember. We want to have a set of lazily computed array attributes. Sometimes these can be set...
examine approach & integrations for the IO subsystems. pay attention to copies, memory mapping, and parallelization. let this be a master issue, with specific discussions in separate issues as needed....