arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
### Describe the enhancement requested Presently, `Dataset` has methods to perform several actions—`sort_by`, `join`, and `join_asof`—with Acero. It would be especially helpful to provide a method to perform aggregations on...
### Rationale for this change The final Python 3.13.0 is out now, so we can update those versions * GitHub Issue: #43519
### Describe the bug, including details regarding any error messages, version, and platform. I'm using the Python pyarrow module and running into this strange behavior when casting strings into a...
This PR is a fresh continuation of https://github.com/apache/arrow/pull/13487 which was closed purely in favor of this one. The reason for this is that the other PR carried a lot of...
With pandas' [PDEP-14](https://pandas.pydata.org/pdeps/0014-string-dtype.html) proposal, pandas is planning to introduce a default string dtype in pandas 3.0 (instead of the current object dtype). This will become the default in pandas 3.0,...
### Rationale for this change We have a document that uses clang-cl: https://arrow.apache.org/docs/developers/cpp/windows.html#building-on-windows-arm64-using-ninja-and-clang But we don't have a job that uses clang-cl, so the clang-cl build may break unexpectedly. ###...
### Describe the enhancement requested We have a document that uses clang-cl: https://arrow.apache.org/docs/developers/cpp/windows.html#building-on-windows-arm64-using-ninja-and-clang But we don't have a job that uses clang-cl. ### Component(s) C++
### Rationale for this change Described in https://github.com/apache/arrow/issues/44054. ### What changes are included in this PR? Implement three functions to extract PG's DATA/TIMESTAMP data type. They use a common class...
### Describe the bug, including details regarding any error messages, version, and platform. There are use-cases where I cannot use the "normal" filtering with hive-partitioning and must `open_dataset` directly on...
### Describe the enhancement requested Is there a larger plan to start adding compute kernels for the binary view types? I see dedicated issues like https://github.com/apache/arrow/issues/43010 but I'm not sure...