arrow
arrow copied to clipboard
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
This PR addresses both the JIRA issue cited (pre-generate pyarrow.compute) and also a dev thread that suggests creating the ability to add in python docs for functions that inherit from...
I have a Java library that is writing an Arrow Table to a VectorSchemaRoot object in memory. And I want read the data with C++. But it keeps getting an...
My computer has 64 GB RAM and the parquet files are not large at all. However, they take surprisingly long time to read. I don't know what is the potential...
**Primary Goal:** Create a scanner that "cancels" properly. In other words, when the scan node is marked finished then all scan-related thread tasks will be finished. This is different than...
arrow v.6.0.0.2 results in the following error when attempting read_feather on a 32 GB feather file: Error: Invalid: Invalid read (offset = 7140512496, size = -956703880)
Changes introduced by [ARROW-16653](https://github.com/apache/arrow/pull/13506) were not written up by NEWS.md.
Hello everyone, I tried to build and test pyarrow on ppc64le platform with CUDA. I encounter segmentation fault at [test_cuda.py/test_foreign_buffer#L292](https://github.com/apache/arrow/blob/e90472e35b40f58b17d408438bb8de1641bfe6ef/python/pyarrow/tests/test_cuda.py#L292). If i comment out the test_foreign_buffer, then rest of the...
While building for Arch Linux, I’m observing 4 tests failures in the aforementioned suite: ``` [ RUN ] TestRandomInt64QuantileKernel.Overlapped /build/arrow/src/apache-arrow-7.0.0/cpp/src/arrow/compute/kernels/aggregate_test.cc:3234: Failure Value of: (quantiles[j] == numeric_scalar->value) || (std::isnan(quantiles[j]) && std::isnan(numeric_scalar->value))...