arrow
arrow copied to clipboard
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
…f a file for Parquet format
Bumps hadoop-common from 3.3.2 to 3.3.3. [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=org.apache.hadoop:hadoop-common&package-manager=maven&previous-version=3.3.2&new-version=3.3.3)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a...
This PR adds support for more types of queries that include calls to R code (i.e., `map_batches(..., .lazy = TRUE)`, user-defined functions in mutates, arranges, and filters, and custom extension...
After: https://github.com/apache/arrow/pull/13640, seems like O_DIRECT is not a good idea, so let's use posix_fadvise to control the page cache to address the issue mentioned in: https://issues.apache.org/jira/browse/ARROW-14635. To test it, use...
I wanted to extract the schema of the table as json, but I noticed that the Python and R packages apparently only output the schema as a single string. So...
Will fix [ARROW-13763](https://issues.apache.org/jira/browse/ARROW-13763) A separate Jira issue will be made to address closing files in V2 ParquetDataset, which needs to be handled in the C++ layer. Adds context manager to...
BREAKING CHANGE Two changes for "internal fields" (fields within ListTypes and MapTypes): * ListType and MapType internal field names only matter in cases where metadata also matters (but can be...
**Summary** An additional parameter in Flight do_put to specify chunk size in R. **Problem** Currently, all data is sent through in a single message. It's a likely scenario that users...