arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
### Rationale for this change The boost-cpp feedstock was archived. ### What changes are included in this PR? Switch to the new package name `libboost-devel`. ### Are these changes tested?...
### Rationale for this change https://github.com/apache/arrow/blob/7a36fcc8b7456bea52911f9601b26be51d16265a/python/pyarrow/src/arrow/python/numpy_to_arrow.cc#L499-L500 This should check for overflow and work together with the `safe` parameter. ### What changes are included in this PR? This PR proposes to implement overflow check in...
### Describe the usage question you have. Please include as many useful details as possible. We are running a small proof of concept (POC) in which we compare the performance when we load...
This is useful for example when an existing buffer can be immediately reused. This is currently used for [storage of strings in json parsing](https://github.com/apache/arrow/blob/master/cpp/src/arrow/json/parser.cc#L60), so it'd be straightforward to refactor...
Docs: Need to investigate the best way to do this. Maybe just see if we can build our client on GCP (once a protobuf definition is published to [https://github.com/googleapis/googleapis/tree/master/google](https://github.com/googleapis/googleapis/tree/master/google))? This...
This was discussed here: For testing/benchmarking dictionary encoding, it's useful to control the number of repeated values, and it would also be good to optionally include null values. The ...
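As a rough illustration of the kind of generator described in this issue, here is a minimal pure-Python sketch. It is not Arrow's implementation: the function name `make_dictionary_test_data` and its parameters (`num_unique`, `null_probability`, `seed`) are hypothetical, chosen only to show how the number of distinct values and the share of nulls could be controlled.

```python
import random


def make_dictionary_test_data(length, num_unique, null_probability=0.0, seed=0):
    """Return `length` values drawn from `num_unique` distinct strings.

    Hypothetical helper for illustration only. Each position is None
    with probability `null_probability`; otherwise it is picked
    uniformly from a fixed pool of `num_unique` strings.
    """
    if num_unique < 1:
        raise ValueError("num_unique must be at least 1")
    if not 0.0 <= null_probability <= 1.0:
        raise ValueError("null_probability must be in [0, 1]")

    rng = random.Random(seed)  # seeded so benchmarks are repeatable
    pool = [f"value_{i}" for i in range(num_unique)]

    data = []
    for _ in range(length):
        if rng.random() < null_probability:
            data.append(None)
        else:
            data.append(rng.choice(pool))
    return data
```

A small `num_unique` relative to `length` gives heavily repeated values, which is the case where dictionary encoding should help most.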
I tried to fix it but no matter what option I pass for `--line-ending` to `cmake-format` it converts LF line endings to CRLF. **Reporter**: [Wes McKinney](https://issues.apache.org/jira/browse/ARROW-4548) / @wesm **Note**: *This...
Both the C++ [implementation of parquet writer for arrow](https://github.com/apache/arrow/blob/master/cpp/src/parquet/arrow/writer.cc#L1174) and the [Python code bound to it](https://github.com/apache/arrow/blob/master/python/pyarrow/_parquet.pyx#L911) appear to be denominated in the **number of rows** (without making it very explicit). Whereas: (1)...
As the timezone database is not available on the Windows operating system, the cast timestamp test cases that use timezone APIs are failing. Tests are currently disabled on the Windows platform. Need...
`clang-format` supports command line option `-output-replacements-xml` which (in the case of no required changes) outputs: ``` ``` Using this option during `check-format` instead of using python to compute a diff...
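A minimal sketch of how the XML produced by `-output-replacements-xml` could be inspected, assuming the output is a `<replacements>` root element that contains one `<replacement>` child per required change and no children when the file is already formatted. The function name `needs_formatting` and the sample strings are hypothetical; the exact attributes may differ between `clang-format` versions.

```python
import xml.etree.ElementTree as ET


def needs_formatting(replacements_xml: str) -> bool:
    """Return True if the XML lists at least one <replacement> element.

    Hypothetical helper: assumes a <replacements> root with zero or
    more <replacement> children, as described in the lead-in.
    """
    root = ET.fromstring(replacements_xml)
    return root.find("replacement") is not None


# Assumed shape of the output for an already formatted file.
CLEAN_SAMPLE = (
    "<?xml version='1.0'?>\n"
    "<replacements xml:space='preserve' incomplete_format='false'>\n"
    "</replacements>\n"
)

# Assumed shape of the output when one change is required.
DIRTY_SAMPLE = (
    "<?xml version='1.0'?>\n"
    "<replacements xml:space='preserve' incomplete_format='false'>\n"
    "<replacement offset='10' length='2'> </replacement>\n"
    "</replacements>\n"
)
```

In a real `check-format` step, the string would come from running `clang-format -output-replacements-xml` on each source file, and the check would fail if `needs_formatting` returns `True` for any of them.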