David Wendt

Results 57 issues of David Wendt

## Description Disables a `ContiguousSplitUntypedTest` that simply creates a very large (over 3GB) column to test the output buffer size does not overflow. The gtests ends requiring 25GB of device...

3 - Ready for Review
libcudf
improvement
non-breaking

## Description Removes call to `strings_column_view::offsets_begin()` call from `nvtext::generate_ngrams()`. A future PR will deprecate the `offsets_begin()` function which hardcodes to int32 type. ## Checklist - [x] I am familiar with...

2 - In Progress
libcudf
improvement
non-breaking

## Description The `cudf::test::to_host` function is updated to handle int32 and int64 offset types for strings columns when copying data to host memory. This function is used with `cudf::test::print()` as...

3 - Ready for Review
libcudf
improvement
non-breaking

## Description Updates `cudf::detail::has_nonempty_null_rows` to use the offsetalator instead of hardcoded integer type. ## Checklist - [x] I am familiar with the [Contributing Guidelines](https://github.com/rapidsai/cudf/blob/HEAD/CONTRIBUTING.md). - [x] New or existing tests...

2 - In Progress
libcudf
improvement
non-breaking

## Description Splitting out changes in PR #14824 to make it easier to review. The changes here simply move `replace_slice()` and `replace_nulls()` from `replace.cu` into their own source files. The...

3 - Ready for Review
libcudf
CMake
strings
improvement
non-breaking

## Description The `cudf::hashing::spark_murmurhash3_x86_32()` function was moved to the Spark plugin since it had common code with the Spark implementation of `xxhash_64` (also implemented in the plugin). This change deprecates...

2 - In Progress
libcudf
CMake
improvement
breaking

## Description This reworks the `cudf::strings::detail::copy_range()` function to use the offsetalator instead of accessing the output offsets directly. Also refactored the code to remove the unnecessary template arguments. And added...

3 - Ready for Review
libcudf
CMake
improvement
non-breaking

## Description The logic used by `strings::split()` functions is refactored into its own utility for reuse with `strings::replace` and possibly other strings and text functions. ## Checklist - [x] I...

3 - Ready for Review
libcudf
strings
improvement
non-breaking

## Description Updates `cudf::row_bit_count()` to use the offsetalator to compute chars size for a strings column. ## Checklist - [x] I am familiar with the [Contributing Guidelines](https://github.com/rapidsai/cudf/blob/HEAD/CONTRIBUTING.md). - [x] New...

3 - Ready for Review
libcudf
improvement
non-breaking

## Description Fixes `cudf::strings::extract_all()` to use `cudf::detail::make_offsets_child_column` so it properly computes the output-size and checks for overflow when building offsets for a lists column. Also undo some changes from #14745...

3 - Ready for Review
libcudf
strings
improvement
non-breaking