gaohao95

Results 10 issues of gaohao95

#### What happens? A semi join should include only the columns of the left table. However, the substrait extension produces a join relation and a projection relation for a semi join and...
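To make the expected semantics concrete, here is a minimal sketch of a left semi join in plain C++. It is not the substrait extension's code: the `LeftRow` and `RightRow` types and the `semi_join` function are hypothetical names introduced only for illustration, and the join is assumed to be an equality match on a single integer key.

```cpp
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical row types, used only for this illustration.
struct LeftRow {
    int key;
    std::string name;
};

struct RightRow {
    int key;
    double amount;
};

// Left semi join: keep each left row that has at least one match on `key`
// in the right table. The result type is LeftRow, so no right-side column
// can appear in the output, and a left row is emitted at most once even
// when several right rows match it.
std::vector<LeftRow> semi_join(const std::vector<LeftRow>& left,
                               const std::vector<RightRow>& right) {
    // Collect the distinct keys present on the right side.
    std::unordered_set<int> right_keys;
    for (const auto& r : right) {
        right_keys.insert(r.key);
    }

    // Emit left rows whose key exists on the right, preserving left order.
    std::vector<LeftRow> result;
    for (const auto& l : left) {
        if (right_keys.count(l.key) > 0) {
            result.push_back(l);
        }
    }
    return result;
}
```

Because the return type carries only left-side fields, a plan that appends right-side columns to a semi join cannot be expressed with this signature, which is the property the issue describes.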

This PR assigns a more accurate set of inames to the assignment instructions added for CSE nodes.

By default, the memory pool size is the total GPU memory minus 500 MB. In some runs that hit OOM, we observed that using a smaller memory pool resolves the OOM issue. This indicates...

`cudaStreamDefault` is a flag meant to be passed to `cudaStreamCreateWithFlags`; it is not a valid stream. We should replace it with either `rmm::cuda_stream_default` or `0`.

We should check the device memory usage, and compare it with what we projected in the modeling, with and without pipelining. We should consider whether the extra device usage is...

We should profile and improve the computation-communication overlap efficiency on
- a single-node DGX with NVLink
- multiple DGX nodes connected with IB

The error checking utilities of this repo (currently located at `src/error.cuh`) should be aligned with cuDF's error checking utilities (`cudf/utilities/error.hpp`). I believe this will allow more code reuse. For example,...

**Is your feature request related to a problem? Please describe.** I wish libcudf's Parquet reader reported the number of rows read per file. Consider the following example, ```c++ std::vector file_paths;...

feature request
libcudf
cuIO