Jakub Homola

10 issues by Jakub Homola

Hello, this is a minor issue, but it is quite inconvenient -- one needs to cast the first parameter of `hipMallocPitch` to `void**` in HIP. In CUDA (`cudaMallocPitch`), the cast...

Under Investigation

If I understand correctly, with the new rocm-4.5.x branches, this HIP repository contains only several things common to both NVIDIA and AMD platforms. Then there is the [hipamd](https://github.com/ROCm-Developer-Tools/hipamd) repository, which...

Consider the following HIP program: ``` #include #include __global__ void my_kernel(int * data_in, int * data_out) { int idx = blockIdx.x * blockDim.x + threadIdx.x; data_out[idx] = __shfl_down(data_in[idx], 16); }...

Trying to compile the HIP program ``` #include __global__ void my_kernel() { extern __shared__ int dyn_shmem[]; } int main() { int dyn_shmem_size = 64; hipLaunchKernelGGL(my_kernel, 4, 32, dyn_shmem_size, 0); hipDeviceSynchronize();...

The values of warpSize read from the hipDeviceProps_t variable and the kernel builtin variable warpSize are different, which is very unexpected. Consider the following HIP program: ``` #include #include __global__...

The following example line of code ``` hipMalloc(&d_x, count * sizeof(float)); ``` fails to compile (using g++ 9.4.0) with error ``` saxpy.hip.cpp:46:15: error: invalid conversion from ‘float**’ to ‘void**’ [-fpermissive]...
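The conversion error in the snippet above can be reproduced without any GPU toolchain. C++ does not implicitly convert `float**` to `void**`, so a C-style allocator that takes `void**` rejects `&d_x` unless the caller casts. The sketch below is a minimal host-only illustration under stated assumptions: `fake_malloc` and `demo_typed_alloc` are hypothetical names and not part of HIP, and the templated overload is only one possible way to keep the cast in a single place.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Hypothetical stand-in for a C-style allocator that takes void**.
// This is NOT the real HIP API. Returns 0 on success, 1 on failure.
int fake_malloc(void** ptr, std::size_t size) {
    *ptr = std::malloc(size);
    return (*ptr != nullptr) ? 0 : 1;
}

// Typed overload (an assumption for illustration): accepts any T** and
// performs the cast to void** once, so call sites need no cast.
template <typename T>
int fake_malloc(T** ptr, std::size_t size) {
    // The argument is already void**, so the non-template overload is chosen here.
    return fake_malloc(reinterpret_cast<void**>(ptr), size);
}

// Allocates a float buffer through the typed overload, writes one element,
// reads it back, and frees the buffer.
bool demo_typed_alloc() {
    float* d_x = nullptr;
    const std::size_t count = 8;
    if (fake_malloc(&d_x, count * sizeof(float)) != 0) {
        return false;
    }
    assert(d_x != nullptr);
    d_x[0] = 1.0f;
    const bool ok = (d_x[0] == 1.0f);
    std::free(d_x);
    return ok;
}
```

With only the `void**` overload present, the call `fake_malloc(&d_x, ...)` fails to compile with the same kind of "invalid conversion" diagnostic quoted in the issue. Adding the template makes the call compile unchanged.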

### Is your feature request related to a problem? Please describe. I am querying the required workspace buffer size for manual workspace management using `rocblas_start_device_memory_size_query` and `rocblas_stop_device_memory_size_query`. When calling the...

Hi, can we pass the same buffer as input and output argument of oneMKL functions? Is "buffer aliasing" allowed? E.g. the function `oneapi::mkl::sparse::trsv`. It takes multiple buffers as arguments, namely...

The `oneapi::mkl::sparse::release_matrix_handle` function actually takes `oneapi::mkl::sparse::matrix_handle_t *` as its first argument (according to icpx compiler errors), not `oneapi::mkl::sparse::matrix_handle_t`, as written in the syntax section of the page. docs version 1.2-rev-1...

Hello, I am trying to trace my AMDGPU application with Omnitrace, but I am running into an issue with a host callback function. Using `hipStreamAddCallback` I submit a host function...