njw1123 issues

Results 5 issues of


                                            njw1123

UCX Build Failure: Go Bindings Compilation Error – Cannot Find cuda and ucx Package Paths

env ``` NVIDIA-SMI 535.183.01 Driver Version: 535.183.01 CUDA Version: 12.2 ``` install ``` ./autogen.sh ./contrib/configure-release --prefix=/opt/ucx --enable-shared --disable-static --disable-doxygen-doc --enable-optimizations --enable-cma --enable-devel-headers --with-cuda=/usr/local/cuda --with-verbs --with-dm --enable-mt make -j 8 ```...

[Question]: Does it support one-sided GPU-to-GPU communication across nodes using GPUDirect RDMA?

Does the performance match expectations [only about one-fourth of the peak]?

My setup is a single server with 8 H20 GPUs connected via NVLink (NV18 topology). Each link provides about 26 GB/s, so the theoretical aggregate bandwidth is around 400 GB/s....

Bug

[question]: Does UCX support GPUDirect Async type communication?

When performing cross-node GPU communication, will UCX automatically choose the GPUDirect Async style of communication, or will it at most use only the GPUDirect RDMA type of communication?

njw1123

UCX Build Failure: Go Bindings Compilation Error – Cannot Find cuda and ucx Package Paths

[Question]: Does it support one-sided GPU-to-GPU communication across nodes using GPUDirect RDMA?

Does the performance match expectations [only about one-fourth of the peak]?

[question]: Does UCX support GPUDirect Async type communication?

What are the CUDA driver and hardware requirements for UCX version 1.18.0?