Raul Akhmetshin

47 comments by Raul Akhmetshin

Different network devices can be used for different data transfer operations. `mlx5_0` is the name of an IB network device, and it is the device used in your case. This information can be...

You can use the `ucx_info` tool. For example, `ucx_info -d | grep Device | sort --unique` will show you all devices that can be used by UCX. You can also use `ibstat...
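The pipeline above can be wrapped in a small helper that prints only the device names. This is a minimal sketch, not part of UCX: the function name `list_ucx_devices` is hypothetical, and it assumes that `ucx_info -d` reports each device on a line of the form `# Device: <name>`, which may differ between UCX versions.

```shell
# Hypothetical helper (not part of UCX): reads `ucx_info -d` output on stdin
# and prints each device name once.
# Assumption: device lines look like "#   Device: <name>".
list_ucx_devices() {
    grep 'Device:' | sed 's/.*Device:[[:space:]]*//' | sort -u
}

# Run against the real tool only if it is installed on this machine.
if command -v ucx_info >/dev/null 2>&1; then
    ucx_info -d | list_ucx_devices
fi
```

Stripping the `Device:` prefix before sorting means the same device listed under several transports appears once.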

> @brminich / @ivankochin / @rakhmets Can you help with restarting the failed tests? I don't expect the failures to be related to the PR. > > @brminich when time...

Should we revert 0e92c1d8fa749daa4c1b9ee4be6c807f4688eb75 in this PR to confirm the fix?

Please find the following conclusions based on the CI logs. The failed tests are those that contain a call to `nvmlInit`. The tests fail because they are aborted by the watchdog at the end...

Hi @G-Ragghianti, We merged PR https://github.com/openucx/ucx/pull/9475, which should fix the issue. We also pushed the changes to the v1.16.x branch (https://github.com/openucx/ucx/pull/9487).

The first compilation error was fixed by https://github.com/openucx/ucx/pull/8804

Could you please check whether the patch in https://github.com/openucx/ucx/pull/9795 fixes the second issue?