k8s-rdma-shared-dev-plugin
k8s-rdma-shared-dev-plugin copied to clipboard
Version: v1.3.2 RDMA device plugin log: As can be seen from the log, when kubelet restart, it triggers context canceled and restart will block because channel size is 0, context...
I have 4 InfiniBand (IB) network cards on my server, and I have shared one of them with the following configuration: ```yaml rdmaSharedDevicePlugin: deploy: true resources: - name: rdma_shared_device_a ifNames:...
Hi, We deploy k8s-rdma-shared-dev-plugin (`artprod.dev.bloomberg.com/ds/yweng14/nvidia/cloud-native/k8s-rdma-shared-dev-plugin:v1.3.2`) on our clusters and find that the the socket (ib.sock) is not recreated. We print some debugging message in the `Restart()` function and find it...
rdmaSharedDevicePlugin: deploy: true resources: - name: rdma_shared_device_a ifNames: [ibs10, ibs11, ibs18, ibs19] I have deployed the network-operator using the configuration shown above, and I see that on the node it...
We should improve the projects README. the general way to use it with k8s is utilizing secondary network CNI such as macvlan or ipoib (or any CNI essentially can create...
previously this was done in travis CI.
My server environment will use connectx-7 vpi and intel 810-XXV nic card. So i need to register intel nic into rdma resource pool. Is it possible to use intel nic?...
my problem is same to https://github.com/Mellanox/k8s-rdma-shared-dev-plugin/issues/72. when I run ib_write_bw in pod, I get a error as follows: ``` [root@mofed-test-cx5-bond-pod2 /]# ib_write_bw -d mlx5_0 -F --report_gbits ************************************ * Waiting for...
to resolve issue #96
We found that the plugin show that the num of devices is 0. Then I started to position why it is 0. 1. I checked the log of the plugin,...