
Dynamic RDMA device allocation

Open weizhoublue opened this issue 1 year ago • 0 comments

What would you like to be added?

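Use cases under consideration:
(1) Simplify configuration.
(2) The master NIC name is not consistent across hosts.
(3) With fewer than 8 cards, allocate devices dynamically according to GPU affinity.
(4) In a network with multiple RDMA domains where subnet planning differs between domains, assign the IP subnet dynamically according to the node the pod is scheduled to. This should be solved by IPAM and subnet wildcards, not by choosing one of several multus instance names.

IPAM check: if the master NIC has an IP address, that IP must belong to the subnet for the subnet to be usable. See https://github.com/spidernet-io/spiderpool/blob/main/docs/usage/network-topology-zh_CN.md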

apiVersion: spiderpool.spidernet.io/v2beta1
kind: SpiderMultusConfig
metadata:
  name: gpu1-sriov
  namespace: spiderpool
spec:
  cniType: ib-sriov
  ibsriov:
    resourceName: spidernet.io/gpu1sriov
    rdmaIsolation: true
    ippools:
      ipv4: ["gpu1-*"]  # or: ipv4: ["gpu1-block1", "gpu1-block2"]
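
The selection rule proposed above (wildcard pool names plus the master NIC subnet check) can be sketched roughly as follows. This is only an illustration of the idea, not Spiderpool's implementation: the pool names, subnets, and the `select_pools` helper are hypothetical, and Python's `fnmatch` stands in for whatever wildcard matching the IPAM would actually use.

```python
from fnmatch import fnmatchcase
from ipaddress import ip_address, ip_network

# Hypothetical pool inventory: pool name -> subnet (illustrative values only).
POOLS = {
    "gpu1-block1": "172.16.11.0/24",
    "gpu1-block2": "172.16.12.0/24",
    "gpu2-block1": "172.16.21.0/24",
}


def select_pools(patterns, pools, master_ip=None):
    """Return the names of pools that match any wildcard pattern.

    If the master NIC has an IP address (master_ip is not None), a pool is
    kept only when its subnet contains that address.
    """
    selected = []
    for name, subnet in pools.items():
        # Step 1: the pool name must match one of the configured patterns.
        if not any(fnmatchcase(name, pattern) for pattern in patterns):
            continue
        # Step 2: the master NIC IP, if present, must belong to the subnet.
        if master_ip is not None and ip_address(master_ip) not in ip_network(subnet):
            continue
        selected.append(name)
    return selected


if __name__ == "__main__":
    # Master NIC without an IP: every pool matching the wildcard is a candidate.
    print(select_pools(["gpu1-*"], POOLS))
    # Master NIC with an IP: only the pool whose subnet contains it remains.
    print(select_pools(["gpu1-*"], POOLS, master_ip="172.16.12.5"))
```

Under these assumptions, a single SpiderMultusConfig with `ipv4: ["gpu1-*"]` would resolve to a different pool on each node, depending on which subnet the node's master NIC sits in.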

Why is this needed?

No response

How to implement it (if possible)?

No response

Additional context

No response

weizhoublue avatar Dec 31 '24 12:12 weizhoublue