quzhongling

Results 2 comments of quzhongling

- 重新编译kubelet源码,替换原有kubelet。修改代码的地方是https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/cm/devicemanager/endpoint.go 中, dial方法添加参数grpc.WithDefaultCallOptions(grpc.MaxCallRecvMsgSize(1024\*1024\*16)) - 效果: ``` [root@jenkins ~]# kubectl inspect gpushare NAME IPADDRESS GPU0(Allocated/Total) GPU1(Allocated/Total) GPU2(Allocated/Total) GPU3(Allocated/Total) GPU4(Allocated/Total) GPU5(Allocated/Total) GPU6(Allocated/Total) GPU7(Allocated/Total) GPU Memory(MiB) 192.168.68.13 192.168.68.13 0/12066 0/12066 0/12066 0/12066...

获取单个pod使用的显存大小,可参考 https://github.com/zlingqu/nvidia-gpu-mem-monitor