@SimonCqk please fix the source code check, thanks.
Thanks for your contributions! @Sakuralbj Could you please add a guide and samples so that others can understand how to use it? Thanks.
Thanks @YuxiJin-tobeyjin @monstercy, I think you are right. It should be handled. I will take a look at this later. If you have solutions, your contributions are welcome.
This relies on the fact that Pods are bound sequentially in the scheduler, and a lock is already taken during the bind process, so the ordering is guaranteed.
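As a rough illustration only (the names here are made up, not the actual extender code), the idea is that the bind handler holds a lock for the whole allocation, so two bind requests can never interleave and each one sees the node state left by the previous one:

```go
package main

import (
	"fmt"
	"sync"
)

// nodeState is a made-up stand-in for the per-node GPU memory bookkeeping.
type nodeState struct {
	mu      sync.Mutex
	freeMiB int
}

// Bind holds the lock for the whole allocation, so concurrent bind
// requests are serialized and cannot observe a half-updated state.
func (n *nodeState) Bind(pod string, requestMiB int) error {
	n.mu.Lock()
	defer n.mu.Unlock()
	if requestMiB > n.freeMiB {
		return fmt.Errorf("not enough GPU memory left for %s", pod)
	}
	n.freeMiB -= requestMiB
	return nil
}

func main() {
	n := &nodeState{freeMiB: 7618}
	fmt.Println(n.Bind("pod-a", 4096)) // <nil>
	fmt.Println(n.Bind("pod-b", 4096)) // error: not enough GPU memory left
}
```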
I guess it's because Nvidia's customized Kubernetes only has the [v1alpha version](https://github.com/NVIDIA/kubernetes/blob/master/pkg/kubelet/apis/deviceplugin/v1alpha/api.proto). It's Nvidia's own implementation and is not compatible with the Kubernetes community version. I think it will be fine if you try...
Looks like the issue is `Failed due to invalid configuration: no server found for cluster "local"`. Please check your kube config. How about running `kubectl get nodes`?
Please try to ping registry.cn-hangzhou.aliyuncs.com. And check the result:

```
ping registry.cn-hangzhou.aliyuncs.com
PING registry.cn-hangzhou.aliyuncs.com (120.55.105.209) 56(84) bytes of data.
64 bytes from 120.55.105.209 (120.55.105.209): icmp_seq=1 ttl=94 time=35.2 ms
64 bytes...
```
Yes, if you want to use 7618 MiB, you should change the unit to `MiB` in https://github.com/AliyunContainerService/gpushare-device-plugin/blob/master/device-plugin-ds.yaml#L28.
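To make the effect of the unit concrete, here is a rough sketch (a hypothetical helper, not the plugin's actual code) of how the advertised gpu-mem quantity changes with the unit, assuming GiB granularity simply truncates:

```go
package main

import "fmt"

// gpuMemCapacity is a hypothetical helper showing how the advertised
// gpu-mem quantity depends on the configured memory unit:
// a 7618 MiB card would be reported as 7 with GiB, but as 7618 with MiB.
func gpuMemCapacity(totalMiB int, unit string) int {
	if unit == "GiB" {
		return totalMiB / 1024 // integer division drops the remainder
	}
	return totalMiB // MiB keeps per-MiB granularity
}

func main() {
	fmt.Println(gpuMemCapacity(7618, "GiB")) // 7
	fmt.Println(gpuMemCapacity(7618, "MiB")) // 7618
}
```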
I think it's due to the gRPC max message size. If you'd like to fix it, the change should be similar to https://github.com/helm/helm/pull/3514.
I mean you can increase the default gRPC max message size to 16 MB in the source code of the kubelet and the device plugin, compile them into new binaries, and then deploy. I...
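As a rough sketch of what that change could look like (the package and function names here are placeholders; check the actual kubelet and device plugin sources for where the server and dial options are built):

```go
package msgsize

import "google.golang.org/grpc"

// 16 MB instead of gRPC's 4 MB default receive limit.
const maxMsgSize = 16 * 1024 * 1024

// ServerOptions would be passed to grpc.NewServer on the device plugin side.
func ServerOptions() []grpc.ServerOption {
	return []grpc.ServerOption{
		grpc.MaxRecvMsgSize(maxMsgSize),
		grpc.MaxSendMsgSize(maxMsgSize),
	}
}

// DialOptions would be passed to grpc.Dial where the kubelet connects to the plugin.
func DialOptions() []grpc.DialOption {
	return []grpc.DialOption{
		grpc.WithDefaultCallOptions(
			grpc.MaxCallRecvMsgSize(maxMsgSize),
			grpc.MaxCallSendMsgSize(maxMsgSize),
		),
	}
}
```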