Binyang Li

Results 37 comments of Binyang Li

You can use `kubectl describe ds nvidia-device-plugin -n kube-system` to find the reason. The message means it is stuck when executing kubectl command

Can you run the script with `-vvvv`, this flag will print the verbose log which will help to debug

I believe there is only one SKU in a vc. So you can get the gpu count in a VC via `https://cluster-name/api/v2/virtual-clusters`

Can you paste the full kubelet configuration? You need to set --cpu-manager-policy="static". The default value is none. BTW PAI doesn't hack for kubelet, so you will get more help from...

Question about this: 1. Can all runtime plugins merge to `prerequisites`? If so we could deprecated runtime extra field. Make `prerequisites` the official way? 2. Maybe I can treat `prerequisites`...

Your job in waiting status. Can you click `Go to job event page` to get more information? Add please provide your job config yaml

Will restart the node solve this problem?

We are in maintenance mode. No new feature, just bug fix. @scarlett2018 should we add some notification for the project status in the home page?