Binyang Li
Binyang Li
You can use `kubectl describe ds nvidia-device-plugin -n kube-system` to find the reason. The message means it is stuck when executing kubectl command
@suiguoxin @siaimes Any comments?
Can you run the script with `-vvvv`, this flag will print the verbose log which will help to debug
Any NFS related logs?
I believe there is only one SKU in a vc. So you can get the gpu count in a VC via `https://cluster-name/api/v2/virtual-clusters`
Can you paste the full kubelet configuration? You need to set --cpu-manager-policy="static". The default value is none. BTW PAI doesn't hack for kubelet, so you will get more help from...
Question about this: 1. Can all runtime plugins merge to `prerequisites`? If so we could deprecated runtime extra field. Make `prerequisites` the official way? 2. Maybe I can treat `prerequisites`...
Your job in waiting status. Can you click `Go to job event page` to get more information? Add please provide your job config yaml
Will restart the node solve this problem?
We are in maintenance mode. No new feature, just bug fix. @scarlett2018 should we add some notification for the project status in the home page?