LiuXiang
LiuXiang
## Description Consolidate node annotation check if node annotation is nil, it will report err, because node annotation is optional.
because ssn.Tiers is not initiated and plugins has not OnSessionOpen() to register callback to jobValidFns in openSession() function, ssn.Tiers is not initiated, it's nil, and the plugins has not OnSessionOpen()...
### Ask your question I'm curious, why aren't there any health status metrics for every GPU card? I check the NVIDIA/go-dcgm has function like HealthCheckByGpuId(gpuId uint) https://github.com/NVIDIA/go-dcgm/blob/main/pkg/dcgm/api.go#L102-L105 , and if...
When I use volcano, the podgroup status sometimes is pending, because of proportion plugin reject at job enqueueable stage. And it record event about "queue resource quota insufficient", but I'm...