Xigang Wang
Xigang Wang
> I think we should add an IgnoreNotFound for this: > > https://github.com/kubernetes-sigs/kueue/blob/0cd304dfaf87dc672b7aef79f72816de07281c65/pkg/controller/core/workload_controller.go#L339-L358 > > But it should be placed outside the function, not inside. WDYT? @mbobrovskyi Fixed. Please take...
@mimowo @tenzen-y For approve. Thanks!
@tenzen-y I'll check it now.
/cc @gabesaba @mimowo PTAL. Thanks!
@mimowo For approve.
@mimowo Take a look.
I understand that we should not simply handle problematic clusters by using continue. if a cluster fails, we should inform the users. cc @ikaven1024
I have a question: If a member cluster fails, should the pods in the failed cluster still respond to user requests? If they do respond, how can the client be...
/cc @RainbowMango @XiShanYongYe-Chang @ikaven1024 Let's take a look at this issue together.
@XiShanYongYe-Chang Unable to receive events from all member clusters through the aggregated apiserver, suspecting that the watch is blocked in `cache.watch()`. ``` clusters := c.getClusterNames() for i := range clusters...