Andrei Kvapil
Andrei Kvapil
This change increases the startupProbe’s failureThreshold from the default value of 3 to 10. With a probe every 5 seconds, that gives etcd up to 50 seconds to start. In...
``` I1101 06:50:28.044916 1 agent.go:440] updating node taints I1101 06:50:28.103291 1 agent.go:276] managing node taints failed: failed to update node taints: Operation cannot be fulfilled on nodes "srv1": the object...
#### What type of PR is this? /kind bug #### What this PR does / why we need it: Fixes `ConvertToVersion()` to prefer the object's GVK field over the first...
### Description Hello, we faced with the issue where cluster stopped working after one specific master node become offline. Two alive `ovn-central` couldn't create a cluster with no visible logs...
The discussion originally started in Helm repository https://github.com/helm/helm/issues/11376 I think this repo is better place to continue this feature consideration > The implementation of resource.Quantity in Kubernetes is quite complex,...
**Description** When node have problem with stucked operations, it can brake OpenNebula itself, eg it may be broken disk subsutem, disconneted target or some other problem. OpenNebula runs a lot...
By default, ZFS freezes I/O on ZVOLs when the underlying pool has failed. As a result, DRBD devices on top may get stuck: they can keep running and report UpToDate...
This is an alternative proposed solution to the issue described in https://github.com/vmware-tanzu/velero/pull/9447 ## Problem Tasks in Prepared phase were not included in periodic enqueue (only Accepted phase was included). This...
# Fix race condition in data upload controller concurrent limit check ## Summary Fixed a race condition where multiple DataUpload tasks were wasting time on expensive `GetExposed` operations even when...
**What steps did you take and what happened:** We observed multiple DataUpload tasks getting stuck in `Prepared` phase for extended periods (18+ hours) without progressing to `InProgress` phase. The tasks...