Pods are considered ready while loading and cannot serve
Related: https://github.com/dragonflydb/dragonfly/issues/5881
Problem: Currently, using the standard Docker healthcheck for both liveness and readiness probes causes issues during rolling updates when:
- New pods pass health checks while still loading datasets into memory
- Kubernetes kills old pods before new pods are truly ready to serve traffic
- This results in service downtime during deployments (3+ minutes as mentioned in #5881)
Hi @vyavdoshenko I would like to be assigned on this issue cc @Abhra303
@miledxz We don't assign issues to comunity authors. You can provide your solution and mention this issue in PR.
@vyavdoshenko sounds good ! thank you for fast response !
I added a started draft PR here: https://github.com/dragonflydb/dragonfly-operator/pull/412 gentle ping for @Abhra303,
Im open for any feedback, the goal was to do some initial work to see am I on good track
guys thank you for your time !