contour Readiness probe failed

trafficstars

Hello,

We regularly have pods which are in error and which do not restart correctly. The only solution remains to delete the pod.

Warning Unhealthy 37m (x29 over 39h) kubelet Readiness probe failed: Get "http://10.200.6.5:8002/ready": context deadline exceeded (Client.Timeout exceeded while awaiting headers) Warning Unhealthy 32m (x12 over 39h) kubelet Liveness probe failed: Get "http://10.200.6.5:8090/healthz": context deadline exceeded (Client.Timeout exceeded while awaiting headers) Warning Unhealthy 2m1s (x1396 over 94m) kubelet Readiness probe failed: HTTP probe failed with statuscode: 503

The issue happen more and more, recreating the pod can't be a solution. How can I fix the issue ? Is there any logs I can check to have more information about what happen ?

Kubernetes version is 1.24.14 Contour version is 1.23.6

The same issue was already present with previous version of kubernetes and contour.

Regards.

Sep 21 '23 09:09 yannrenarddav

Hey @yannrenarddav! Thanks for opening your first issue. We appreciate your contribution and welcome you to our community! We are glad to have you here and to have your input on Contour. You can also join us on our mailing list and in our channel in the Kubernetes Slack Workspace

Sep 21 '23 09:09 github-actions[bot]

https://projectcontour.io/docs/1.26/troubleshooting/envoy-container-draining/ has some suggestions that may help

Sep 21 '23 14:09 skriss

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

After 60d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

Mark this Issue as fresh by commenting
Close this Issue
Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

Nov 21 '23 00:11 github-actions[bot]

I experience this issue on the envoy pods, too. Usually the envoy /ready endpoint returns a valid response within < 100ms, but sometimes its response time takes over 5 seconds. I am not sure why.

Dec 12 '23 15:12 PSanetra

https://github.com/projectcontour/contour/issues/4540 may be at play here

Dec 12 '23 15:12 skriss

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

After 60d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

Mark this Issue as fresh by commenting
Close this Issue
Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

Feb 20 '24 00:02 github-actions[bot]

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

After 60d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

Mark this Issue as fresh by commenting
Close this Issue
Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

Mar 27 '24 00:03 github-actions[bot]

contour contour copied to clipboard

Readiness probe failed

contour
contour copied to clipboard