
calico-node keeps restarting

Open abbasahmed40 opened this issue 11 months ago • 3 comments

These are the logs for calico-node:

2024-03-14 17:12:31.490 [INFO][67] felix/status_combiner.go 98: Reporting combined status. id=proto.WorkloadEndpointID{OrchestratorId:"k8s", WorkloadId:"kube-system/calico-kube-controllers-5bffccb9c6-s8pdn", EndpointId:"eth0"} status="up"
2024-03-14 17:12:32.317 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:33.382 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:34.453 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:35.521 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:36.582 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:37.647 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:38.714 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:39.778 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:40.845 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:41.919 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host
2024-03-14 17:12:42.984 [ERROR][67] felix/health.go 392: Health endpoint failed, trying to restart it... error=listen tcp: lookup localhost on 8.8.8.8:53: no such host

abbasahmed40 avatar Mar 14 '24 17:03 abbasahmed40

I am using Calico version 3.26 and Kubernetes version 1.27.11.

abbasahmed40 avatar Mar 14 '24 17:03 abbasahmed40

I'm experiencing the exact same issue. Is there any workaround?

petercorrea avatar Mar 14 '24 19:03 petercorrea

This looks like a problem with your pod's domain name resolution configuration.

Calico uses a health endpoint within the container that attempts to talk to the local Felix daemon using localhost, which means it needs to resolve localhost to the pod's local interface. Normally I'd expect this to happen without DNS in the loop, e.g., via the container's /etc/hosts. However, your logs show the lookup for localhost going out to 8.8.8.8, so it's possible that your cluster is set up in such a way that this isn't happening.

https://kubernetes.io/docs/tasks/administer-cluster/dns-custom-nameservers/
https://kubernetes.io/docs/tasks/network/customize-hosts-file-for-pods/
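One rough way to check this is sketched below. It is only a sketch: it assumes Python 3 is available wherever you run it (the calico-node image may not ship it, so running it on the node itself may be the practical option), and the function names `hosts_maps_localhost` and `check_localhost` are made up for illustration. It looks for a localhost entry in /etc/hosts and then asks the system resolver for localhost. Python's resolver is not the one Felix uses, so treat the result as a hint rather than proof.

```python
import socket


def hosts_maps_localhost(hosts_text: str) -> bool:
    """Return True if hosts-file text maps the name 'localhost' to an address."""
    for line in hosts_text.splitlines():
        # Drop trailing comments, then split into address + hostnames.
        entry = line.split("#", 1)[0].lower().split()
        # entry[0] is the address; the remaining fields are names/aliases.
        if len(entry) >= 2 and "localhost" in entry[1:]:
            return True
    return False


def check_localhost(hosts_path: str = "/etc/hosts") -> None:
    """Print whether the hosts file and the system resolver know 'localhost'."""
    try:
        with open(hosts_path) as f:
            in_hosts = hosts_maps_localhost(f.read())
    except OSError:
        # A missing or unreadable hosts file counts as "no mapping".
        in_hosts = False
    print(f"{hosts_path} maps localhost: {in_hosts}")

    try:
        print("localhost resolves to:", socket.gethostbyname("localhost"))
    except socket.gaierror as exc:
        print("localhost did not resolve:", exc)


if __name__ == "__main__":
    check_localhost()
```

If the hosts file has no localhost entry and the lookup fails, that would be consistent with the `lookup localhost on 8.8.8.8:53: no such host` error in the logs above.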

Have you customized DNS or host resolution at all on the Calico pods?

caseydavenport avatar Mar 19 '24 16:03 caseydavenport

@abbasahmed40 @petercorrea any update?

tomastigera avatar Apr 02 '24 16:04 tomastigera

@caseydavenport thanks, it's working now.

abbasahmed40 avatar Apr 02 '24 20:04 abbasahmed40