
Add configurable probe timeout for activator to support high-latency environments

Open bindrad opened this issue 1 month ago • 2 comments

What version of Knative?

knative-v1.20.0-28-gff5c15ac5-dirty

Expected Behavior

Activator should be able to successfully probe queue-proxy health endpoints in environments with higher network latency.

Actual Behavior

Activator health check probes time out with "context deadline exceeded" errors when network latency exceeds the hardcoded 300ms probe timeout. This is particularly problematic in service mesh environments, where Envoy adds latency to the request path.

Steps to Reproduce the Problem

  1. Deploy Knative Serving with a service mesh (e.g., Istio) in an environment with high network latency.
  2. Deploy a Knative service.
  3. Observe activator logs showing probe timeout errors when network latency exceeds 300ms.
  4. Check curl response times to the queue-proxy /healthz endpoint: they exceed 300ms due to mesh overhead.

bindrad, Nov 20 '25 10:11

Hi @bindrad,

could you explain what your setup looks like if you have >300ms latency inside your Kubernetes cluster? 300ms is basically a round trip to the other side of the Earth, so it seems there are major issues inside your cluster if this is actually the case, even with a service mesh in place.

Nevertheless, I'm also not a big fan of hard-coded defaults (even if 300ms is IMO plenty). I'll give your PR a look.

linkvt, Nov 24 '25 13:11

/assign @bindrad

linkvt, Nov 24 '25 13:11