serving
Kubernetes-based, scale-to-zero, request-driven compute
This brings back "change private K8s service to be a headless service (#15170)" with some additional changes. When we set 'ClusterIP: None' on the private service the activator can't perform...
Changes - change private K8s service to be a headless service - test passthrough with autoscaler
## Context Istio provides 2 mechanisms for customising the spec of the `istio-proxy` sidecar container. Limited customisation is possible via annotations (e.g. `sidecar.istio.io/proxyCPU`) but you can also include a container...
## Ask your question here: We use the Knative and KServe projects in our product to provide inference services with auto-scaling. In some cases, we need to pause to the auto-scaling...
Hello everyone, I'm working on creating an _OAuth 2.0_ service using _Kubeflow, Istio, and KServe_. As part of this setup, I need to add a custom response header to my...
## In what area(s)? /area API ## Describe the feature Adding support for the "hostPath" volume type in Knative Serving would allow users to mount directories from the host node's...
Fixes #15487 ## Proposed Changes This changes the Revision reconciler to contain a code path that changes the ContainerHealthy condition from False to True as the old code path is...
## Ask your question here: We have a Kubernetes cluster with many inference services. After all the inference services were restarted, we noticed the istio-proxy container in activator pods were...
## In what area(s)? /area autoscale ## Describe the feature Although Knative autoscaling can maintain a number of minimum replicas per revision, I think this is only limited to actions...
## Ask your question here: Hi. I'm working on a small POC to create some Knative Services. The image I'm providing for the pod currently contains a small Flask app...