serving icon indicating copy to clipboard operation
serving copied to clipboard

Kubernetes-based, scale-to-zero, request-driven compute

Results 353 serving issues
Sort by recently updated
recently updated
newest added

## Describe the feature ### Context In Vercel Functions, the `waitUntil()` method is a highly valuable feature. It allows developers to enqueue asynchronous tasks to be performed during the lifecycle...

kind/feature

## In what area(s)? /area networking ## Describe the feature Today to use GRPC or HTTP/2 for ingress into your service, you have to explicitly label the ports (see the...

kind/feature
area/networking
triage/accepted

## Describe the feature As K8s community is actively working on moving to resource claims it would be great to add support for it at some point (adding this for...

kind/feature
area/API
lifecycle/frozen

/area autoscale ## Describe the feature I would like to request to appropriately handle the Pods with [shcedulingGates](https://kubernetes.io/docs/concepts/scheduling-eviction/pod-scheduling-readiness/) (a.k.a gated pods). I'm wondering if the KPA should not scale deployment...

kind/feature
area/autoscale

Hi all, I installed Knative Serving with Kourier as networking option. I'm working in scraping the metric revision_request_latencies_bucket directly from the queue-proxy via address pod_ip:9091/metrics. As far as I can...

kind/question

Copy of https://github.com/knative-extensions/net-certmanager/issues/353 > I'd like to have a way to use a different cert-manager issuer for particular services. This could be achieved in one of two ways: > >...

triage/accepted

## In what area(s)? Remove the '> ' to select > /area autoscale Other classifications: > /kind good-first-issue ## What version of Knative? > main ## Expected Behavior While reconciling...

kind/bug
lifecycle/stale

## Describe the feature **Is this a bug report or feature request?** * Feature Request **What should the feature do:** Hi, I'm Harshita. I’m working with [CNCF and the Google...

kind/feature

## Describe the feature Some large language models are too big such that they don't fit in a single node. As a result, the serving workload should be represented as...

kind/feature
area/API
area/autoscale

For a knative revision, one can send the response headers and some data before the timeout. It is then seemingly possible to write data longer than the timeout, possibly forever....

kind/bug