plumbing icon indicating copy to clipboard operation
plumbing copied to clipboard

Some triggered Tekton jobs should have resource requests/limits

Open abayer opened this issue 2 years ago • 6 comments

The ones I notice right now are the plumbing-image-build and pull-pipeline-kind-k8s-v1-21-e2e PR PipelineRuns, and the build-and-push-test-runner cronjob triggered PipelineRun. I've seen the test-runner image builds cause OOMs on their nodes, and the plumbing-image-build one I'm looking at right now is at over 5gb memory used. The pull-pipeline-kind-k8s-v1-21-e2e pods that I've seen have ranged between 2 and 4gb memory used.

None of them (or any of the other Tekton PipelineRuns, for that matter) have any requests or limits configured, so they can end up on the same node, or a node with one of the other high memory usage pods always running in the cluster (i.e., prometheus and kafka) and cause problems. Given that dogfooding is hardcoded to 5 n1-standard-4s, with ~13gb allocatable memory, it's pretty easy for just a few of the high memory pods to end up on the same node and swamp it.

abayer avatar Jul 05 '22 19:07 abayer

Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale with a justification. Stale issues rot after an additional 30d of inactivity and eventually close. If this issue is safe to close now please do so with /close with a justification. If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle stale

Send feedback to tektoncd/plumbing.

tekton-robot avatar Nov 16 '22 18:11 tekton-robot

Stale issues rot after 30d of inactivity. Mark the issue as fresh with /remove-lifecycle rotten with a justification. Rotten issues close after an additional 30d of inactivity. If this issue is safe to close now please do so with /close with a justification. If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle rotten

Send feedback to tektoncd/plumbing.

tekton-robot avatar Dec 16 '22 18:12 tekton-robot

Rotten issues close after 30d of inactivity. Reopen the issue with /reopen with a justification. Mark the issue as fresh with /remove-lifecycle rotten with a justification. If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/close

Send feedback to tektoncd/plumbing.

tekton-robot avatar Jan 15 '23 18:01 tekton-robot

@tekton-robot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity. Reopen the issue with /reopen with a justification. Mark the issue as fresh with /remove-lifecycle rotten with a justification. If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/close

Send feedback to tektoncd/plumbing.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tekton-robot avatar Jan 15 '23 18:01 tekton-robot

/remove-lifecycle rotten

afrittoli avatar Feb 03 '23 12:02 afrittoli

/lifecycle frozen

afrittoli avatar Feb 03 '23 12:02 afrittoli