cluster-api-provider-aws Requeue after EKS cluster creation request is sent / cluster transitioned to UPDATING

trafficstars

/kind refactor

Describe the solution you'd like Currently, CAPA does not requeue for EKS cluster creation / update, instead, it continues on reconciliation.

Creation: It waits for cluster to be active (https://github.com/kubernetes-sigs/cluster-api-provider-aws/blob/main/pkg/cloud/services/eks/cluster.go#L58-L79)
Update: It continues on reconciliation for example for cluster version update (https://github.com/kubernetes-sigs/cluster-api-provider-aws/blob/4274a5ab5c9dc391045840d6e45e17fe0cbab3dc/pkg/cloud/services/eks/cluster.go#L531)

CAPA should requeue (probably after a duration) after creation request is sent for creation / cluster transitioned to UPDATING for update.

Environment:

Cluster-api-provider-aws version: v1.0.0

Nov 15 '21 19:11 richardchen-db

@richardchen-db: This issue is currently awaiting triage.

If CAPA/CAPI contributors determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Nov 15 '21 19:11 k8s-ci-robot

/area provider/eks /priority important-soon /help

Nov 16 '21 08:11 richardcase

@richardcase: This request has been marked as needing help from a contributor.

Guidelines

Please ensure that the issue body includes answers to the following questions:

Why are we solving this issue?
To address this issue, are there any code changes? If there are code changes, what needs to be done in the code and what places can the assignee treat as reference points?
Does this issue have zero to low barrier of entry?
How can the assignee reach out to you for help?

For more details on the requirements of such an issue, please see here and ensure that they are met.

If this request no longer meets these requirements, the label can be removed by commenting with the /remove-help command.

In response to this:

/area provider/eks /priority important-soon /help

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Nov 16 '21 08:11 k8s-ci-robot

To give some extra detail/background:

creation - the original implementation of EKS in CAPA made a lot of use of the Wait functions in the AWS SDK to pause (i.e. wait) reconciliation until the EKS cluster reached the desired status. After discussions over a number of different issues/office hours we decided it would be better to requeue using reconcile.Result{RequeueAfter:} so that we don't block the reconciliation loop.
update - I consider this a bug as we wait until the status of the EKS cluster is UPDATING and then continue on with the reconciling things like encryption, tags, oidc. Really, we should requeue and continue to requeue until the cluster changes from UPDATING to a new state (ACTIVE if successful).

I think we should split this into 2 separate issues as we have a refactor and a bug here.

Nov 16 '21 09:11 richardcase

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

Apr 24 '22 18:04 k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

May 24 '22 19:05 k8s-triage-robot

/remove-lifecycle rotten

Jun 02 '22 17:06 sedefsavas

/unassign

Jul 08 '22 21:07 richardcase

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

Oct 06 '22 22:10 k8s-triage-robot

/remove-lifecycle stale

Oct 10 '22 10:10 richardcase

This issue is labeled with priority/important-soon but has not been updated in over 90 days, and should be re-triaged. Important-soon issues must be staffed and worked on either currently, or very soon, ideally in time for the next release.

You can:

Confirm that this issue is still relevant with /triage accepted (org members only)
Deprioritize it with /priority important-longterm or /priority backlog
Close this issue with /close

For more details on the triage process, see https://www.kubernetes.dev/docs/guide/issue-triage/

/remove-triage accepted

Mar 05 '23 17:03 k8s-triage-robot

From office hours 2023-04-03:

/triage accepted /help

Apr 03 '23 16:04 richardcase

This issue is labeled with priority/important-soon but has not been updated in over 90 days, and should be re-triaged. Important-soon issues must be staffed and worked on either currently, or very soon, ideally in time for the next release.

You can:

Confirm that this issue is still relevant with /triage accepted (org members only)
Deprioritize it with /priority important-longterm or /priority backlog
Close this issue with /close

For more details on the triage process, see https://www.kubernetes.dev/docs/guide/issue-triage/

/remove-triage accepted

Jul 02 '23 17:07 k8s-triage-robot

cluster-api-provider-aws cluster-api-provider-aws copied to clipboard

Requeue after EKS cluster creation request is sent / cluster transitioned to UPDATING

Guidelines

cluster-api-provider-aws
cluster-api-provider-aws copied to clipboard