cluster-api-provider-aws icon indicating copy to clipboard operation
cluster-api-provider-aws copied to clipboard

:sparkles: Initial dedicated hosts implementation

Open rvanderp3 opened this issue 5 months ago • 17 comments
trafficstars

/kind feature

What this PR does / why we need it:

Enable machines to be provisioned on to dedicated hosts.

Special notes for your reviewer: Resumes work started in cluster-api-provider-aws#5504.

Checklist:

  • [x] squashed commits
  • [ ] includes documentation
  • [x] includes emoji in title
  • [x] adds unit tests
  • [x] adds or updates e2e tests

Release note:

Adds the capability of provisioning machines on to dedicated hosts.

rvanderp3 avatar Jun 12 '25 12:06 rvanderp3

LGTM label has been added.

Git tree hash: eeb699a9acb60c59cfb1b6da396ab9b431428e5d

k8s-ci-robot avatar Jun 16 '25 18:06 k8s-ci-robot

/test ?

mtulio avatar Jun 16 '25 18:06 mtulio

@mtulio: The following commands are available to trigger required jobs:

/test pull-cluster-api-provider-aws-build
/test pull-cluster-api-provider-aws-build-docker
/test pull-cluster-api-provider-aws-e2e-blocking
/test pull-cluster-api-provider-aws-test
/test pull-cluster-api-provider-aws-verify

The following commands are available to trigger optional jobs:

/test pull-cluster-api-provider-aws-apidiff-main
/test pull-cluster-api-provider-aws-e2e
/test pull-cluster-api-provider-aws-e2e-clusterclass
/test pull-cluster-api-provider-aws-e2e-conformance
/test pull-cluster-api-provider-aws-e2e-conformance-with-ci-artifacts
/test pull-cluster-api-provider-aws-e2e-eks
/test pull-cluster-api-provider-aws-e2e-eks-gc
/test pull-cluster-api-provider-aws-e2e-eks-testing

Use /test all to run the following jobs that were automatically triggered:

pull-cluster-api-provider-aws-apidiff-main
pull-cluster-api-provider-aws-build
pull-cluster-api-provider-aws-build-docker
pull-cluster-api-provider-aws-e2e-blocking
pull-cluster-api-provider-aws-test
pull-cluster-api-provider-aws-verify

In response to this:

/test ?

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot avatar Jun 16 '25 18:06 k8s-ci-robot

/test pull-cluster-api-provider-aws-e2e

mtulio avatar Jun 16 '25 18:06 mtulio

/test pull-cluster-api-provider-aws-e2e-eks

mtulio avatar Jun 16 '25 18:06 mtulio

I manually canceled the e2e I triggered as e2e is still under investigation!

mtulio avatar Jun 16 '25 18:06 mtulio

/hold

rvanderp3 avatar Jun 16 '25 20:06 rvanderp3

/hold cancel

rvanderp3 avatar Jun 17 '25 19:06 rvanderp3

/test pull-cluster-api-provider-aws-e2e

rvanderp3 avatar Jun 17 '25 19:06 rvanderp3

/test pull-cluster-api-provider-aws-e2e-eks

rvanderp3 avatar Jun 17 '25 19:06 rvanderp3

/hold cancel

rvanderp3 avatar Jun 17 '25 19:06 rvanderp3

/test pull-cluster-api-provider-aws-e2e-blocking

rvanderp3 avatar Jun 17 '25 19:06 rvanderp3

/test pull-cluster-api-provider-aws-e2e

rvanderp3 avatar Jun 17 '25 20:06 rvanderp3

/hold

placing hold while I look in to related boskos changes. i'm noticing that dedicated hosts aren't being torn down. I think it's because they still have an instance on them when the defer is called that releases the host.

rvanderp3 avatar Jun 17 '25 20:06 rvanderp3

LGTM label has been added.

Git tree hash: 181d6760c750dc67860fde2635caf4a286e1436b

k8s-ci-robot avatar Jun 18 '25 15:06 k8s-ci-robot

/hold cancel

rvanderp3 avatar Jul 07 '25 15:07 rvanderp3

@richardcase would you be able to take a look at this one? thanks!

rvanderp3 avatar Jul 24 '25 19:07 rvanderp3

i see lint is failing, checking that out now

rvanderp3 avatar Jul 24 '25 20:07 rvanderp3

/hold

rvanderp3 avatar Jul 28 '25 19:07 rvanderp3

changes implemented due to underlying function signature changes. placing hold while testing.

rvanderp3 avatar Jul 28 '25 20:07 rvanderp3

/test ?

richardcase avatar Jul 29 '25 13:07 richardcase

@richardcase: The following commands are available to trigger required jobs:

/test pull-cluster-api-provider-aws-build
/test pull-cluster-api-provider-aws-build-docker
/test pull-cluster-api-provider-aws-e2e-blocking
/test pull-cluster-api-provider-aws-test
/test pull-cluster-api-provider-aws-verify

The following commands are available to trigger optional jobs:

/test pull-cluster-api-provider-aws-apidiff-main
/test pull-cluster-api-provider-aws-e2e
/test pull-cluster-api-provider-aws-e2e-clusterclass
/test pull-cluster-api-provider-aws-e2e-conformance
/test pull-cluster-api-provider-aws-e2e-conformance-with-ci-artifacts
/test pull-cluster-api-provider-aws-e2e-eks
/test pull-cluster-api-provider-aws-e2e-eks-gc
/test pull-cluster-api-provider-aws-e2e-eks-testing

Use /test all to run the following jobs that were automatically triggered:

pull-cluster-api-provider-aws-apidiff-main
pull-cluster-api-provider-aws-build
pull-cluster-api-provider-aws-build-docker
pull-cluster-api-provider-aws-e2e-blocking
pull-cluster-api-provider-aws-test
pull-cluster-api-provider-aws-verify

In response to this:

/test ?

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot avatar Jul 29 '25 13:07 k8s-ci-robot

/test pull-cluster-api-provider-aws-e2e-eks /test pull-cluster-api-provider-aws-e2e

richardcase avatar Jul 29 '25 13:07 richardcase

Looks good from my side, however, until the aws-janitor change merges lets hold this:

/hold

Also lets check a larger set of e2e's

richardcase avatar Jul 29 '25 13:07 richardcase

@rvanderp3 - looks like we will need to initially have the e2e test for the dedicated hosts disabled. I'll message you to discuss.

richardcase avatar Jul 29 '25 14:07 richardcase

/hold cancel

testing looks good on my side

rvanderp3 avatar Jul 29 '25 16:07 rvanderp3

@rvanderp3: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
pull-cluster-api-provider-aws-e2e c2cee8322f0a61e702229c0abce12c13bc4808d9 link false /test pull-cluster-api-provider-aws-e2e

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

k8s-ci-robot avatar Jul 29 '25 17:07 k8s-ci-robot

/retest-required

rvanderp3 avatar Jul 31 '25 15:07 rvanderp3

LGTM label has been added.

Git tree hash: 4a9d5bce713bd76208f054efb69d01026b413323

k8s-ci-robot avatar Jul 31 '25 20:07 k8s-ci-robot

/assign @richardcase

rvanderp3 avatar Aug 01 '25 17:08 rvanderp3