[Bug] Karpenter setting interruptionQueueName renamed to interruptionQueue

Open pstast opened this issue 10 months ago • 7 comments

What were you trying to accomplish?

Use Karpenter with Spot Instances.

What happened?

Instances were terminated abruptly, without prior notice. Karpenter was not receiving spot interruption notifications from the AWS events queue.

How to reproduce it?

Used this Karpenter configuration in ClusterConfig:

karpenter:
  version: '0.35.2'
  createServiceAccount: true
  withSpotInterruptionQueue: true

Cause

During Karpenter installation, eksctl inserts a settings.interruptionQueueName value into the Helm chart values. However, this Helm value has been renamed to settings.interruptionQueue (without the trailing "Name").
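
To make the mismatch concrete, here is a rough sketch of the two shapes of Helm values. The queue name `example-queue` is a hypothetical placeholder, not a value taken from eksctl or from this cluster.

```yaml
# Values as eksctl currently sets them (old key name).
settings:
  interruptionQueueName: example-queue  # hypothetical queue name
---
# Values as the renamed chart setting expects them (new key name).
settings:
  interruptionQueue: example-queue  # hypothetical queue name
```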

See the current Helm chart values: https://github.com/aws/karpenter-provider-aws/blob/main/charts/karpenter/values.yaml#L194

The change was introduced in the following pull request merged 3 months ago: https://github.com/aws/karpenter-provider-aws/pull/4840

As a result, the INTERRUPTION_QUEUE environment variable is not passed to the Karpenter pods and the interruption queue is not enabled. After the fix (renaming the value), INTERRUPTION_QUEUE appears in the pod environment variables.
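
For illustration, once the renamed value is set, the Karpenter container spec would be expected to carry an entry roughly like the fragment below. This is a sketch only: the surrounding field layout is assumed from a standard Kubernetes pod spec rather than copied from the rendered chart, and `example-queue` is a hypothetical queue name.

```yaml
# Fragment of the Karpenter container spec (assumed layout).
env:
  - name: INTERRUPTION_QUEUE
    value: example-queue  # hypothetical queue name
```

With the old key, no such entry shows up in the pod environment, which matches the behavior described above.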

Suggested fix

Rename settings.interruptionQueueName to settings.interruptionQueue in user-provided Helm values when installing Karpenter.

pstast • Apr 02 '24 11:04

Hello pstast :wave: Thank you for opening an issue in eksctl project. The team will review the issue and aim to respond within 1-5 business days. Meanwhile, please read about the Contribution and Code of Conduct guidelines here. You can find out more information about eksctl on our website

github-actions[bot] • Apr 02 '24 11:04

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] • May 03 '24 01:05

There is no solution yet (at least to my knowledge)

ralf-cestusio • May 07 '24 18:05

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] • Jun 07 '24 01:06

This issue was closed because it has been stalled for 5 days with no activity.

github-actions[bot] • Jun 13 '24 01:06

This should not be closed.

worldofgeese • Jun 13 '24 04:06

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] • Aug 18 '24 01:08