aws-node-termination-handler

Fail the app on missing queue

Open LiorRabin opened this issue 2 years ago • 2 comments

Describe the feature: Fail the app on a missing queue if queue mode is enabled.

Is the feature request related to a problem? We're using NTH with enableSqsTerminationDraining set to true. If we don't specify the queueURL correctly (or the queue does not exist in AWS), the application logs a warning but stays healthy. Why not fail the app in this case?
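To make the request concrete, here is a minimal Go sketch of a fail-fast startup check. It is not NTH's code: `queueChecker`, `fakeChecker`, `validateQueue`, and the example URLs are all hypothetical names introduced for illustration, and the fake stands in for a real SQS lookup.

```go
package main

import (
	"errors"
	"fmt"
)

// queueChecker is a hypothetical abstraction over "does this queue exist?".
// A real implementation would wrap an SQS API call (for example
// GetQueueAttributes); that wiring is assumed and not shown here.
type queueChecker interface {
	QueueExists(queueURL string) (bool, error)
}

// fakeChecker simulates AWS with a fixed set of known queue URLs.
type fakeChecker struct{ known map[string]bool }

func (f fakeChecker) QueueExists(queueURL string) (bool, error) {
	return f.known[queueURL], nil
}

var errQueueMissing = errors.New("configured SQS queue does not exist")

// validateQueue returns an error when SQS mode is enabled but the queue
// URL is empty or does not resolve to an existing queue.
func validateQueue(c queueChecker, sqsEnabled bool, queueURL string) error {
	if !sqsEnabled {
		return nil // nothing to validate when queue mode is off
	}
	if queueURL == "" {
		return fmt.Errorf("queue URL is empty: %w", errQueueMissing)
	}
	exists, err := c.QueueExists(queueURL)
	if err != nil {
		return fmt.Errorf("checking queue %q: %w", queueURL, err)
	}
	if !exists {
		return fmt.Errorf("queue %q: %w", queueURL, errQueueMissing)
	}
	return nil
}

func main() {
	c := fakeChecker{known: map[string]bool{"https://sqs.example/good": true}}
	for _, url := range []string{"https://sqs.example/good", "https://sqs.example/bogus"} {
		if err := validateQueue(c, true, url); err != nil {
			// A real program would exit non-zero here instead of printing.
			fmt.Println("startup check failed:", err)
			continue
		}
		fmt.Println("startup check passed:", url)
	}
}
```

Running a check like this once at startup, before the monitor loop begins, would surface a misconfigured queue as an immediate failed start.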

LiorRabin Oct 03 '23 05:10

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. If you want this issue to never become stale, please ask a maintainer to apply the "stalebot-ignore" label.

github-actions[bot] Nov 02 '23 17:11

Hi @LiorRabin

I deployed NTH with a bogus SQS queue URL, and the logs indicate that NTH did shut down:

2024/01/29 19:28:42 INF Started monitoring for events event_type=SQS_MONITOR
2024/01/29 19:28:44 WRN There was a problem monitoring for events error="AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist." event_type=SQS_MONITOR
2024/01/29 19:28:46 WRN There was a problem monitoring for events error="AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist." event_type=SQS_MONITOR
2024/01/29 19:28:48 WRN There was a problem monitoring for events error="AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist." event_type=SQS_MONITOR
2024/01/29 19:28:50 WRN There was a problem monitoring for events error="AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist." event_type=SQS_MONITOR
2024/01/29 19:28:50 WRN Stopping NTH - Duplicate Error Threshold hit.
panic: AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist.

I think the Pod's restartPolicy is causing the NTH container to be restarted, which may be why the application appears to stay healthy.
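The pattern visible in the log above (repeated identical warnings, then "Duplicate Error Threshold hit") can be sketched roughly as follows. This is an illustration, not NTH's actual source: `duplicateTracker`, `pollsUntilStop`, and `errThrottled` are hypothetical names, the threshold of 3 is an assumption chosen to match the four warnings in the log, and resetting the count on a successful poll is also an assumption.

```go
package main

import (
	"errors"
	"fmt"
)

var (
	errNoQueue   = errors.New("AWS.SimpleQueueService.NonExistentQueue: The specified queue does not exist.")
	errThrottled = errors.New("hypothetical transient error")
)

// duplicateTracker counts how many times in a row the same error message
// has repeated after its first occurrence.
type duplicateTracker struct {
	threshold int    // consecutive duplicates tolerated before stopping
	last      string // message of the most recent error
	dupes     int    // duplicates of last seen so far
}

// observe records one poll result and reports whether the threshold is hit.
func (d *duplicateTracker) observe(err error) bool {
	if err == nil {
		d.last, d.dupes = "", 0 // assumed: a successful poll resets the count
		return false
	}
	if msg := err.Error(); msg == d.last {
		d.dupes++
	} else {
		d.last, d.dupes = msg, 0 // a different error starts a new streak
	}
	return d.dupes >= d.threshold
}

// pollsUntilStop replays poll results and returns the 1-based poll number
// at which the threshold is hit, or 0 if it is never hit.
func pollsUntilStop(threshold int, results []error) int {
	d := &duplicateTracker{threshold: threshold}
	for i, err := range results {
		if d.observe(err) {
			return i + 1
		}
	}
	return 0
}

func main() {
	results := []error{errNoQueue, errNoQueue, errNoQueue, errNoQueue}
	if n := pollsUntilStop(3, results); n > 0 {
		// In the log above the process panics at this point; the sketch
		// only reports it so that the example exits cleanly.
		fmt.Printf("stopping after poll %d: %v\n", n, errNoQueue)
	}
}
```

If the process exits this way, a Pod whose restartPolicy allows restarts would start the container again, and the same cycle would repeat.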

cjerad Jan 29 '24 20:01

We were not able to replicate the issue, and the logs show that NTH did shut down, so we are closing this issue. Thanks!

LikithaVemulapalli May 23 '24 14:05