
[Bug] Second subscriber not triggered and retries not initiated when the first subscriber fails

Open kzcPo opened this issue 1 year ago • 0 comments

Describe the bug

I have set up several subscribers on the same trigger. When one subscriber fails, I expect processing to move on to the next one, and I expect failed notifications to be retried under certain conditions.

The configuration is as follows:

subscriptions: |
    # subscription for on-sync-status-unknown trigger notifications
    - recipients:
      - feishu
      triggers:
      - on-rollout-completed
    - recipients:
      - jenkins
      triggers:
      - on-rollout-completed

In practice, however, when the first subscriber (feishu) fails, the second subscriber (jenkins) is not triggered, and no retry is attempted. Processing appears to stop after the first failure.
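The expected behavior can be sketched roughly as follows. This is only an illustration in Python, not how Argo Rollouts or its notification engine is actually implemented; `notify_all`, `RateLimited`, and the retry parameters are hypothetical names chosen for the example. Each recipient is handled independently, and a rate-limited send (like the Feishu "too many request" response in the logs below) is retried with a growing delay before giving up.

```python
import time
from typing import Callable, Dict


class RateLimited(Exception):
    """Hypothetical error for a throttled webhook (e.g. Feishu code 9499)."""


def notify_all(
    recipients: Dict[str, Callable[[str], None]],
    message: str,
    max_attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, str]:
    """Send `message` to every recipient; one failure never blocks the rest."""
    results: Dict[str, str] = {}
    for name, send in recipients.items():
        for attempt in range(1, max_attempts + 1):
            try:
                send(message)
                results[name] = "ok"
                break
            except RateLimited:
                if attempt == max_attempts:
                    # Retries exhausted: record the failure and move on.
                    results[name] = "failed: rate limited"
                else:
                    # Exponential backoff: base_delay, 2*base_delay, ...
                    sleep(base_delay * 2 ** (attempt - 1))
            except Exception as exc:
                # Non-retryable error: record it and move on.
                results[name] = f"failed: {exc}"
                break
    return results
```

With this shape, a permanently throttled `feishu` sender would be retried up to `max_attempts` times, and `jenkins` would still receive the notification afterwards.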

Version

argo-rollouts:v1.6.0

Logs

time="2024-02-01T03:26:18Z" level=info msg="Start processing" resource=gray/message
time="2024-02-01T03:26:18Z" level=info msg="Trigger on-rollout-completed result: [{[0].2jmj7l5rSw0yVb_vlWAYkK_YBwk  [rollout-completed] false}]" resource=gray/message
time="2024-02-01T03:26:18Z" level=info msg="Processing completed" resource=gray/message
time="2024-02-01T03:26:18Z" level=error msg="notification error: request to {{\n    \"msg_type\": \"interactive\",\n    \"card\": {\n      \"config\": {\n        \"wide_screen_mode\": true\n      },\n      \"elements\": [\n        {\n          \"tag\": \"markdown\",\n          \"content\": \"**应用**:message\\n**环境**:gray\\n**状态**:Paused\\n**镜像**:[**fpm**]xxx.com/xxx/message:online_social_release_v1\"\n        }\n      ],\n      \"header\": {\n        \"template\": \"blue\",\n        \"title\": {\n          \"content\": \"message 更新通知 Rollout completed\",\n          \"tag\": \"plain_text\"\n        }\n      }\n    }\n}\n POST https://open.feishu.cn/open-apis/bot/v2/hook/xxxx feishu} has failed with error code 400 : {\"code\":9499,\"msg\":\"too many request\",\"data\":{}}"
time="2024-02-01T03:26:18Z" level=error msg="Notifications failed to send for eventReason RolloutCompleted with error: request to {{\n    \"msg_type\": \"interactive\",\n    \"card\": {\n      \"config\": {\n        \"wide_screen_mode\": true\n      },\n      \"elements\": [\n        {\n          \"tag\": \"markdown\",\n          \"content\": \"**应用**:message\\n**环境**:gray\\n**状态**:Paused\\n**镜像**:[**fpm**]xxx.com/xxx/message:online_social_release_v1\"\n        }\n      ],\n      \"header\": {\n        \"template\": \"blue\",\n        \"title\": {\n          \"content\": \"message 更新通知 Rollout completed\",\n          \"tag\": \"plain_text\"\n        }\n      }\n    }\n}\n POST https://open.feishu.cn/open-apis/bot/v2/hook/xxxx feishu} has failed with error code 400 : {\"code\":9499,\"msg\":\"too many request\",\"data\":{}}" event_reason=RolloutCompleted namespace=gray rollout=message
time="2024-02-01T03:26:18Z" level=info msg="Rollout completed update to revision 62 (585f786d7d): Completed blue-green update" event_reason=RolloutCompleted namespace=gray rollout=message

kzcPo, Feb 01 '24 06:02