helm-charts icon indicating copy to clipboard operation
helm-charts copied to clipboard

HTTP 504 response from charts.deliveryhero.io

Open hkan opened this issue 2 years ago • 22 comments

Hello,

The ChartMuseum instance you're running on charts.deliveryhero.io has been responding with HTTP 504 for pretty much all endpoints since earlier today.

I'm not sure if this is the correct place to mention this, I didn't know where else to post.

hkan avatar Dec 13 '23 11:12 hkan

Hey @hkan, I've applied a short term fix. Is the issue resolved for you?

max-rocket-internet avatar Dec 13 '23 11:12 max-rocket-internet

Damn Max, that's some impressive response time. 🤩 I just rerun the CI jobs, and they passed. Thank you for the quick action!

hkan avatar Dec 13 '23 11:12 hkan

OK nice. One of our chart museum pods had some I/O errors. Seems to be OK now. Thanks for reporting it.

max-rocket-internet avatar Dec 13 '23 11:12 max-rocket-internet

Hey @max-rocket-internet, the issue seems back

$ curl -v https://charts.deliveryhero.io/index.yaml
*   Trying 104.19.239.90:443...
* Connected to charts.deliveryhero.io (104.19.239.90) port 443 (#0)
* ALPN: offers h2,http/1.1
[...]
* using HTTP/2
* h2 [:method: GET]
* h2 [:scheme: https]
* h2 [:authority: charts.deliveryhero.io]
* h2 [:path: /index.yaml]
* h2 [user-agent: curl/8.1.2]
* h2 [accept: */*]
* Using Stream ID: 1 (easy handle 0x7fc8f300a800)
> GET /index.yaml HTTP/2
> Host: charts.deliveryhero.io
> User-Agent: curl/8.1.2
> Accept: */*
>
< HTTP/2 504
< date: Wed, 13 Dec 2023 13:22:50 GMT
< content-type: text/plain; charset=UTF-8
< content-length: 15
< x-frame-options: SAMEORIGIN
< referrer-policy: same-origin
< cache-control: private, max-age=0, no-store, no-cache, must-revalidate, post-check=0, pre-check=0
< expires: Thu, 01 Jan 1970 00:00:01 GMT
< set-cookie: ***
< server: cloudflare
< cf-ray: 834e8334aedfbffd-WAW
< alt-svc: h3=":443"; ma=86400
<
* Connection #0 to host charts.deliveryhero.io left intact

Hmmm I see we are periodically throwing 499s

image

max-rocket-internet avatar Dec 13 '23 14:12 max-rocket-internet

@max-rocket-internet, could you please take a look at this https://github.com/max-rocket-internet/k8s-event-logger/pull/45

pierluigilenoci avatar Dec 13 '23 14:12 pierluigilenoci

I've applied some changes but still not sure what the root cause is. I'll leave this open, let me know if there's further problems.

max-rocket-internet avatar Dec 13 '23 15:12 max-rocket-internet

@max-rocket-internet, my comment (and the linked PR) was relevant because it allows everyone to have a more resilient Chart Museum than Rocket's.

pierluigilenoci avatar Dec 14 '23 08:12 pierluigilenoci

@pierluigilenoci

was relevant because it allows everyone to have a more resilient Chart Museum

It doesn't, it just moves that single chart to github pages whereas in this repo there are many charts. Also I don't want to use github pages on my own account 🙂

max-rocket-internet avatar Dec 14 '23 09:12 max-rocket-internet

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Dec 29 '23 01:12 github-actions[bot]

For me, this is still an issue.

pierluigilenoci avatar Dec 29 '23 09:12 pierluigilenoci

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Jan 13 '24 01:01 github-actions[bot]

This is still an issue.

pierluigilenoci avatar Jan 15 '24 11:01 pierluigilenoci

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Jan 31 '24 01:01 github-actions[bot]

This is still an issue.

pierluigilenoci avatar Feb 02 '24 09:02 pierluigilenoci

@pierluigilenoci how often are you seeing this?

e.g. if I run this:

while true; do curl -w "%{response_code} %{time_total}\n" -s -o /dev/null https://charts.deliveryhero.io/index.yaml ; done

For me it's solid, left it running for about an hour, it's all like this:

200 0.297999
200 0.214413
200 0.177474
200 0.294827
200 0.310920

max-rocket-internet avatar Feb 02 '24 13:02 max-rocket-internet

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Feb 17 '24 01:02 github-actions[bot]

It still needs to be resolved. It happened once in a while, and it could happen again. What is DeliveryHero's Chart Registry SLA?

Possible solution? https://github.com/max-rocket-internet/k8s-event-logger/pull/45

pierluigilenoci avatar Feb 19 '24 12:02 pierluigilenoci

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Mar 05 '24 01:03 github-actions[bot]

No action corresponds to no solution.

pierluigilenoci avatar Mar 05 '24 07:03 pierluigilenoci

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Mar 20 '24 01:03 github-actions[bot]

No solution corresponds to the problem that is still present.

pierluigilenoci avatar Mar 20 '24 09:03 pierluigilenoci

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Apr 04 '24 01:04 github-actions[bot]

9XLR

pierluigilenoci avatar Apr 08 '24 08:04 pierluigilenoci

@pierluigilenoci as I mentioned here, it would be helpful to have more info from your side.

It happened once in a while

Is not helpful 🙂

Please run the command mentioned for a few hours or a day and post the results.

max-rocket-internet avatar Apr 08 '24 09:04 max-rocket-internet

This issue is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] avatar Apr 23 '24 01:04 github-actions[bot]

This is happening again, is there a planned fix? Thanks!

[2/10/24 8:00:34] ➜  while true; do curl -w "%{response_code} %{time_total}\n" -s -o /dev/null https://charts.deliveryhero.io/index.yaml ; done
504 60.226101
504 59.951381
504 61.321052
...

jjesusglia avatar Oct 02 '24 08:10 jjesusglia

@max-rocket-internet As I wrote last spring, the problem has not been solved, and it happened again. Ref: #606 The solution: #591

pierluigilenoci avatar Oct 02 '24 09:10 pierluigilenoci