website icon indicating copy to clipboard operation
website copied to clipboard

The blog post: device failures handling

Open SergeyKanzhelev opened this issue 11 months ago • 17 comments

Description

/sig node /cc @mrunalp

This is a blog post from the talk we gave on KubeCon NA 2024. Covers a lot of topics we will be working on in sig node and beyond this year in terms of reliability and extensibility.

SergeyKanzhelev avatar Jan 07 '25 19:01 SergeyKanzhelev

Pull request preview available for checking

Built without sensitive environment variables

Name Link
Latest commit 8f325240b07642e1a95b2a6dd5d9f555cac67a0b
Latest deploy log https://app.netlify.com/sites/kubernetes-io-main-staging/deploys/67e5c8d1b423440007cdb60f
Deploy Preview https://deploy-preview-49333--kubernetes-io-main-staging.netlify.app
Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

netlify[bot] avatar Jan 07 '25 20:01 netlify[bot]

@shannonxtreme I would appreciate any feedback on this.

SergeyKanzhelev avatar Jan 09 '25 19:01 SergeyKanzhelev

There's a backlog on blog article reviews @SergeyKanzhelev and the blog team are going to prioritize the older articles for publication. We should have this published within a few months, maybe sooner.

sftim avatar Jan 09 '25 22:01 sftim

/cc (I'm interested on the topic)

ffromani avatar Jan 15 '25 13:01 ffromani

@sftim do you know how to enable the description list plugin? I got error when using {{% description-list %}}:

4:58:31 PM: Error: error building site: process: readAndProcessContent: "/opt/build/repo/content/en/blog/_posts/2025-01-14-devices-failure-handling/index.md:53:1": failed to extract shortcode: template for shortcode "description-list" not found

Trying this: https://kubernetes.slack.com/archives/C1J0BPD2M/p1738688053787199?thread_ts=1738687962.938049&cid=C1J0BPD2M

SergeyKanzhelev avatar Feb 04 '25 16:02 SergeyKanzhelev

(you can use a ~CommonMark~ some de facto convention description list; no need to add a shortcode)

https://sebastiandedeyne.com/description-lists-in-markdown

sftim avatar Feb 04 '25 17:02 sftim

(you can use a ~CommonMark~ some de facto convention description list; no need to add a shortcode)

https://sebastiandedeyne.com/description-lists-in-markdown

this is how it rendered:

image

Am I doing it wrong?

SergeyKanzhelev avatar Feb 04 '25 17:02 SergeyKanzhelev

Try a blank line before “Inference“, @SergeyKanzhelev

sftim avatar Feb 04 '25 17:02 sftim

Also, nothing to do with Markdown, the inference work is often very quick, maybe as low as milliseconds or microseconds per evaluation, but the setup time to load the trained model can be minutes. You might want to capture that.

Language transformer models are a different beast but even these can do several tokens a second. What you wouldn't want to do is fire up a new instance per token (AWS Lambda style). Nope nope nope.

sftim avatar Feb 04 '25 17:02 sftim

Language transformer models are a different beast but even these can do several tokens a second. What you wouldn't want to do is fire up a new instance per token (AWS Lambda style). Nope nope nope.

I have this in the table. Good point!

SergeyKanzhelev avatar Feb 04 '25 19:02 SergeyKanzhelev

@sftim I believe I addressed all comments. Thank you for the review. What are the next steps?

SergeyKanzhelev avatar Feb 04 '25 19:02 SergeyKanzhelev

@sftim I believe I addressed all comments. Thank you for the review. What are the next steps?

We're short on capacity for blog reviews with a lead time around 1 to 8 weeks. We hope to improve that lead time. Please hold.

sftim avatar Feb 10 '25 23:02 sftim

/cc

sanposhiho avatar Feb 14 '25 12:02 sanposhiho

Thanks @sanposhiho for reviewing!

sftim avatar Feb 23 '25 21:02 sftim

No need to hold as, if it merges, it will merge as draft (and then a small follow up PR would get it published).

/hold cancel

sftim avatar Mar 28 '25 08:03 sftim

LGTM label has been added.

Git tree hash: b42bea8d1bb32131484cc3cbcf99f1bf2cfef15b

k8s-ci-robot avatar Mar 28 '25 10:03 k8s-ci-robot

/approve

lmktfy avatar May 06 '25 17:05 lmktfy

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ArangoGutierrez, lmktfy

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot avatar May 06 '25 17:05 k8s-ci-robot