node-problem-detector icon indicating copy to clipboard operation
node-problem-detector copied to clipboard

Move ReadonlyFilesystem Node Condition to a new configuration file

Open DigitalVeer opened this issue 1 year ago • 8 comments

We're enhancing NPD to support detection of various read-only scenarios (e.g., boot disk, local SSDs, network-attached drives). To support this, the ReadonlyFilesystem configuration is being moved from the kernel monitor into a dedicated plugin configuration file that will take over this new functionality.

DigitalVeer avatar Sep 20 '24 09:09 DigitalVeer

Welcome @DigitalVeer!

It looks like this is your first PR to kubernetes/node-problem-detector 🎉. Please refer to our pull request process documentation to help your PR have a smooth ride to approval.

You will be prompted by a bot to use commands during the review process. Do not be afraid to follow the prompts! It is okay to experiment. Here is the bot commands documentation.

You can also check if kubernetes/node-problem-detector has its own contribution guidelines.

You may want to refer to our testing guide if you run into trouble with your tests not passing.

If you are having difficulty getting your pull request seen, please follow the recommended escalation practices. Also, for tips and tricks in the contribution process you may want to read the Kubernetes contributor cheat sheet. We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to Kubernetes. :smiley:

k8s-ci-robot avatar Sep 20 '24 09:09 k8s-ci-robot

Hi @DigitalVeer. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot avatar Sep 20 '24 09:09 k8s-ci-robot

/ok-to-test

AnishShah avatar Sep 20 '24 17:09 AnishShah

/retest /lgtm

mmiranda96 avatar Sep 20 '24 17:09 mmiranda96

You need to add your new plugin config file to https://github.com/kubernetes/node-problem-detector/blob/dc4200d805c9554d6585dba8d1a566fb20b6869b/config/systemd/node-problem-detector-metric-only.service#L11 (and potentially other files).

mmiranda96 avatar Sep 20 '24 17:09 mmiranda96

/cc @wangzhen127 /assign @wangzhen127

hakman avatar Sep 20 '24 19:09 hakman

To make sure it is backward compatible for people using the OSS config file directly, we need to add the new file to all the locations where kernel-monitor.json is invoked: https://github.com/search?q=repo%3Akubernetes%2Fnode-problem-detector%20kernel-monitor.json&type=code

wangzhen127 avatar Sep 20 '24 23:09 wangzhen127

@DigitalVeer please rebase and squash the commits.

hakman avatar Oct 09 '24 06:10 hakman

/retest /lgtm

hakman avatar Oct 09 '24 08:10 hakman

/retest

hakman avatar Oct 09 '24 09:10 hakman

/lgtm /retest

wangzhen127 avatar Oct 09 '24 15:10 wangzhen127

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: DigitalVeer, hakman, mmiranda96, wangzhen127

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot avatar Oct 09 '24 15:10 k8s-ci-robot

@wangzhen127 There's a problem with the failing tests. Seems to be something broken in some other related repo. Any ideas?

hakman avatar Oct 09 '24 16:10 hakman

/retest

wangzhen127 avatar Oct 15 '24 21:10 wangzhen127

/retest

wangzhen127 avatar Oct 15 '24 22:10 wangzhen127