node-problem-detector
node-problem-detector copied to clipboard
This is a place for various problem detectors running on the Kubernetes nodes.
Hello, We have a use case that is not quite fitting into the custom plugin monitor, and we were wondering if there is an interface to write a problem daemon...
Hey folks, This is originally generated by https://app.stepsecurity.io/securerepo and I updated with a few changes https://github.com/sozercan/node-problem-detector/pull/1 What this PR does: - Adds dependabot with a weekly cadence with grouping for...
`nethealth` doesn't seem to be used or referenced anywhere anymore. /cc @wangzhen127 @vteratipally
draino
hello , as draino is no longer maintained , can you please update your documentation accordingly? Is there also another tool which replaced draino and can be used with NPD?...
Bumps the actions-all group with 1 update: [github/codeql-action](https://github.com/github/codeql-action). Updates `github/codeql-action` from 3.26.7 to 3.26.8 Changelog Sourced from github/codeql-action's changelog. CodeQL Action Changelog See the releases page for the relevant changes...
In [filelog](https://github.com/kubernetes/node-problem-detector/blob/master/pkg/systemlogmonitor/logwatchers/filelog/log_watcher.go#L121) lines that don`t match the regex are logged as error with level warning. This results in lot of logs generated from the npd daemonset. Shouldn`t the log level...
Bumps [cloud.google.com/go/compute/metadata](https://github.com/googleapis/google-cloud-go) from 0.5.0 to 0.5.2. Release notes Sourced from cloud.google.com/go/compute/metadata's releases. compute/metadata: v0.5.2 0.5.2 (2024-09-20) Bug Fixes compute/metadata: Close Response Body for failed request (#10891) (e91d45e) Changelog Sourced from...
We're enhancing NPD to support detection of various read-only scenarios (e.g., boot disk, local SSDs, network-attached drives). To support this, the ReadonlyFilesystem configuration is being moved from the kernel monitor...
Custom plugins may spawn zombie processes, such as: ```bash # in NPD container Z Wed Nov 30 23:38:18 2022 netwo 1221841 Z Wed Nov 30 23:38:46 2022 netwo 1223132 Z...
Hi, as part of making our platform more reliable we are creating SLOs for all the core components, but for node-problem-detector we dont have any metric which can be used...