node-problem-detector icon indicating copy to clipboard operation
node-problem-detector copied to clipboard

This is a place for various problem detectors running on the Kubernetes nodes.

Results 51 node-problem-detector issues
Sort by recently updated
recently updated
newest added

#### Which issue(s) this PR fixes: this pr solve issue #457 . #### What this PR does / why we need it: This PR adds functionality of tainting and untainting...

cncf-cla: yes
size/L
lifecycle/rotten
ok-to-test

Vulnerability scan shows high CVEs for version 0.8.18. This issue to request when might these get fixed. ` trivy image --severity LOW,MEDIUM,HIGH,CRITICAL --ignore-unfixed --exit-code 3 --exit-on-eol 7 --scanners vuln registry.k8s.io/node-problem-detector/node-problem-detector:v0.8.18`...

Could we introduce a random delay boolean field in pluginConfig, so that npd will execute plugins on different node asynchronized and we won't have all nodes report on the same...

lifecycle/stale

```release-note This pr to add the feature on systemlogmonitor to trigger the condition only repeat match the rule pattern in a period. For example I want add a pattern that...

cncf-cla: yes
size/L
lifecycle/rotten
ok-to-test

This feature request aims to enhance the Node Problem Detector with the ability to monitor GPUs on nodes and detect issues. Currently NPD does not have direct visibility into GPUs....

kind/feature

Related kernel error messages are as below. kernel: XFS (dm-4): Internal error xfs_iunlink_remove at line 2038 of file fs/xfs/xfs_inode.c. Caller xfs_ifree+0x33/0x130 [xfs] kernel: XFS (dm-4): Corruption detected. Unmount and run...

lgtm
cncf-cla: yes
size/S
ok-to-test

Now NPD load rules from local file, from pod mount or builtin container; Do we have plan to support load rules from apiServer configmap content ? If so, I would...

kind/feature

/kind bug ### What happened? Running containers in privileged mode is not recommended as privileged containers run with all [linux capabilities](https://man7.org/linux/man-pages/man7/capabilities.7.html) enabled and can access the host's resources. Running containers...

lifecycle/stale
needs-kind

**Issue Description:** I would like to request the inclusion of a compatibility matrix in the README.md file of the `node-problem-detector` repository. The compatibility matrix would provide users with a clear...