
Microceph no longer tracking correct disks/osd

Open FLeiXiuS opened this issue 1 year ago • 3 comments

One of my OSDs unexpectedly went down and became unavailable. I immediately started troubleshooting and noticed that the microceph disk list command showed the device originally assigned to osd.8 as "available unpartitioned."

OSD.8 is the disk that is currently marked as down.

image

OSD.8 and OSD.9 are somehow set to the exact same disk. I'm not sure how this happened, as they were both added using the scsi-XXX names. image
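For anyone wanting to check for the same kind of collision, here is a minimal sketch that flags OSDs whose device paths resolve to the same disk. It assumes the OSD-to-path mapping is copied by hand from the disk list output; the function name `find_shared_devices` and the `scsi-AAAA`/`scsi-BBBB` paths are hypothetical placeholders, not real MicroCeph output or API.

```python
import os
from collections import defaultdict


def find_shared_devices(osd_to_path, resolve=os.path.realpath):
    """Return {resolved_device: [osd ids]} for devices claimed by more than one OSD.

    osd_to_path: dict of OSD id -> device path (e.g. a /dev/disk/by-id/ symlink).
    resolve: function used to canonicalize a path; defaults to following symlinks.
    """
    by_device = defaultdict(list)
    for osd_id, path in osd_to_path.items():
        # Two different by-id names can point at the same block device,
        # so compare the resolved targets rather than the raw strings.
        by_device[resolve(path)].append(osd_id)
    return {dev: sorted(ids) for dev, ids in by_device.items() if len(ids) > 1}


if __name__ == "__main__":
    # Hypothetical mapping, entered manually; replace with your own paths.
    example = {
        8: "/dev/disk/by-id/scsi-AAAA",
        9: "/dev/disk/by-id/scsi-AAAA",
        10: "/dev/disk/by-id/scsi-BBBB",
    }
    # An identity resolver keeps the demo independent of the local /dev tree.
    print(find_shared_devices(example, resolve=lambda p: p))
```

On the affected host, dropping the `resolve` argument makes the function follow the symlinks, so it would also catch two different names that point at the same underlying device.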

The cluster is currently very unhappy because OSD.8 is down/out. image

I'm not sure how to proceed. I cannot remove the disk with microceph disk remove OSD.8, presumably because that particular disk is also registered as OSD.9; the command times out when I attempt it.

OSD.9 is currently available and online. image

The Ceph dashboard shows OSD.8 mapped to the correct disk. image

Version: image

FLeiXiuS, Dec 19 '23 17:12