microceph
microceph copied to clipboard
Microceph no longer tracking correct disks/osd
Randomly one of my OSDs became unavailable and downed. I immediately started troubleshooting and noticed the microceph disk list
command showed the device that was originally at osd.8
as "available unpartitioned."
OSD.8 is the disk that is currently marked as down.
OSD.8 and OSD.9 some how are set to the exact same disk. Not sure how this happened as they were both added using the scsi-XXX
names
Current cluster is very unhappy because the OSD.8 is down/out.
Not particularly sure how to proceed as I cannot remove the disk with microceph disk remove OSD.8
as that particular disk is also OSD.9? I reach a timeout when attempting to do so.
OSD.9 is currently available and online.
OSD.8 is correctly marked in the ceph dashboard as being the right disk.
Version: