linstor-server

peer node id cannot be my own node id for empty resource definition

Open · kvaps opened this issue 3 years ago · 1 comment

After removing all resources and restarting the whole control plane:

# linstor r l -a
╭────────────────────────────────────────────────────────────────╮
┊ ResourceName ┊ Node ┊ Port ┊ Usage ┊ Conns ┊ State ┊ CreatedOn ┊
╞════════════════════════════════════════════════════════════════╡
╰────────────────────────────────────────────────────────────────╯
# linstor rd l
╭───────────────────────────────────────────────────────────────────────────────────────────────────╮
┊ ResourceName                             ┊ Port ┊ ResourceGroup                           ┊ State ┊
╞═══════════════════════════════════════════════════════════════════════════════════════════════════╡
┊ pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3 ┊ 7053 ┊ sc-9e1697f9-0811-5b78-8d18-289b7e9f1173 ┊ ok    ┊
┊ pvc-dc8847ca-00df-42ad-a062-626d40529c29 ┊ 7020 ┊ sc-9e1697f9-0811-5b78-8d18-289b7e9f1173 ┊ ok    ┊
┊ pvc-e893e7fb-e17e-409f-8e90-aed64e20132d ┊ 7098 ┊ sc-9e1697f9-0811-5b78-8d18-289b7e9f1173 ┊ ok    ┊
┊ pvc-e004945b-5216-4731-ac9e-c525c7ec847c ┊ 7083 ┊ sc-9e1697f9-0811-5b78-8d18-289b7e9f1173 ┊ ok    ┊
╰───────────────────────────────────────────────────────────────────────────────────────────────────╯
# linstor rd d pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3
SUCCESS:
Description:
    Resource definition 'pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3' marked for deletion.
Details:
    Resource definition 'pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3' UUID is: 34a34176-9545-4af0-b077-d3bad7fd2c30
SUCCESS:
    Notified 'hf-kubevirt-03' that diskless resources of 'pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3' are being deleted
SUCCESS:
    Notified 'hf-kubevirt-02' that diskless resources of 'pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3' are being deleted
ERROR:
    (Node: 'hf-kubevirt-01') Failed to adjust DRBD resource pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3
Show reports:
    linstor error-reports show 61F261C9-39576-000054
# linstor error-reports show 61F261C9-39576-000054
ERROR REPORT 61F261C9-39576-000054

============================================================

Application:                        LINBIT® LINSTOR
Module:                             Satellite
Version:                            1.17.0
Build ID:                           7e646d83dbbadf1ec066e1bc8b29ae018aff1f66
Build time:                         2021-12-09T07:27:52+00:00
Error time:                         2022-01-27 09:33:33
Node:                               hf-kubevirt-01

============================================================

Reported error:
===============

Description:
    Failed to adjust DRBD resource pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3

Category:                           LinStorException
Class name:                         ResourceException
Class canonical name:               com.linbit.linstor.core.devmgr.exceptions.ResourceException
Generated at:                       Method 'adjustDrbd', Source file 'DrbdLayer.java', Line #801

Error message:                      Failed to adjust DRBD resource pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3

Error context:
    An error occurred while processing resource 'Node: 'hf-kubevirt-01', Rsc: 'pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3''

Call backtrace:

    Method                                   Native Class:Line number
    adjustDrbd                               N      com.linbit.linstor.layer.drbd.DrbdLayer:801
    process                                  N      com.linbit.linstor.layer.drbd.DrbdLayer:391
    process                                  N      com.linbit.linstor.core.devmgr.DeviceHandlerImpl:846
    processResourcesAndSnapshots             N      com.linbit.linstor.core.devmgr.DeviceHandlerImpl:359
    dispatchResources                        N      com.linbit.linstor.core.devmgr.DeviceHandlerImpl:169
    dispatchResources                        N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:309
    phaseDispatchDeviceHandlers              N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:1071
    devMgrLoop                               N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:735
    run                                      N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:631
    run                                      N      java.lang.Thread:829

Caused by:
==========

Description:
    Execution of the external command 'drbdadm' failed.
Cause:
    The external command exited with error code 1.
Correction:
    - Check whether the external program is operating properly.
    - Check whether the command line is correct.
      Contact a system administrator or a developer if the command line is no longer valid
      for the installed version of the external program.
Additional information:
    The full command line executed was:
    drbdadm -vvv adjust pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3

    The external command sent the following output data:
    drbdsetup new-peer pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3 0 --_name=hf-kubevirt-02 --shared-secret=UJuSVi54vwtFgh4LdROw --cram-hmac-alg=sha1


    The external command sent the following error information:
    pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3: Failure: (162) Invalid configuration request
    additional info from kernel:
    peer node id cannot be my own node id
    Command 'drbdsetup new-peer pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3 0 --_name=hf-kubevirt-02 --shared-secret=UJuSVi54vwtFgh4LdROw --cram-hmac-alg=sha1' terminated with exit code 10


Category:                           LinStorException
Class name:                         ExtCmdFailedException
Class canonical name:               com.linbit.extproc.ExtCmdFailedException
Generated at:                       Method 'execute', Source file 'DrbdAdm.java', Line #593

Error message:                      The external command 'drbdadm' exited with error code 1


Call backtrace:

    Method                                   Native Class:Line number
    execute                                  N      com.linbit.linstor.layer.drbd.utils.DrbdAdm:593
    adjust                                   N      com.linbit.linstor.layer.drbd.utils.DrbdAdm:90
    adjustDrbd                               N      com.linbit.linstor.layer.drbd.DrbdLayer:735
    process                                  N      com.linbit.linstor.layer.drbd.DrbdLayer:391
    process                                  N      com.linbit.linstor.core.devmgr.DeviceHandlerImpl:846
    processResourcesAndSnapshots             N      com.linbit.linstor.core.devmgr.DeviceHandlerImpl:359
    dispatchResources                        N      com.linbit.linstor.core.devmgr.DeviceHandlerImpl:169
    dispatchResources                        N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:309
    phaseDispatchDeviceHandlers              N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:1071
    devMgrLoop                               N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:735
    run                                      N      com.linbit.linstor.core.devmgr.DeviceManagerImpl:631
    run                                      N      java.lang.Thread:829


END OF ERROR REPORT.
root@hf-kubevirt-01:/# cat /var/lib/linstor.d/pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3.res
# This file was generated by linstor(1.17.0), do not edit manually.
# Local node: hf-kubevirt-01
# Host name : hf-kubevirt-01

resource "pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3"
{

    options
    {
        quorum off;
    }

    net
    {
        cram-hmac-alg     sha1;
        shared-secret     "UJuSVi54vwtFgh4LdROw";
    }

    on hf-kubevirt-01
    {
        node-id    1;
    }

    on hf-kubevirt-02
    {
        node-id    0;
    }

    connection
    {
        host hf-kubevirt-01 address ipv4 192.168.242.35:7053;
        host hf-kubevirt-02 address ipv4 192.168.242.38:7053;
    }
}
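
The generated .res file and the failing drbdsetup call can be cross-checked mechanically. The following is a minimal Python sketch, not part of LINSTOR: the helper names (node_ids_from_res, peer_from_new_peer, diagnose) are hypothetical, and the input strings are trimmed copies of the output above. It only checks whether the peer node id passed to new-peer agrees with the .res file and whether it collides with the local node id configured there.

```python
import re

# Trimmed copy of the "on <host>" sections from the .res file shown above.
RES_TEXT = '''
resource "pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3"
{
    on hf-kubevirt-01
    {
        node-id    1;
    }

    on hf-kubevirt-02
    {
        node-id    0;
    }
}
'''

# The command that failed, as printed in the error report (options shortened).
FAILED_CMD = (
    "drbdsetup new-peer pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3 0 "
    "--_name=hf-kubevirt-02"
)


def node_ids_from_res(text):
    """Map host name -> node-id from 'on <host> { node-id N; }' sections."""
    pattern = re.compile(r"\bon\s+(\S+)\s*\{\s*node-id\s+(\d+)\s*;")
    return {host: int(nid) for host, nid in pattern.findall(text)}


def peer_from_new_peer(cmd):
    """Return (peer_node_id, peer_name) from a 'drbdsetup new-peer' line."""
    m = re.search(r"new-peer\s+\S+\s+(\d+)\s.*--_name=(\S+)", cmd)
    if m is None:
        raise ValueError("not a recognizable new-peer command")
    return int(m.group(1)), m.group(2)


def diagnose(res_text, failed_cmd, local_host):
    """Compare the peer id in the command with the ids in the .res file."""
    ids = node_ids_from_res(res_text)
    peer_id, peer_name = peer_from_new_peer(failed_cmd)
    return {
        "configured_local_id": ids[local_host],
        "peer_id_in_command": peer_id,
        # Does the command use the id the .res file assigns to that peer?
        "command_matches_res_file": ids.get(peer_name) == peer_id,
        # Would the peer id collide with the local id per the .res file?
        "collides_in_res_file": ids[local_host] == peer_id,
    }


result = diagnose(RES_TEXT, FAILED_CMD, "hf-kubevirt-01")
print(result)
```

For the data above, the command matches the .res file and there is no collision inside the file itself (local node-id 1, peer node-id 0). One possible reading, which is an assumption and not confirmed here, is that the resource already present in the kernel on hf-kubevirt-01 was created with node id 0, so the id the kernel considers "my own" differs from the regenerated configuration.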

kvaps · Jan 27 '22 09:01

The nodes still have a lot of Inconsistent resources in Connecting and Unconnected states:

root@hf-kubevirt-02:/# drbdadm status pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3
pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3 role:Secondary
  hf-kubevirt-01 connection:Connecting
root@hf-kubevirt-01:/# drbdadm status pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3
pvc-4520ca42-4d53-4879-b662-63fd1f3ae4e3 role:Secondary

Fixed by running drbdsetup down all. The resources disappeared from LINSTOR, but the LVs were not removed.

kvaps · Jan 27 '22 09:01