ansible-slurm
ansible-slurm copied to clipboard
You are not running a supported accounting_storage plugin
I'm trying out this Ansible role on a ubuntu 22.04 VM. When running the bare default example, things run smoothly until the following task:
TASK [galaxyproject.slurm : Create the slurmdbd cluster] *************************************************************************************************************************************
fatal: [129.70.51.119]: FAILED! => {"changed": true, "cmd": ["sacctmgr", "-i", "-n", "add", "cluster", "cluster"], "delta": "0:00:00.006080", "end": "2023-09-09 20:14:06.899666", "msg": "non-zero return code", "rc": 1, "start": "2023-09-09 20:14:06.893586", "stderr": "You are not running a supported accounting_storage plugin\nOnly 'accounting_storage/slurmdbd' is supported.", "stderr_lines": ["You are not running a supported accounting_storage plugin", "Only 'accounting_storage/slurmdbd' is supported."], "stdout": "", "stdout_lines": []}
I'm not really sure where to start digging into this, as I was hoping the default would "just work" 😉.
I had the same issue. Adding these two lines under vars moved me to the next problem (which I am stuck at)
slurm_config:
AccountingStorageType: "accounting_storage/slurmdbd"
I am giving up on this. I get the error, and have been trying to fix this for days now
TASK [ansible-slurm : Create the slurmdbd cluster] *************************************************************************************************************************
fatal: [compute-0]: FAILED! => {"changed": true, "cmd": ["sacctmgr", "-i", "-n", "add", "cluster", "cluster"], "delta": "0:00:00.006372", "end": "2023-10-11 14:09:35.829228", "msg": "non-zero return code", "rc": 1, "start": "2023-10-11 14:09:35.822856", "stderr": "sacctmgr: error: slurm_persist_conn_open_without_init: failed to open persistent connection to host:localhost:6819: Connection refused\nsacctmgr: error: Sending PersistInit msg: Connection refused", "stderr_lines": ["sacctmgr: error: slurm_persist_conn_open_without_init: failed to open persistent connection to host:localhost:6819: Connection refused", "sacctmgr: error: Sending PersistInit msg: Connection refused"], "stdout": "", "stdout_lines": []}