ansible-slurm icon indicating copy to clipboard operation
ansible-slurm copied to clipboard

You are not running a supported accounting_storage plugin

Open Midnighter opened this issue 1 year ago • 2 comments

I'm trying out this Ansible role on a ubuntu 22.04 VM. When running the bare default example, things run smoothly until the following task:

TASK [galaxyproject.slurm : Create the slurmdbd cluster] *************************************************************************************************************************************
fatal: [129.70.51.119]: FAILED! => {"changed": true, "cmd": ["sacctmgr", "-i", "-n", "add", "cluster", "cluster"], "delta": "0:00:00.006080", "end": "2023-09-09 20:14:06.899666", "msg": "non-zero return code", "rc": 1, "start": "2023-09-09 20:14:06.893586", "stderr": "You are not running a supported accounting_storage plugin\nOnly 'accounting_storage/slurmdbd' is supported.", "stderr_lines": ["You are not running a supported accounting_storage plugin", "Only 'accounting_storage/slurmdbd' is supported."], "stdout": "", "stdout_lines": []}

I'm not really sure where to start digging into this, as I was hoping the default would "just work" 😉.

Midnighter avatar Sep 09 '23 20:09 Midnighter

I had the same issue. Adding these two lines under vars moved me to the next problem (which I am stuck at)

    slurm_config:
      AccountingStorageType: "accounting_storage/slurmdbd"

jp-um avatar Oct 09 '23 14:10 jp-um

I am giving up on this. I get the error, and have been trying to fix this for days now

TASK [ansible-slurm : Create the slurmdbd cluster] *************************************************************************************************************************
fatal: [compute-0]: FAILED! => {"changed": true, "cmd": ["sacctmgr", "-i", "-n", "add", "cluster", "cluster"], "delta": "0:00:00.006372", "end": "2023-10-11 14:09:35.829228", "msg": "non-zero return code", "rc": 1, "start": "2023-10-11 14:09:35.822856", "stderr": "sacctmgr: error: slurm_persist_conn_open_without_init: failed to open persistent connection to host:localhost:6819: Connection refused\nsacctmgr: error: Sending PersistInit msg: Connection refused", "stderr_lines": ["sacctmgr: error: slurm_persist_conn_open_without_init: failed to open persistent connection to host:localhost:6819: Connection refused", "sacctmgr: error: Sending PersistInit msg: Connection refused"], "stdout": "", "stdout_lines": []}

jp-um avatar Oct 11 '23 13:10 jp-um