aws-eda-slurm-cluster
AWS Slurm Cluster for EDA Workloads
**Is your feature request related to a problem? Please describe.** Exostellar provides a nested virtualization solution on EC2 that predicts spot terminations far enough in advance to live migrate the...
**Is your feature request related to a problem? Please describe.** Customer needs to access custom S3 bucket to configure compute nodes to run jobs and they need a custom IAM...
On the page https://aws-samples.github.io/aws-eda-slurm-cluster/deploy-parallel-cluster/ I ran into three issues: 1) The Create users_groups.json section has a duplicate of the table used later in "Configure submission hosts to use the...
I ran a --cdk-cmd update to update instance selections. Then I realized I wanted an additional change, so I modified my config file and ran the update again. Unfortunately, this...
I'm building a cluster with just nine instance types and certain instances are being culled to "reduce number of CRs" - this is unnecessary as I do not have many...
**Describe the bug** The compute node instance is launching, but slurmd can't launch the job because the admin1 user doesn't exist because users_groups.json is empty. This is supposed to be...
**Describe the bug** This used to work, but when I run the prebuilt RHEL8 Slurm commands from the head node I get the following error: ``` squeue: error: PluginDir: /opt/slurm/lib/slurm:...
**Describe the bug** Yum is hanging while trying to configure the RES submitter hosts.
**Is your feature request related to a problem? Please describe.** If someone has an existing cluster not created by aws-eda-slurm-cluster, can you create a script to configure the RES desktop...
**Describe the bug** If you try to configure a submitter host as a login node for a 4th cluster, the stack fails with the following error: ``` Resource handler returned...