aws-eda-slurm-cluster
AWS Slurm Cluster for EDA Workloads
**Describe the bug** Currently ParallelCluster provides a CloudFormation template to create the Slurm database. This stack only creates an RDS database, but doesn't create an instance to run the slurmdbd...
**Is your feature request related to a problem? Please describe.** ParallelCluster currently only supports 1 OS and 1 CPU architecture in a cluster. Users should be able to define compute...
**Is your feature request related to a problem? Please describe.** Make sure that slurmrestd is working with ParallelCluster head node. **Describe the solution you'd like** Make sure that the example...
**Describe the bug** If a job in a high-priority queue is pending because of licenses, Slurm will not preempt a lower-priority job to free up a license....
* Add support for nss_slurm plugin
* https://slurm.schedmd.com/nss_slurm.html
* Removes the need for the user_groups.json file and for creating local users and groups.
* 1/21/22: Implemented but when I remove the...
* Investigate removing memory as a consumable resource and allocate 1 job per node.
* This is more scalable according to https://slurm.schedmd.com/big_sys.html because the scheduler doesn't have to keep track...
**Is your feature request related to a problem? Please describe.** * Create a script to update the license configuration based on the licenses actually available on the license server. * Use a customizable API...
I haven't studied the following completely by reviewing the code, so apologies in advance if it exists. When I run the ansible playbook to install on my workstation machines so...
I was previously only allowing 1 memory size/core count combination to keep the number of compute resources down and also was combining multiple instance types in one compute resource if...
**Describe the bug** When specifying Tags in the ParallelClusterConfig they get overwritten.
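One possible way to avoid losing user-specified tags is to merge them with the generated ones instead of replacing the list. The sketch below is only an illustration and is not taken from the repository: `merge_tags` is a hypothetical helper, and it assumes tags are represented as a list of `Key`/`Value` mappings and that user-specified values should win when a key appears in both lists.

```python
from typing import Dict, List

Tag = Dict[str, str]


def merge_tags(user_tags: List[Tag], generated_tags: List[Tag]) -> List[Tag]:
    """Combine two tag lists without dropping either side.

    Hypothetical helper: assumes each tag is {"Key": ..., "Value": ...}
    and that a user-specified value takes precedence on a key conflict.
    """
    # Start from the generated tags, keyed by tag name.
    merged = {tag["Key"]: tag["Value"] for tag in generated_tags}
    # Layer the user's tags on top so they are never discarded.
    for tag in user_tags:
        merged[tag["Key"]] = tag["Value"]
    return [{"Key": key, "Value": value} for key, value in merged.items()]


if __name__ == "__main__":
    user = [{"Key": "Project", "Value": "chip-a"}]
    generated = [{"Key": "ClusterName", "Value": "eda"}]
    print(merge_tags(user, generated))
```

If the generated tags must take precedence instead, the two loops can simply be applied in the opposite order.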