aws-eda-slurm-cluster icon indicating copy to clipboard operation
aws-eda-slurm-cluster copied to clipboard

AWS Slurm Cluster for EDA Workloads

Results 39 aws-eda-slurm-cluster issues
Sort by recently updated
recently updated
newest added

**Describe the bug** Currently ParallelCluster provides a CloudFormation template to create the Slurm database. This stack only creates an RDS database, but doesn't create an instance to run the slurmdbd...

**Is your feature request related to a problem? Please describe.** ParallelCluster currently only supports 1 OS and 1 CPU architecture in a cluster. Users should be able to define compute...

**Is your feature request related to a problem? Please describe.** Make sure that slurmrestd is working with ParallelCluster head node. **Describe the solution you'd like** Make sure that the example...

**Describe the bug** If a job in a high priority queue is pending because of licenses then Slurm will not preempt a lower priority job to free up a license....

* Add support for nss_slurm plugin * https://slurm.schedmd.com/nss_slurm.html * Removes the need of the user_groups.json file and creating local users and groups. * 1/21/22: Implemented but when I remove the...

* Investigate removing memory as a consumable resource and allocate 1 job per node. * This is more scalable according to https://slurm.schedmd.com/big_sys.html because the scheduler doesn't have to keep track...

**Is your feature request related to a problem? Please describe.** * Create script to update license configuration based on actual available licenses on license server. * Use a customizable API...

I haven't studied the following completely by reviewing the code, so apologies in advance if it exists. When I run the ansible playbook to install on my workstation machines so...

I was previously only allowing 1 memory size/core count combination to keep the number of compute resources down and also was combining multiple instance types in one compute resource if...

**Describe the bug** When specifying Tags in the ParallelClusterConfig they get overwritten.