1click-hpc
1click-hpc copied to clipboard
Deploy your HPC Cluster on AWS in 20min. with just 1-Click.
you can see this in bootstrap.log ``` + /home/ec2-user/.local/bin/pcluster create-cluster --cluster-name hpc-1click-hpc365 --cluster-configuration config.us-east-1.yaml --rollback-on-failure false --wait { "message": "The security token included in the request is expired" } ```...
1click-hpc doesn't work with the latest version of Parallel Cluster, the OnNodeConfigured scripts of the HeadNode are failing; this seems to be related to a change to `/etc/parallelcluster/cfnconfig` introduced in...
Not sure if there's a setup step that I'm missing here but when I run the included Windows or Linux DCV job I get: ``` sbatch failed (parameters: -J Linux_Desktop...
Hi Nicola, Sean / Team, Is there a way to integrate SSO in this stack. @nicolaven I tried to integrate the octa but without success. Could you please help me...
when we use a non-zero minimum in cluster config for resources, they get alive at cluster launch. then this job-related check will never have a value of True: https://github.com/aws-samples/1click-hpc/blob/7a833d4a56dd42d28836b91938168cd4ca841e28/modules/40.install.monitoring.compute.sh#L59 because...
Hi, I noticed that enginframe creates a database on the same DB server as slurm accounting. while slurm can use a single accounting DB per organization https://aws.amazon.com/blogs/compute/enabling-job-accounting-for-hpc-with-aws-parallelcluster-and-amazon-rds/ is the same...
We got into scaling issue with the tagging in prolog script I understand the prolog is ran at every step and when many nodes are involved the job fails with...
Missing that bit, mapping job IDs to instance IDs...
We noticed in a multiuser HPC cluster with FSX attached, all users are able to browse and read files from all other users, even though writing is only possible on...
Since the /fsx/nice location is not unique to the cluster, starting multiple clusters with the same fsx will overwrite the portal data for older clusters