Leopoldo Pla Sempere
Leopoldo Pla Sempere
Current intensive tests from `run-tests.sh` are working. But I think that we should add new tests to be sure that all these changes regarding GPU management are correctly working before...
Temporary workaround? https://github.com/snakemake/snakemake/issues/343
Nope. As I said back then, we have had to run the whole workflow manually this whole time under SLURM.
Well, I think it is not that easy. Snakemake could need to take into account all jobs from a rule and wait until all input for those jobs are ready,...
As @johanneskoester said on Twitter (https://twitter.com/zngu/status/1499479835290308618), the Pull Request https://github.com/snakemake/snakemake/pull/1015 should allow grouping jobs using SLURM job arrays.
Hi. I reviewed the PR that implements the SLURM backend in Snakemake, and it doesn't allow performing job arrays yet. It still needs code work, as there is no reference...
Hello, @cmeesters. Thank you for your response. I do agree that a few thousand jobs shouldn't be an issue in a modern Supercomputer. But in our practical case, that's not...
Also happening here on a A100-PCIE-40GB using driver 530.30.02 and CUDA 12.1.
There is no environment. It triggered the bug several times in both 525 and 530 driver. It is a Machine Learning inference command line written in PyTorch.
> > There is no environment. It triggered the bug several times in both 525 and 530 driver. It is a Machine Learning inference command line written in PyTorch. >...