Leopoldo Pla Sempere

Results 21 comments of Leopoldo Pla Sempere

Current intensive tests from `run-tests.sh` are working. But I think that we should add new tests to be sure that all these changes regarding GPU management are correctly working before...

Temporary workaround? https://github.com/snakemake/snakemake/issues/343

Nope. As I said back then, we have had to run the whole workflow manually this whole time under SLURM.

Well, I think it is not that easy. Snakemake could need to take into account all jobs from a rule and wait until all input for those jobs are ready,...

As @johanneskoester said on Twitter (https://twitter.com/zngu/status/1499479835290308618), the Pull Request https://github.com/snakemake/snakemake/pull/1015 should allow grouping jobs using SLURM job arrays.

Hi. I reviewed the PR that implements the SLURM backend in Snakemake, and it doesn't allow performing job arrays yet. It still needs code work, as there is no reference...

Hello, @cmeesters. Thank you for your response. I do agree that a few thousand jobs shouldn't be an issue in a modern Supercomputer. But in our practical case, that's not...

Also happening here on a A100-PCIE-40GB using driver 530.30.02 and CUDA 12.1.

There is no environment. It triggered the bug several times in both 525 and 530 driver. It is a Machine Learning inference command line written in PyTorch.

> > There is no environment. It triggered the bug several times in both 525 and 530 driver. It is a Machine Learning inference command line written in PyTorch. >...