Leopoldo Pla Sempere comments

Results: 21 comments by Leopoldo Pla Sempere

"neural_modifications" has lost some meaning as name of this branch

The current intensive tests from `run-tests.sh` pass. But I think we should add new tests to make sure that all these changes regarding GPU management work correctly before...

Job groups and SLURM job arrays support

Temporary workaround? https://github.com/snakemake/snakemake/issues/343

Job groups and SLURM job arrays support

Nope. As I said back then, we have had to run the whole workflow manually this whole time under SLURM.

Job groups and SLURM job arrays support

Well, I think it is not that easy. Snakemake might need to take into account all jobs from a rule and wait until all inputs for those jobs are ready,...

Job groups and SLURM job arrays support

As @johanneskoester said on Twitter (https://twitter.com/zngu/status/1499479835290308618), the Pull Request https://github.com/snakemake/snakemake/pull/1015 should allow grouping jobs using SLURM job arrays.

Job groups and SLURM job arrays support

Hi. I reviewed the PR that implements the SLURM backend in Snakemake, and it doesn't support job arrays yet. It still needs code work, as there is no reference...

Job groups and SLURM job arrays support

Hello, @cmeesters. Thank you for your response. I do agree that a few thousand jobs shouldn't be an issue on a modern supercomputer. But in our practical case, that's not...

Timeout waiting for RPC from GSP!

Also happening here on an A100-PCIE-40GB using driver 530.30.02 and CUDA 12.1.

Timeout waiting for RPC from GSP!

There is no environment. It triggered the bug several times with both the 525 and 530 drivers. It is a machine learning inference command-line tool written in PyTorch.

Timeout waiting for RPC from GSP!

> > There is no environment. It triggered the bug several times in both 525 and 530 driver. It is a Machine Learning inference command line written in PyTorch. >...