gemBS icon indicating copy to clipboard operation
gemBS copied to clipboard

Scattering gemBS alignment step across a cluster

Open gilhornung opened this issue 3 years ago • 1 comments

Hello, We are using Cromwell on a Google Cloud Platform to analyze sequencing data. We found that, at least for bwa and WGS, scattering the reads in small chunks across several nodes with few cpus is cheaper than running the entire sample on one node with many cpus, when using preemptible instances. From the documentation I saw that for gemBS the recommendation is to run all steps on one machine with multiple cpus for parallelization. My question is whether it is possible to perform the map step of gemBS across many independent instances, and then merge the resulting bams using gemBS. Thank you, Gil

gilhornung avatar Jul 07 '21 12:07 gilhornung

In principle yes (we don't use cromwell, but we have a workflow based on Slurm which splits the run across a cluster without any major problems). If there is a shared file system for all nodes then it should work pretty much without modification, otherwise you'll have to copy the bams for each chunk into the same directory and use the --no-db option to gemBS for the merge step.

Simon

On Wed, Jul 7, 2021 at 2:31 PM gilhornung @.***> wrote:

Hello, We are using Cromwell on a Google Cloud Platform to analyze sequencing data. We found that, at least for bwa and WGS, scattering the reads in small chunks across several nodes with few cpus is cheaper than running the entire sample on one node with many cpus, when using preemptible instances https://cloud.google.com/compute/docs/instances/preemptible. From the documentation I saw that for gemBS the recommendation is to run all steps on one machine with multiple cpus for parallelization. My question is whether it is possible to perform the map step of gemBS across many independent instances, and then merge the resulting bams using gemBS. Thank you, Gil

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/heathsc/gemBS/issues/84, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAY465Z7NQFKJU4QVSD3JM3TWRCKNANCNFSM476SG54Q .

heathsc avatar Jul 07 '21 12:07 heathsc