bionode-watermill
Parallelize BAM chunks
- parallelize over chunks (regions) of a BAM file
- for example, split the BAM into 20 regions if 20 cores are available
- support both options: concatenating the chunks vs. running the analysis on each region
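
A minimal sketch of the splitting step, in plain JavaScript. `splitIntoRegions` and the contig lengths are hypothetical, not part of bionode-watermill. It assumes contig names and lengths are already known (e.g. read from the BAM header) and emits `name:start-end` region strings (1-based, inclusive). Windows never cross a contig boundary, so the result can contain more than `n` regions.

```javascript
// Hypothetical helper (not a bionode-watermill API): split contigs into
// windows of at most ceil(totalLength / n) bases each.
function splitIntoRegions(contigs, n) {
  const total = contigs.reduce((sum, c) => sum + c.length, 0);
  const size = Math.ceil(total / n);
  const regions = [];
  for (const { name, length } of contigs) {
    // Windows stay inside one contig, so short contigs get their own region.
    for (let start = 1; start <= length; start += size) {
      const end = Math.min(start + size - 1, length);
      regions.push(`${name}:${start}-${end}`);
    }
  }
  return regions;
}

// Made-up contig lengths, for illustration only.
const regions = splitIntoRegions(
  [{ name: 'chr1', length: 1000 }, { name: 'chr2', length: 500 }],
  3
);
console.log(regions);
```

Each region string could then be handed to one worker; whether the per-region outputs are concatenated or analysed separately is the open choice noted above.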
Example:
- get whole genome reference, produce BAM from it
- run, in parallel, a task on each chromosome
- for each chromosome task, once it's done, do further analysis, AND
- after all chromosomes finish, do something that needs all of them (e.g. GWAS)
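
The dependency shape in this example (fan out per chromosome, follow up on each, then fan in) can be sketched with plain Promises. This is only an illustration of the ordering, not the bionode-watermill API; `perChromosomeTask`, `furtherAnalysis` and `needsAll` are hypothetical stand-ins for real tasks.

```javascript
// Hypothetical stand-ins for real pipeline steps; each returns a Promise.
const perChromosomeTask = async (chr) => `${chr}.calls`;
const furtherAnalysis = async (result) => `${result}.annotated`;
const needsAll = async (results) => results.join('+');

async function run(chromosomes) {
  // Fan out: start every chromosome task right away.
  const perChr = chromosomes.map((chr) => perChromosomeTask(chr));
  // Each follow-up starts as soon as its own chromosome finishes.
  const followUps = perChr.map((p) => p.then(furtherAnalysis));
  // Fan in: the combined step waits for all chromosome tasks,
  // but not for the per-chromosome follow-ups.
  const combined = Promise.all(perChr).then(needsAll);
  const [annotated, summary] = await Promise.all([
    Promise.all(followUps),
    combined,
  ]);
  return { annotated, summary };
}

run(['chr1', 'chr2']).then((out) => console.log(out));
```

The point of the sketch is that the per-chromosome follow-ups and the all-chromosomes step both depend on the same upstream tasks and can proceed independently of each other.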
@tiagofilipe12 this is related to #66