sarek
sarek copied to clipboard
Potentially incompatible outputs are generated with BAM output and Spark MarkDuplicates
Description of the bug
Discussed with @maxulysse on Slack:
- Spark MarkDuplicates wants name sorted output, so BAMs are made as such
- If requesting BAM output instead of CRAM, merging happens with name sorted output, causing non-contiguous blocks to be close to each other
- Indexing of the merged BAM fails
The options should be made mutually exclusive so that broken output is not generated.
Command used and terminal output
nextflow run nf-core/sarek -r 3.5.1 --input samplesheet.csv -params-file parameters.yaml --outdir ./ -profile cluster -qs 2 -resume --igenomes_ignore
Relevant files
System information
- Nextflow version: 24.10.2
- Hardware: HPC
- Executor: SLURM
- Container engine: Apptainer
- OS: Debian 10