rnaseq
rnaseq copied to clipboard
Output CRAM as default
Description of feature
We should probably be outputting CRAM files as default instead of BAM (it is an nf-core recommended feature in the guidelines, after all).
This was mentioned by @apeltzer to @drpatelh back in 2020 here and looks like it shouldn't be a huge amount of work?
Quote from above:
We could just update the
main.nf
on nf-core/modules to have optional outputs and amend thescript
section accordingly if required. Or the additional arguments to produce CRAM could also come via theoptions.args
parameter provided by the developer.
CRAM outputs should save disk space for the end user and present few downsides. We can have a parameter to optionally still produce BAM instead if needed.
Agree, it would be nice to output CRAM by default 👍🏽 Definitely not impossible but definitely not trivial either 😅
If we want to do it properly, we would have to test and update all relevant nf-core/local modules as well as perform conversions back to BAM for those tools that don't support it. May need to put aside a few days/week to sort it.
Yup, agreed. Hopefully once a few pipelines start doing this it'll gain momentum because of the shared modules..
Sarek already does CRAM by default, right @maxulysse? So if we get the two most popular pipelines doing it, that should cover a lot of the most frequently used modules.
I'd say we go for CRAM everywhere.