
Parameter for using the --save_reference 'genome' directory

Open kaboroevich opened this issue 2 years ago • 3 comments

Description of feature

Using the --save_reference parameter retains all the intermediate reference files in the <RESULTS_DIR>/genome directory. However, as I understand the parameter documentation, subsequent runs of the pipeline still require each reference parameter (--fasta, --gtf, --star_index, --salmon_index, etc.) to be defined individually. It would be helpful if the path to the genome folder could be passed to a single parameter, such as --use_saved_reference, from which the pipeline would extract the relevant paths.
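Until something like that exists, the lookup can be approximated outside the pipeline. Below is a minimal sketch in Python, not part of nf-core/rnaseq: the helper name reference_args and the file layout it probes (a FASTA and GTF at the top level, index/star and index/salmon subdirectories) are assumptions for illustration and should be adjusted to whatever the saved genome folder actually contains.

```python
from pathlib import Path


def reference_args(genome_dir):
    """Build reference flags from a saved genome directory.

    Hypothetical helper: the patterns below are assumed file locations,
    not the documented output layout of --save_reference.
    """
    root = Path(genome_dir)
    # Assumed mapping: pipeline flag -> glob patterns tried in order.
    candidates = {
        "--fasta": ["*.fa", "*.fasta"],
        "--gtf": ["*.gtf"],
        "--star_index": ["index/star"],
        "--salmon_index": ["index/salmon"],
    }
    args = []
    for flag, patterns in candidates.items():
        for pattern in patterns:
            matches = sorted(root.glob(pattern))
            if matches:
                # Use the first match; flags with no match are left out,
                # so the pipeline would have to rebuild those references.
                args += [flag, str(matches[0])]
                break
    return args
```

The returned list could then be appended to the usual pipeline command line, for example by joining it with spaces. Flags whose files are not found are omitted rather than guessed.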

kaboroevich, Apr 11 '23 04:04

Hi @kaboroevich! This is a good suggestion, and something I have been thinking about to make it easier to re-use the references saved with --save_reference. The idea behind --save_reference has always been that you move the genome folder somewhere else for re-use, rather than leaving it in the results folder of a single experiment. I will see if we can improve this behavior in later releases.

drpatelh, May 30 '23 10:05

Hi @drpatelh

I just tried the --save_reference option in nf-core/rnaseq and noticed that the salmon reference isn't saved in the genome/index/ directory alongside the STAR reference. Is this a bug, or is it by design?

amizeranschi, Jul 23 '23 13:07