rnaseq icon indicating copy to clipboard operation
rnaseq copied to clipboard

DESEQ2_QC_STAR_SALMON timeout with rlog and 400 samples

Open ojziff opened this issue 3 years ago • 1 comments

I ran rnaseq v3.8.1 on 400 samples. Failed as I got a timeout after 8hrs on DESEQ2_QC_STAR_SALMON process. It was still running rlog after 8 hours with the message:

rlog() may take a long time with 50 or more samples,
vst() is a much faster transformation

I have specified --deseq2_vst which works but i wondered if --deseq2_vst could be made default?

ojziff avatar Jun 24 '22 10:06 ojziff

I second that it should be the default

Also, the timeout seems unnecessary - why produce an error when things could still finish correctly?

RaverJay avatar Jul 22 '22 07:07 RaverJay

I also agree with this proposal. I have been processing hundreds of RNA-seq datasets and this step of the pipeline fails after 8 hours.

Furthermore, since I have set automatic retries of [attempt number] * 8 hours - this would cause the pipeline to fail again after 16 hours and then again after 24 hours.

Consequently, the rnaseq pipeline terminates in error and takes around 2 days to process this PCA plot, which is only for QC purposes.

I think Oliver’s suggestion of having --deseq2_vst as the default option is a very good idea and should improve the pipeline substantially when processing a large number of samples.

Otherwise, thanks for maintaining this great pipeline!

Best,

Steven

StevenWingett avatar Aug 26 '22 15:08 StevenWingett

Agree too! Will be in the next release.

drpatelh avatar Aug 26 '22 17:08 drpatelh

Fixed in https://github.com/nf-core/rnaseq/pull/874/commits/a4be75f4cc55c19ae8a1df34cfa8acc7afc0853e

drpatelh avatar Sep 27 '22 10:09 drpatelh