ideal icon indicating copy to clipboard operation
ideal copied to clipboard

[Question]: Cutoff Guidance on "threshold on row means of normalized counts"

Open tamuanand opened this issue 1 year ago • 1 comments

Hi @federicomarini

Thanks a lot for the great package.

In the "counts overview tab", I wanted to seek guidance on the recommended cutoff to use when filtering based on "threshold on the row means of the normalized counts".

  • when it comes to "threshold on the row sums of the counts", there is some kind of guidance here - https://bioconductor.org/packages/devel/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#pre-filtering which states - Here we perform pre-filtering to keep only rows that have a count of at least 10 for a minimal number of samples. The count of 10 is a reasonable choice for bulk RNA-seq

I am trying to find any literature/guidance if I use "threshold on row means of normalized counts".

Thanks in advance.

tamuanand avatar Nov 14 '24 03:11 tamuanand

Hi @tamuanand , I think that is pretty much an arbitrary choice that

  • lets you focus on the genes meaningful to you, avoiding focusing on low expressed genes
  • lets you spend efficiently your "FDR budget" on the features that have better chance to be detected as DE

I tend to (re)read myself the vignette of DESeq2 in these cases. HTH Federico

federicomarini avatar Nov 15 '24 21:11 federicomarini