scPower icon indicating copy to clipboard operation
scPower copied to clipboard

Assessing prior model validity

Open ndejay opened this issue 1 year ago • 5 comments

Hello,

Thanks for publishing this great method.

I am looking to generate new priors from a pilot data set consisting of a large number (>10K) of deeply sequenced cells (>100K reads/cell) from Chromium 10X technology, and had some questions about how to fine-tune model training w.r.t recapitulation of the original data.

Comparison of gamma mixed fits with original means. Is it normal for the gamma mixed fit to underestimate 0s and overestimate moderately expressed genes?

000019

Fit a function for UMI counts dependent on read depth: 1

Estimation of median dispersion function for each cell type. In the example in the vignette, no relation was found between dispersion and UMI counts. In my pilot data set, I am seeing one. Is there a way to account for this?

00000a

Validation of expression probability model. The predicted number of expressed genes are off by a significant margin.

2

Many thanks in advance!

Nic

ndejay avatar Dec 01 '23 17:12 ndejay