diffxpy icon indicating copy to clipboard operation
diffxpy copied to clipboard

input expression matrix

Open wangjiawen2013 opened this issue 6 years ago • 1 comments

Dear, In the case of diffxpy (eg wald test, de.test.wald), what the input matrix should be ? raw counts or normalized data ? And, I found that diffxpy ran very slowly when the dataset is large ( 10,000 cells and 20,000 genes ), then how to speed up it ?

wangjiawen2013 avatar Nov 04 '19 03:11 wangjiawen2013

Hi @wangjiawen2013, this depends on the noise model that you want to use. For most scenarios, I recommend negative binomial noise and for that you should provided unprocessed counts, ie integer values that come out of the aligner. Let me know if that is unclear. We are working on the speed issue, we are introducing new estimation backends this week, I will comment again in this issue then! Would you mind sharing your diffxpy call, or just your formula_loc? Thanks! Best, David

davidsebfischer avatar Nov 11 '19 15:11 davidsebfischer