Negative values after pt.pp.dsb

Open ChaseTaylor939 opened this issue 1 year ago • 1 comments

Describe the bug Lots of my CITE-seq values are negative after running pt.pp.dsb. It seems like they shouldn't be below 0. Am I just misunderstanding how pt.pp.dsb works?

To Reproduce

import muon as mu
from muon import prot as pt  # assumed imports; not shown in the original report

BAL_9282_JC_1 = mu.read_10x_h5('/home/taylor81/BAL_Data/9282-JC-1/ec5337_tdi30000_fpr_0.01_epochs150_filtered.h5')
BAL_9282_JC_1_raw = mu.read_10x_h5('/home/taylor81/BAL_Data/9282-JC-1/raw_feature_bc_matrix.h5')
pt.pp.dsb(BAL_9282_JC_1, BAL_9282_JC_1_raw, random_state=1)

Expected behaviour I would expect no negative values, but I get negative values like the following example:

  Hu.CD101 Hu.CD103 Hu.CD105_43A3 Hu.CD107a Hu.CD112 Hu.CD119 Hu.CD11a Hu.CD11b Hu.CD11c Hu.CD122 Hu.CD123 Hu.CD124 Hu.CD127 Hu.CD13
CTGATCCCAATGGACG-1 -10.224 10.86788 -3.03904 -42.3412 -8.69139 -7.74294 -42.4369 -13.9509 -34.5928 -3.91137 -21.7118 -9.87767 -7.54718 9.31312


  • OS: 5.19.0-35-generic
  • Python 3.10.9

Additional context

Looking at the publication for the dsb tool, Figure 4a shows some negative values for dsb-normalized protein data, so it seems we should expect some negative values after normalization. Link to figure:
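To illustrate why negative values can be expected, here is a minimal sketch of the first dsb step only: log-transform the counts, then rescale each protein by the mean and standard deviation of the empty droplets. This is not muon's implementation. The function name `dsb_step1`, the pseudocount of 10, and the toy counts are all assumptions made for illustration, and the second dsb step (removing the per-cell technical component) is left out entirely.

```python
import math
import statistics


def dsb_step1(cell_counts, empty_counts, pseudocount=10):
    """Z-score log counts of one protein against empty-droplet background.

    Hypothetical helper for illustration; pseudocount=10 is an assumption.
    """
    log_empty = [math.log(c + pseudocount) for c in empty_counts]
    background_mean = statistics.mean(log_empty)
    background_sd = statistics.stdev(log_empty)
    return [
        (math.log(c + pseudocount) - background_mean) / background_sd
        for c in cell_counts
    ]


# Toy raw counts for a single protein (made up, not taken from this issue).
empty = [3, 5, 4, 6, 2, 5]  # ambient signal in empty droplets
cells = [0, 4, 250]         # below background, near background, clearly positive

normalized = dsb_step1(cells, empty)
print([round(v, 2) for v in normalized])
```

Any cell whose log count falls below the empty-droplet mean ends up below 0, so under these assumptions a negative value means "below background" rather than a negative count.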

