celltypist icon indicating copy to clipboard operation
celltypist copied to clipboard

Running celltypist with concatenated dataset

Open malonzm1 opened this issue 1 year ago • 3 comments

Hi,

I have datasets that I need to concatenate. Is it ok to run celltypist after concatenation or should I run it on the individual datasets before concatenation?

Thanks and good day.

Br,

malonzm1 avatar Jan 30 '24 11:01 malonzm1

@malonzm1, predicted_labels is not impacted, but majority_voting will be, due to the over-clustering step (different data structures will result in different clusterings).

ChuanXu1 avatar Jan 30 '24 23:01 ChuanXu1

@ChuanXu1 Thanks. Would you say that if I were interested in majority_voting it would be better to perform celltypist before concatenation?

malonzm1 avatar Jan 31 '24 14:01 malonzm1

@malonzm1, it depends. Usually, cells will be clustered in a better manifold when several datasets are combined and greater resolution is achieved, but sometimes clustering on a single dataset may reveal a better clustering shape (e.g., due to alleviated batch effect or optimal set of HVGs).

ChuanXu1 avatar Feb 02 '24 22:02 ChuanXu1

Should be resolved. Please reopen the issue if you have further questions.

ChuanXu1 avatar Aug 09 '24 22:08 ChuanXu1