CITE-seq-Count

Filtering UMIs on read number

Open rule-110 opened this issue 3 years ago • 1 comment

I have a very deeply sequenced barcode library and I expect a substantial number of UMIs to not be real. We expect those "fake" UMIs to have very low read numbers. Is it possible to select only the UMIs that pass a certain read threshold? Also, is it possible to get the distribution, like a histogram, of the number of reads per UMI?

Thanks a lot

rule-110 avatar Apr 13 '21 15:04 rule-110

Hello @rule-110,

There is no thresholding at the moment, mainly because I would rather have raw results come out so that users can make their own choices in post-processing.
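A minimal sketch of what such post-processing could look like, in case it helps. This is not part of CITE-seq-Count: it assumes you have extracted one `(cell_barcode, tag, umi)` tuple per read yourself, and the names `reads_per_umi`, `read_count_histogram`, `filter_umis` and `umi_counts_per_cell_tag` are hypothetical helpers made up for the illustration.

```python
from collections import Counter

# ASSUMPTION: hypothetical input, one (cell_barcode, tag, umi) tuple per read.
# CITE-seq-Count does not produce this list; it only illustrates the idea.
reads = [
    ("CELL1", "CD3", "AAAA"),
    ("CELL1", "CD3", "AAAA"),
    ("CELL1", "CD3", "AAAA"),
    ("CELL1", "CD3", "AAAT"),
    ("CELL1", "CD4", "CCCC"),
    ("CELL1", "CD4", "CCCC"),
    ("CELL2", "CD3", "GGGG"),
]


def reads_per_umi(reads):
    """Count how many reads support each (cell, tag, UMI) combination."""
    return Counter(reads)


def read_count_histogram(umi_reads):
    """Map 'number of reads' -> 'number of UMIs with that many reads'."""
    return dict(sorted(Counter(umi_reads.values()).items()))


def filter_umis(umi_reads, min_reads):
    """Keep only the UMIs supported by at least min_reads reads."""
    return {umi: n for umi, n in umi_reads.items() if n >= min_reads}


def umi_counts_per_cell_tag(umi_reads):
    """Count the distinct UMIs remaining for each (cell, tag) pair."""
    counts = Counter()
    for cell, tag, _umi in umi_reads:
        counts[(cell, tag)] += 1
    return dict(counts)


umi_reads = reads_per_umi(reads)
histogram = read_count_histogram(umi_reads)
kept = filter_umis(umi_reads, min_reads=2)
filtered_counts = umi_counts_per_cell_tag(kept)

print(histogram)        # {1: 2, 2: 1, 3: 1}
print(filtered_counts)  # {('CELL1', 'CD3'): 1, ('CELL1', 'CD4'): 1}
```

The histogram is what you would inspect to pick `min_reads`; the value of 2 used here is arbitrary.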

For the moment, I would suggest a downsampling approach to get rid of the low-read UMIs.
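A rough sketch of the downsampling idea, under the same assumption of a hypothetical per-read list of `(cell_barcode, tag, umi)` tuples; `downsample_reads` and `count_unique_umis` are illustrative names, not CITE-seq-Count functions. Note that downsampling only makes it less likely that a low-read UMI survives (a UMI seen in a single read is kept with probability roughly equal to the sampling fraction), so it is not a hard threshold.

```python
import random


def downsample_reads(reads, fraction, seed=0):
    """Randomly keep a fraction of the reads, without replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n_keep = int(len(reads) * fraction)
    return rng.sample(reads, n_keep)


def count_unique_umis(reads):
    """Number of distinct (cell, tag, UMI) combinations among the reads."""
    return len(set(reads))


# ASSUMPTION: toy data with one well-supported UMI (100 reads) and
# ten UMIs that are each seen in a single read.
reads = [("CELL1", "CD3", "AAAA")] * 100
reads += [("CELL1", "CD3", f"ERR{i}") for i in range(10)]

sampled = downsample_reads(reads, fraction=0.1)

print(len(reads), count_unique_umis(reads))
print(len(sampled), count_unique_umis(sampled))
```

In practice the downsampling would be done on the FASTQ files before running the tool; the list-based version above only shows the effect on UMI counts.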

Hoohm avatar Apr 20 '21 07:04 Hoohm