nanocompore icon indicating copy to clipboard operation
nanocompore copied to clipboard

Downsample to the same coverage as the lower coverage sample

Open tleonardi opened this issue 5 years ago • 2 comments

Very different coverage between samples might causes troubles during GMM clustering. At the moment we can downsample to a fixed coverage level, but it would be great to downsample each reference independently so that the sample with higher coverage is brought down to the same level as the sample with lower coverage.

tleonardi avatar Apr 05 '19 08:04 tleonardi

Are you thinking of doing it on a position per position fashion ?

a-slide avatar Apr 05 '19 08:04 a-slide

I would actually suggest to move the reference level downsampling from Whitelist directly to txComp at position level. The advantage of having it in the whitelist is that is saves time as we don't read the data discarded by the downsampling.

a-slide avatar Apr 09 '19 09:04 a-slide