modkit icon indicating copy to clipboard operation
modkit copied to clipboard

Different number of line in the bedMethyl table among samples

Open HeleneMeMartin opened this issue 2 years ago • 1 comments

Hello,

I process 6 different samples that I mapped against the same reference genome (the mitochondria) using Dorado. Using Samtool coverage, I checked that all my sample covered 100% of my genome with with a minimum depth of 1. I then used modkit to generate a bedMethyl table and it turns out that my 6 samples have different number of rows. Using modkit, I used the option --filter-threshold.

Does bedMethyl table output only site that have at least 1 called higher than the threshold ? So that there is no lines like Nmod = 0 Ncanonical = 0 Nother mod = 0 Nvalid_cov = 0 Nfail = 15

Thank you !

HeleneMeMartin avatar Nov 07 '23 17:11 HeleneMeMartin

Hello @HeleneMeMartin

Does bedMethyl table output only site that have at least 1 called higher than the threshold ? So that there is no lines like Nmod = 0 Ncanonical = 0 Nother mod = 0 Nvalid_cov = 0 Nfail = 15

That's correct, as mentioned in the documentation, rows with Nvalid_cov = 0 are not reported. If you have a use case where this is a problem I could add a flag in the next release that outputs positions where all of the modification calls fall below the pass threshold.

ArtRand avatar Nov 07 '23 17:11 ArtRand