remora icon indicating copy to clipboard operation
remora copied to clipboard

Support for Standalone 5mC and/or 5mCG in dorado Basecalling Model Version 4.3.0

Open wietingj opened this issue 2 months ago • 1 comments

Hello everyone,

as of dorado basecalling model version 4.3.0, only 5mC_5hmC or 5mCG_5hmCG modified bases are supported, not 5mC or 5mCG alone. This leads to problems in the downstream analysis, especially in the statistical evaluation of differential methylation, as 5mC and 5hmC are then combined into one count. For modkit this is addressed in its limitations (https://nanoporetech.github.io/modkit/limitations.html), but the same problem exists in e.g. NanoMethViz / DSS etc. as well.

As far as I know, there is currently no option for separate statistical evaluation of the respective modifiers from a combined 5mC_5hmC modbam file, so it would be desirable if standalone 5mC or 5mCG could also be supported in the current model versions. Or is there an option I am missing?

Thanks for your feedback.

wietingj avatar May 03 '24 13:05 wietingj

I think the modkit command modkit adjust-mods --ignore h is the command for which you are searching. Please let me know if this does not resolve this issue.

For the further question of "separate statistical evaluation of the respective modifiers from a combined 5mC_5hmC modbam file", I'm not quite sure I understand what you mean. Could you expand on this a bit further?

marcus1487 avatar May 03 '24 13:05 marcus1487

Hopefully this has resolved your issue. If you have further questions please reopen this issue.

marcus1487 avatar May 22 '24 11:05 marcus1487