
Improve distance estimation

Open nickjcroucher opened this issue 3 years ago • 2 comments

It may be worth testing whether the pairwise distance corrections described in https://gitlab.pasteur.fr/GIPhy/JolyTree could be applied, as we are now observing core Hamming distances >0.1. As an initial test, the correction could be used in the tree visualisation, but it may also help in resolving within-strain and between-strain distances.

nickjcroucher avatar Nov 20 '20 17:11 nickjcroucher

Do you mean comparing equation 3 in the paper to the current Monte Carlo method, and/or using this method for drawing the tree?

johnlees avatar Nov 20 '20 17:11 johnlees

Sorry if I was unclear. Rather than correcting for false positive matches, this would be a correction for multiple substitutions occurring at the same site. It should leave small distances essentially unchanged but increase larger distances (which are systematically underestimated), which might help separate within-strain and between-strain distances where Hamming distances are large. At a simpler level, it might also improve the phylogenies.

nickjcroucher avatar Nov 20 '20 19:11 nickjcroucher
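
To illustrate the kind of correction being discussed, here is a minimal sketch of a standard multiple-substitution correction applied to an observed per-site Hamming distance. It assumes the Jukes-Cantor (JC69) model by default and, if a different `b` is passed, the equal-input (F81-style) form d = -b ln(1 - p/b). This is not necessarily the exact formula used by JolyTree, and the function name `corrected_distance` is hypothetical; nothing here is part of PopPUNK.

```python
import math


def corrected_distance(p, b=0.75):
    """Correct an observed proportion of differing sites `p` for
    multiple substitutions at the same site.

    Assumption: b = 0.75 gives the Jukes-Cantor (JC69) correction.
    Under an equal-input (F81-style) model, b would instead be
    1 - sum(pi_i ** 2) over the nucleotide frequencies pi_i.
    """
    if p < 0:
        raise ValueError("distance must be non-negative")
    if p >= b:
        # Saturated: the model cannot estimate a finite distance.
        return math.inf
    # log1p keeps precision for very small p.
    return -b * math.log1p(-p / b)


if __name__ == "__main__":
    # Small distances barely change; larger ones are stretched.
    for p in (0.001, 0.01, 0.1, 0.3):
        print(f"observed {p:.3f} -> corrected {corrected_distance(p):.4f}")
```

Under these assumptions, an observed distance of 0.001 is almost unchanged, while 0.1 increases to roughly 0.107, which matches the behaviour described above: little effect within strains, a larger effect between divergent strains.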